cia.org.uk
robots.txt

Robots Exclusion Standard data for cia.org.uk

Resource Scan

Scan Details

Site Domain cia.org.uk
Base Domain cia.org.uk
Scan Status Ok
Last Scan 2025-10-16T21:34:34+00:00
Next Scan 2025-11-15T21:34:34+00:00
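
The gap between the two timestamps above can be checked with a short sketch. It only does date arithmetic on the values shown in this report; the variable names are illustrative, and it does not claim to describe how the scanner schedules its scans.

```python
from datetime import datetime

# Timestamps copied verbatim from the Scan Details block.
last_scan = datetime.fromisoformat("2025-10-16T21:34:34+00:00")
next_scan = datetime.fromisoformat("2025-11-15T21:34:34+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan
print(interval.days)  # 30
```

For this record, the next scan falls exactly 30 days after the last one.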

Last Scan

Scanned 2025-10-16T21:34:34+00:00
URL https://cia.org.uk/robots.txt
Redirect https://www.cia.org.uk:443/robots.txt
Redirect Domain www.cia.org.uk
Redirect Base cia.org.uk
Domain IPs 52.223.11.33
Redirect IPs 35.71.136.153, 52.223.11.33
Response IP 35.71.136.153
Found Yes
Hash 4d9ce23679835866289d887bccaab3d9da4ce3178037b21779ad53d2a4440929
SimHash 658d02031914
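
The Hash value above is 64 hexadecimal digits long, which matches the length of a SHA-256 digest. The report does not name the algorithm, so the sketch below is written under that assumption, and under the further assumption that the digest covers the raw response body. The helper names sha256_hex and matches_report are hypothetical.

```python
import hashlib

# Hash value copied verbatim from the Last Scan block.
REPORTED_HASH = "4d9ce23679835866289d887bccaab3d9da4ce3178037b21779ad53d2a4440929"

def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()

def matches_report(body: bytes) -> bool:
    """True if the body hashes to the value in the report.
    Assumes the report hashes the unmodified response body with SHA-256."""
    return sha256_hex(body) == REPORTED_HASH
```

To use it, pass in the bytes of a freshly fetched robots.txt. A False result may mean the file changed since the scan, or simply that one of the assumptions above does not hold.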

Groups

*

Rule Path
Disallow /register$
Disallow /subscribe$
Disallow /searchresults
Disallow /solrsearchresults
Disallow /sign-out
Disallow /sign-in
Disallow /attachment$
Disallow /reporttomoderator
Disallow /download$
Disallow /forgotten-password
Disallow /my-account/*
Disallow /my-account
Disallow /*.publicprofile
Disallow /ajax/*
Disallow /api/*
Disallow /mysaved
Disallow /bookmark$
Disallow /*?userid=
Disallow /*?uc=
Disallow /sitedashboard/*
Disallow /CookiePolicy.aspx
Disallow /SystemCheck.aspx
Disallow /Attachments_Advert.aspx
Disallow /navsectionFullRSS.aspx
Disallow /*.fullrss
Disallow /navsectionRSS.aspx
Disallow /*.rss
Disallow /swagger/*
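
Several of the paths above use the two special characters common in robots.txt patterns: "*" for any run of characters and a trailing "$" to anchor the end of the URL path. The sketch below shows how such patterns are usually interpreted, using a handful of rules copied from the table. It is a simplified matcher, not the logic of any particular crawler: it ignores Allow rules and longest-match precedence, which is enough here only because this group contains Disallow rules alone. The function names are illustrative.

```python
import re

# A subset of the Disallow paths listed above, copied verbatim.
DISALLOW = ["/register$", "/searchresults", "/my-account/*", "/*.rss", "/*?userid="]

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start.
    '*' matches any run of characters; a trailing '$' anchors the end."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape the literal pieces and join them with '.*' for each wildcard.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + regex + ("$" if anchored else ""))

def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """True if any Disallow pattern matches the path (and query string)."""
    return any(pattern_to_regex(p).match(path) for p in patterns)

print(is_disallowed("/register"))          # True: exact match required by '$'
print(is_disallowed("/register/step2"))    # False: '$' blocks longer paths
print(is_disallowed("/searchresults?q=x")) # True: plain prefix match
print(is_disallowed("/news/feed.rss"))     # True: '*' spans '/news/feed'
```

Python's standard urllib.robotparser module does prefix matching only and does not interpret "*" or "$" in paths, which is why the sketch builds its own regexes.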

Other Records

Field Value
sitemap https://www.cia.org.uk/GoogleSiteMapIndex.aspx

Comments

  • when go live remove line below
  • Disallow: /
  • when go live uncomment line below