gcu.ac.uk
robots.txt

Robots Exclusion Standard data for gcu.ac.uk

Resource Scan

Scan Details

Site Domain gcu.ac.uk
Base Domain gcu.ac.uk
Scan Status Ok
Last Scan 2024-06-22T13:56:34+00:00
Next Scan 2024-07-22T13:56:34+00:00

Last Scan

Scanned 2024-06-22T13:56:34+00:00
URL https://gcu.ac.uk/robots.txt
Redirect https://www.gcu.ac.uk/robots.txt
Redirect Domain www.gcu.ac.uk
Redirect Base gcu.ac.uk
Domain IPs 185.64.253.15
Redirect IPs 185.64.253.15
Response IP 185.64.253.15
Found Yes
Hash 772248148debd5693c698929f3fc83b13ce7c393c3296532de75969e135b9961
SimHash 3525600c8ed3

Groups

*

Rule Path
Disallow /_designs/
Disallow /*?sq_content_src=
Disallow /*_recache
Disallow /*_edit
Disallow /*_admin
Disallow /*_login
Disallow /*_performance
Disallow /*_design
Disallow /*_web_services
Disallow /*_feeds
Disallow /search
Disallow /migration/
Disallow /squiz-test/
Disallow /digitaldesign/
Disallow /digital-design/
Disallow /training/
Disallow /gcu-test/
Disallow /gcutest/
Disallow /testing/
Disallow /brandhub/
Disallow /brand-hub/
Disallow /drafts/
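
Several of the paths above use the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how a crawler might test a URL path against these rules, assuming the wildcard semantics of RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end, and every rule is otherwise a prefix match). The function names `pattern_to_regex` and `is_disallowed` are illustrative and not part of any library; actual crawlers may differ in detail.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW = [
    "/_designs/", "/*?sq_content_src=", "/*_recache", "/*_edit",
    "/*_admin", "/*_login", "/*_performance", "/*_design",
    "/*_web_services", "/*_feeds", "/search", "/migration/",
    "/squiz-test/", "/digitaldesign/", "/digital-design/", "/training/",
    "/gcu-test/", "/gcutest/", "/testing/", "/brandhub/",
    "/brand-hub/", "/drafts/",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumed semantics (RFC 9309 style): '*' matches any sequence of
    characters, a trailing '$' anchors the end of the path, and the
    pattern otherwise matches as a prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with '.*' where '*' appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches the path.

    The group has no Allow rules, so no longest-match tie-breaking
    between Allow and Disallow is needed in this sketch.
    """
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)
```

Because rules are prefix matches, `/search` also covers paths such as `/searching`, while `/drafts/` covers only paths beneath that directory and not `/drafts` itself.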

Other Records

Field Value
sitemap https://www.gcu.ac.uk/sitemap.xml

Comments

  • Disallow some matrix defaults
  • GCU added
  • Sitemap