connect.city.ac.uk
robots.txt

Robots Exclusion Standard data for connect.city.ac.uk

Resource Scan

Scan Details

Site Domain connect.city.ac.uk
Base Domain city.ac.uk
Scan Status Ok
Last Scan2025-07-15T02:13:04+00:00
Next Scan 2025-08-14T02:13:04+00:00

Last Scan

Scanned2025-07-15T02:13:04+00:00
URL https://connect.city.ac.uk/robots.txt
Redirect https://connect.citystgeorges.ac.uk/robots.txt
Redirect Domain connect.citystgeorges.ac.uk
Redirect Base citystgeorges.ac.uk
Domain IPs 2.58.104.10, 2.58.104.11
Redirect IPs 2.58.104.10, 2.58.104.11
Response IP 2.58.104.10
Found Yes
Hash 0c8d9ffd8905b040e66a8261a4529616816f851656ccc6601603a20194313f10
SimHash c11816d22493

Groups

*

Rule Path
Disallow /documentation
Disallow /forms
Disallow /staging
Disallow /web-services
Disallow /*?SQ_VARIATION*
Disallow /*?dev=*
Disallow /*?rel=*
Disallow /_media

semrushbot-sa

Rule Path
Allow /

funnelback

Rule Path
Allow /

Other Records

Field Value
sitemap https://connect.citystgeorges.ac.uk/sitemap.xml