cavendish.hull.sch.uk
robots.txt

Robots Exclusion Standard data for cavendish.hull.sch.uk

Resource Scan

Scan Details

Site Domain cavendish.hull.sch.uk
Base Domain cavendish.hull.sch.uk
Scan Status Ok
Last Scan2025-09-08T09:25:05+00:00
Next Scan 2025-09-22T09:25:05+00:00

Last Scan

Scanned2025-09-08T09:25:05+00:00
URL https://cavendish.hull.sch.uk/robots.txt
Redirect https://www.cavendish.hull.sch.uk/robots.txt
Redirect Domain www.cavendish.hull.sch.uk
Redirect Base cavendish.hull.sch.uk
Domain IPs 109.228.48.205
Redirect IPs 109.228.48.205
Response IP 109.228.48.205
Found Yes
Hash 85b77f5262cd4c0d9199026a129925021bb9fbe5300d69987545c7b4ea2f2e3b
SimHash 39155c13cf91

Groups

*

Rule Path
Disallow /admin/
Disallow /bin/
Disallow /Connections/
Allow /i/
Disallow /inc/
Disallow /docs/
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.xls$
Disallow /*.docx$
Allow /inc/gallery/
Allow /i/photos/Gallery/

Other Records

Field Value
sitemap https://www.cavendish.hull.sch.uk/sitemap.xml