highcrest.bucks.sch.uk
robots.txt

Robots Exclusion Standard data for highcrest.bucks.sch.uk

Resource Scan

Scan Details

Site Domain highcrest.bucks.sch.uk
Base Domain highcrest.bucks.sch.uk
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-10-22T09:46:19+00:00
Next Scan 2026-01-20T09:46:19+00:00

Last Successful Scan

Scanned2023-06-28T09:48:53+00:00
URL https://highcrest.bucks.sch.uk/robots.txt
Domain IPs 109.228.48.205
Response IP 109.228.48.205
Found Yes
Hash c12d9380a4a75c0005b56494e567871dfea40d8660dd0747f10b732d55f8c037
SimHash 39155c11c711

Groups

*

Rule Path
Disallow /admin/
Disallow /bin/
Disallow /Connections/
Allow /i/
Disallow /inc/
Disallow /docs/
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.xls$
Disallow /*.docx$
Allow /inc/gallery/
Allow /i/photos/Gallery/

Other Records

Field Value
sitemap https://highcrestacademy.greenhousecms.co.uk/sitemap.xml