highcrest.bucks.sch.uk
robots.txt
Robots Exclusion Standard data for highcrest.bucks.sch.uk
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | highcrest.bucks.sch.uk |
| Base Domain | highcrest.bucks.sch.uk |
| Scan Status | Failed |
| Failure Stage | Fetching resource. |
| Failure Reason | Couldn't connect to server. |
| Last Scan | 2025-10-22T09:46:19+00:00 |
| Next Scan | 2026-01-20T09:46:19+00:00 |
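The gap between the Last Scan and Next Scan timestamps can be checked with a little date arithmetic. The sketch below only parses the two values shown in the table. Reading the result as a fixed rescan interval is an assumption, since the report does not state its scheduling policy.

```python
from datetime import datetime

# Timestamps copied verbatim from the Scan Details table.
last_scan = datetime.fromisoformat("2025-10-22T09:46:19+00:00")
next_scan = datetime.fromisoformat("2026-01-20T09:46:19+00:00")

# Difference between the scheduled next scan and the last (failed) scan.
interval = next_scan - last_scan
print(interval.days)  # 90
```

The two timestamps are exactly 90 days apart, down to the second.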
Last Successful Scan
| Field | Value |
|---|---|
| Scanned | 2023-06-28T09:48:53+00:00 |
| URL | https://highcrest.bucks.sch.uk/robots.txt |
| Domain IPs | 109.228.48.205 |
| Response IP | 109.228.48.205 |
| Found | Yes |
| Hash | c12d9380a4a75c0005b56494e567871dfea40d8660dd0747f10b732d55f8c037 |
| SimHash | 39155c11c711 |
Groups
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /admin/ |
| Disallow | /bin/ |
| Disallow | /Connections/ |
| Allow | /i/ |
| Disallow | /inc/ |
| Disallow | /docs/ |
| Disallow | /*.pdf$ |
| Disallow | /*.doc$ |
| Disallow | /*.xls$ |
| Disallow | /*.docx$ |
| Allow | /inc/gallery/ |
| Allow | /i/photos/Gallery/ |
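Several of these rules overlap (for example `/inc/` is disallowed while `/inc/gallery/` is allowed), so the outcome for a given path depends on how a crawler resolves conflicts. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309: the matching rule with the longest pattern wins, and an Allow wins a tie. The helper names `pattern_to_regex` and `is_allowed` are hypothetical, pattern length is measured in characters as an approximation, and real crawlers may differ in details such as percent-encoding.

```python
import re

# Rules for the "*" group, copied from the table above in file order.
RULES = [
    ("disallow", "/admin/"),
    ("disallow", "/bin/"),
    ("disallow", "/Connections/"),
    ("allow", "/i/"),
    ("disallow", "/inc/"),
    ("disallow", "/docs/"),
    ("disallow", "/*.pdf$"),
    ("disallow", "/*.doc$"),
    ("disallow", "/*.xls$"),
    ("disallow", "/*.docx$"),
    ("allow", "/inc/gallery/"),
    ("allow", "/i/photos/Gallery/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and allow
            if longer or tie_for_allow:
                best_len = len(pattern)
                best_allow = allow
    return best_allow


print(is_allowed("/inc/header.php"))          # False: only /inc/ matches
print(is_allowed("/inc/gallery/1.jpg"))       # True: /inc/gallery/ is longer than /inc/
print(is_allowed("/i/brochure.pdf"))          # False: /*.pdf$ is longer than /i/
print(is_allowed("/i/photos/Gallery/a.pdf"))  # True: the gallery Allow is the longest match
```

Under this interpretation, PDFs inside `/i/photos/Gallery/` remain crawlable even though `/*.pdf$` blocks them elsewhere, because the gallery pattern is longer. Matching is case-sensitive, so `/connections/` would not be covered by the `/Connections/` rule.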
Other Records
| Field | Value |
|---|---|
| sitemap | https://highcrestacademy.greenhousecms.co.uk/sitemap.xml |