stjohnscollege.co.uk
robots.txt
Robots Exclusion Standard data for stjohnscollege.co.uk
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | stjohnscollege.co.uk |
Base Domain | stjohnscollege.co.uk |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2024-08-22T07:22:03+00:00 |
Next Scan | 2024-11-20T07:22:03+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2022-10-08T04:12:57+00:00 |
URL | https://stjohnscollege.co.uk/robots.txt |
Redirect | https://www.stjohnscollege.co.uk/robots.txt |
Redirect Domain | www.stjohnscollege.co.uk |
Redirect Base | stjohnscollege.co.uk |
Response IP | 185.35.248.205 |
Found | Yes |
Hash | fef6b36c01647a7a7b141070b3c8877c8c4a14c3f9839995abc65cf700e0f4bd |
SimHash | 28009d094775 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
crawl-delay | 600 |
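
As a rough illustration of what these records mean for a crawler, the sketch below feeds an assumed reconstruction of the file to Python's standard `urllib.robotparser`. The report does not show the original file text, so the `ROBOTS_TXT` string, the placement of `Crawl-delay` inside the `*` group, and the `ExampleBot` user-agent name are all assumptions made for this example.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction from the parsed records above; the real file's
# exact text and line order are not shown in the scan report.
ROBOTS_TXT = """\
User-agent: *
Disallow: /
Crawl-delay: 600
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# "ExampleBot" is a hypothetical crawler name; it falls under the "*" group.
can_fetch_home = parser.can_fetch(
    "ExampleBot", "https://www.stjohnscollege.co.uk/"
)
delay = parser.crawl_delay("ExampleBot")

print("may fetch site root:", can_fetch_home)
print("crawl delay (seconds):", delay)
```

Under these assumptions, `Disallow: /` blocks every path for all user agents, and the crawl-delay value is read as 600 seconds between requests. Crawl-delay is a non-standard field, so individual crawlers may ignore it or interpret it differently.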
Comments