corpuschristi.oldham.sch.uk
robots.txt
Robots Exclusion Standard data for corpuschristi.oldham.sch.uk
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | corpuschristi.oldham.sch.uk |
| Base Domain | corpuschristi.oldham.sch.uk |
| Scan Status | Ok |
| Last Scan | 2025-11-26T19:24:17+00:00 |
| Next Scan | 2025-12-10T19:24:17+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-11-26T19:24:17+00:00 |
| URL | https://corpuschristi.oldham.sch.uk/robots.txt |
| Redirect | https://www.corpuschristi.oldham.sch.uk/robots.txt |
| Redirect Domain | www.corpuschristi.oldham.sch.uk |
| Redirect Base | corpuschristi.oldham.sch.uk |
| Domain IPs | 109.228.40.216 |
| Redirect IPs | 109.228.40.216 |
| Response IP | 109.228.40.216 |
| Found | Yes |
| Hash | 329f89e6a8d7a14548ae2a24136e5527c7a102da5f3e3ba3e616ace6b698119d |
| SimHash | 79155c13c791 |
Groups
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /admin/ |
| Disallow | /bin/ |
| Disallow | /Connections/ |
| Allow | /i/ |
| Disallow | /inc/ |
| Disallow | /docs/ |
| Disallow | /*.pdf$ |
| Disallow | /*.doc$ |
| Disallow | /*.xls$ |
| Disallow | /*.docx$ |
| Allow | /inc/gallery/ |
| Allow | /i/photos/Gallery/ |
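The group above mixes overlapping Allow and Disallow rules, so the outcome for a given path depends on how a crawler resolves conflicts. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie) and the common `*` and `$` wildcard extensions. The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative and not part of the scan data, and individual crawlers may behave differently.

```python
import re

# Rules copied from the "*" group in the table above, in listed order.
RULES = [
    ("disallow", "/admin/"),
    ("disallow", "/bin/"),
    ("disallow", "/Connections/"),
    ("allow", "/i/"),
    ("disallow", "/inc/"),
    ("disallow", "/docs/"),
    ("disallow", "/*.pdf$"),
    ("disallow", "/*.doc$"),
    ("disallow", "/*.xls$"),
    ("disallow", "/*.docx$"),
    ("allow", "/inc/gallery/"),
    ("allow", "/i/photos/Gallery/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end
    of the path. Everything else is matched literally (case-sensitive).
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under the given rules.

    Assumption: the longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt requires.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for p in ["/inc/header.php", "/inc/gallery/a.jpg", "/i/report.pdf"]:
        print(p, is_allowed(p))
```

Under these assumptions, `/inc/gallery/` stays crawlable even though `/inc/` is disallowed, because the Allow pattern is longer. A PDF directly under `/i/` would be blocked, since `/*.pdf$` is longer than `/i/`, while a PDF under `/i/photos/Gallery/` would be allowed.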
Other Records
| Field | Value |
|---|---|
| sitemap | http://corpuschristircps.greenhousecms.co.uk/sitemap.xml |