corpuschristi.oldham.sch.uk
robots.txt

Robots Exclusion Standard data for corpuschristi.oldham.sch.uk

Resource Scan

Scan Details

Site Domain corpuschristi.oldham.sch.uk
Base Domain corpuschristi.oldham.sch.uk
Scan Status Ok
Last Scan2025-11-26T19:24:17+00:00
Next Scan 2025-12-10T19:24:17+00:00

Last Scan

Scanned2025-11-26T19:24:17+00:00
URL https://corpuschristi.oldham.sch.uk/robots.txt
Redirect https://www.corpuschristi.oldham.sch.uk/robots.txt
Redirect Domain www.corpuschristi.oldham.sch.uk
Redirect Base corpuschristi.oldham.sch.uk
Domain IPs 109.228.40.216
Redirect IPs 109.228.40.216
Response IP 109.228.40.216
Found Yes
Hash 329f89e6a8d7a14548ae2a24136e5527c7a102da5f3e3ba3e616ace6b698119d
SimHash 79155c13c791

Groups

*

Rule Path
Disallow /admin/
Disallow /bin/
Disallow /Connections/
Allow /i/
Disallow /inc/
Disallow /docs/
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.xls$
Disallow /*.docx$
Allow /inc/gallery/
Allow /i/photos/Gallery/

Other Records

Field Value
sitemap http://corpuschristircps.greenhousecms.co.uk/sitemap.xml