wheatfieldsjm.herts.sch.uk
robots.txt

Robots Exclusion Standard data for wheatfieldsjm.herts.sch.uk

Resource Scan

Scan Details

Site Domain wheatfieldsjm.herts.sch.uk
Base Domain wheatfieldsjm.herts.sch.uk
Scan Status Ok
Last Scan 2025-11-04T04:49:22+00:00
Next Scan 2025-11-18T04:49:22+00:00

Last Scan

Scanned 2025-11-04T04:49:22+00:00
URL https://wheatfieldsjm.herts.sch.uk/robots.txt
Domain IPs 213.171.204.221
Response IP 213.171.204.221
Found Yes
Hash aa9c6e0d9d8a72c6b33ba47bf9eb5f6587e03ceec37fa9016edb637d3c180b78
SimHash 79155c13cf01

Groups

*

Rule Path
Disallow /admin/
Disallow /bin/
Disallow /Connections/
Allow /i/
Disallow /inc/
Disallow /docs/
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.xls$
Disallow /*.docx$
Allow /inc/gallery/
Allow /i/photos/Gallery/
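
Because the Allow and Disallow rules above overlap (for example /inc/ is disallowed while /inc/gallery/ is allowed), the outcome for a given path depends on how a crawler resolves conflicts. The sketch below is illustrative only and is not part of the scan. It assumes the longest-match precedence described in RFC 9309, with Allow winning ties, and it uses pattern length as the measure of specificity. Individual crawlers may behave differently. The names RULES, pattern_to_regex and is_allowed are hypothetical helpers, not anything published by the site or the scanner.

```python
import re

# Rules transcribed from the "*" group above, in the order listed.
RULES = [
    ("Disallow", "/admin/"),
    ("Disallow", "/bin/"),
    ("Disallow", "/Connections/"),
    ("Allow", "/i/"),
    ("Disallow", "/inc/"),
    ("Disallow", "/docs/"),
    ("Disallow", "/*.pdf$"),
    ("Disallow", "/*.doc$"),
    ("Disallow", "/*.xls$"),
    ("Disallow", "/*.docx$"),
    ("Allow", "/inc/gallery/"),
    ("Allow", "/i/photos/Gallery/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally and case-sensitively.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under the assumed precedence.

    Assumption: the matching rule with the longest pattern wins, and
    Allow wins when two matching patterns have equal length.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt requires.
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "Allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


if __name__ == "__main__":
    for sample in [
        "/inc/header.php",
        "/inc/gallery/photo.jpg",
        "/docs/newsletter.pdf",
        "/i/photos/Gallery/map.pdf",
        "/i/map.pdf",
        "/index.html",
    ]:
        print(sample, "allowed" if is_allowed(sample) else "disallowed")
```

Under these assumptions, /i/photos/Gallery/map.pdf comes out as allowed because the Allow pattern is longer than /*.pdf$, whereas /i/map.pdf is disallowed because /*.pdf$ is longer than /i/. Note also that /Connections/ is case-sensitive, so /connections/ would not match it.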

Other Records

Field Value
sitemap http://www.wheatfieldsjm.herts.sch.uk/sitemap.xml