woodlands.herts.sch.uk
robots.txt

Robots Exclusion Standard data for woodlands.herts.sch.uk

Resource Scan

Scan Details

Site Domain woodlands.herts.sch.uk
Base Domain woodlands.herts.sch.uk
Scan Status Ok
Last Scan2025-12-03T05:29:29+00:00
Next Scan 2025-12-17T05:29:29+00:00

Last Scan

Scanned2025-12-03T05:29:29+00:00
URL https://woodlands.herts.sch.uk/robots.txt
Domain IPs 88.208.230.52
Response IP 88.208.230.52
Found Yes
Hash ee3e41b721e7a26199758a02b10158524e4598e5f2e1ba151c5fe0c684feeb87
SimHash 79155c13c701

Groups

*

Rule Path
Disallow /admin/
Disallow /bin/
Disallow /Connections/
Allow /i/
Disallow /inc/
Disallow /docs/
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.xls$
Disallow /*.docx$
Allow /inc/gallery/
Allow /i/photos/Gallery/

Other Records

Field Value
sitemap https://www.woodlands.herts.sch.uk/sitemap.xml