thomasreadeschool.org
robots.txt

Robots Exclusion Standard data for thomasreadeschool.org

Resource Scan

Scan Details

Site Domain thomasreadeschool.org
Base Domain thomasreadeschool.org
Scan Status Ok
Last Scan2025-09-20T08:22:50+00:00
Next Scan 2025-10-04T08:22:50+00:00

Last Scan

Scanned2025-09-20T08:22:50+00:00
URL https://www.thomasreadeschool.org/robots.txt
Domain IPs 88.208.240.47
Response IP 88.208.240.47
Found Yes
Hash 5cc950d39ca5190e23dd671093188ca8c0ae262ff605408e0876693473d7ec58
SimHash 79155c11cf11

Groups

*

Rule Path
Disallow /admin/
Disallow /bin/
Disallow /Connections/
Allow /i/
Disallow /inc/
Disallow /docs/
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.xls$
Disallow /*.docx$
Allow /inc/gallery/
Allow /i/photos/Gallery/

Other Records

Field Value
sitemap http://thomasreade.greenhousecms.co.uk/sitemap.xml