globeschool.org.uk
robots.txt

Robots Exclusion Standard data for globeschool.org.uk

Resource Scan

Scan Details

Site Domain globeschool.org.uk
Base Domain globeschool.org.uk
Scan Status Ok
Last Scan 2025-09-21T09:58:06+00:00
Next Scan 2025-10-05T09:58:06+00:00

Last Scan

Scanned 2025-09-21T09:58:06+00:00
URL https://globeschool.org.uk/robots.txt
Domain IPs 88.208.200.194
Response IP 88.208.200.194
Found Yes
Hash ffcbcb42af8262b0244be38e7deab2934efbe47be75fdcbcc63f8d4c76fd3e06
SimHash 39155c11cf81

Groups

*

Rule Path
Disallow /admin/
Disallow /bin/
Disallow /Connections/
Allow /i/
Disallow /inc/
Disallow /docs/
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.xls$
Disallow /*.docx$
Allow /inc/gallery/
Allow /i/photos/Gallery/
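
The group above mixes broad Disallow rules with narrower Allow rules, so the outcome for a given URL depends on which rule takes precedence. The sketch below is one way to evaluate the listed rules, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie) and measuring specificity by pattern length. The names RULES, pattern_to_regex and is_allowed are illustrative and not part of the scan data; real crawlers may resolve these rules differently.

```python
import re

# Rules copied from the "*" group above, in file order.
RULES = [
    ("disallow", "/admin/"),
    ("disallow", "/bin/"),
    ("disallow", "/Connections/"),
    ("allow", "/i/"),
    ("disallow", "/inc/"),
    ("disallow", "/docs/"),
    ("disallow", "/*.pdf$"),
    ("disallow", "/*.doc$"),
    ("disallow", "/*.xls$"),
    ("disallow", "/*.docx$"),
    ("allow", "/inc/gallery/"),
    ("allow", "/i/photos/Gallery/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled (assumed longest-match rule)."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt requires.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            # Longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for p in ["/inc/header.php", "/inc/gallery/photo.jpg",
              "/i/leaflet.pdf", "/i/photos/Gallery/flyer.pdf"]:
        print(p, is_allowed(p))
```

Under these assumptions, /inc/gallery/ stays crawlable even though /inc/ is disallowed, and a PDF under /i/photos/Gallery/ is allowed because that Allow pattern is longer than /*.pdf$. Paths are matched case-sensitively, so /connections/ is not covered by the /Connections/ rule.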

Other Records

Field Value
sitemap https://globeschool.org.uk.88-208-200-194.greenschoolsonline.co.uk/sitemap.xml
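
The sitemap record points to a host that differs from the scanned site domain. Whether that matters depends on the consumer, so the sketch below only reports the difference and does not judge it. The function name sitemap_host_matches is hypothetical, and treating subdomains of the site domain as a match is an assumption made for illustration.

```python
from urllib.parse import urlsplit

# Values taken from the scan details and the sitemap record above.
SITE_DOMAIN = "globeschool.org.uk"
SITEMAP_URL = (
    "https://globeschool.org.uk.88-208-200-194."
    "greenschoolsonline.co.uk/sitemap.xml"
)


def sitemap_host_matches(site_domain, sitemap_url):
    """Return True if the sitemap host is the site domain or a subdomain of it."""
    host = (urlsplit(sitemap_url).hostname or "").lower()
    site = site_domain.lower()
    return host == site or host.endswith("." + site)


if __name__ == "__main__":
    print(sitemap_host_matches(SITE_DOMAIN, SITEMAP_URL))
```

Here the sitemap host only begins with the site domain and actually belongs to greenschoolsonline.co.uk, so the check returns False.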