gloucslearningalliance.org.uk
robots.txt

Robots Exclusion Standard data for gloucslearningalliance.org.uk

Resource Scan

Scan Details

Site Domain gloucslearningalliance.org.uk
Base Domain gloucslearningalliance.org.uk
Scan Status Ok
Last Scan2025-10-09T07:20:32+00:00
Next Scan 2025-10-23T07:20:32+00:00

Last Scan

Scanned2025-10-09T07:20:32+00:00
URL https://gloucslearningalliance.org.uk/robots.txt
Redirect https://glatrust.org.uk/robots.txt
Redirect Domain glatrust.org.uk
Redirect Base glatrust.org.uk
Domain IPs 88.208.230.52
Redirect IPs 88.208.230.52
Response IP 88.208.230.52
Found Yes
Hash 3756b8ebf63e2880763e31a1897642d6a5d338639f073608c31bfb5db9d8395e
SimHash 39155c13c781

Groups

*

Rule Path
Disallow /admin/
Disallow /bin/
Disallow /Connections/
Allow /i/
Disallow /inc/
Disallow /docs/
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.xls$
Disallow /*.docx$
Allow /inc/gallery/
Allow /i/photos/Gallery/

Other Records

Field Value
sitemap https://www.gloucslearningalliance.org.uk/sitemap.xml