horizonacademy.uk
robots.txt

Robots Exclusion Standard data for horizonacademy.uk

Resource Scan

Scan Details

Site Domain horizonacademy.uk
Base Domain horizonacademy.uk
Scan Status Ok
Last Scan 2025-10-07T18:58:28+00:00
Next Scan 2025-10-21T18:58:28+00:00

Last Scan

Scanned 2025-10-07T18:58:28+00:00
URL https://horizonacademy.uk/robots.txt
Redirect https://horizonacademytrust.co.uk/robots.txt
Redirect Domain horizonacademytrust.co.uk
Redirect Base horizonacademytrust.co.uk
Domain IPs 109.228.48.205
Redirect IPs 109.228.48.205
Response IP 109.228.48.205
Found Yes
Hash e436217bb9ddb4e5ad35f97d827770ade80ff0b4e15b6b2cfc74f6bf49bf24ee
SimHash 79155c11cf81

Groups

*

Rule Path
Disallow /admin/
Disallow /bin/
Disallow /Connections/
Allow /i/
Disallow /inc/
Disallow /docs/
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.xls$
Disallow /*.docx$
Allow /inc/gallery/
Allow /i/photos/Gallery/
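
Several of the rules above overlap (for example, Disallow /inc/ and Allow /inc/gallery/), so the result for a given path depends on how a crawler resolves conflicts. The sketch below is one way to evaluate this group, assuming the longest-match precedence described in RFC 9309: the matching rule with the longest pattern wins, and Allow wins a tie. The names RULES, pattern_to_regex and is_allowed are illustrative and not part of the scan data, and individual crawlers may behave differently.

```python
import re

# Rules for the "*" group, copied from the listing above.
RULES = [
    ("disallow", "/admin/"),
    ("disallow", "/bin/"),
    ("disallow", "/Connections/"),
    ("allow", "/i/"),
    ("disallow", "/inc/"),
    ("disallow", "/docs/"),
    ("disallow", "/*.pdf$"),
    ("disallow", "/*.doc$"),
    ("disallow", "/*.xls$"),
    ("disallow", "/*.docx$"),
    ("allow", "/inc/gallery/"),
    ("allow", "/i/photos/Gallery/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the
    pattern to the end of the path. Everything else is literal and
    case-sensitive, and patterns match from the start of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumes longest-match precedence: the matching rule with the
    longest pattern wins, Allow beats Disallow on equal length,
    and a path that matches no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    for p in ["/inc/header.php", "/inc/gallery/photo.jpg",
              "/prospectus.pdf", "/i/photos/Gallery/a.pdf"]:
        print(p, "allowed" if is_allowed(p) else "blocked")
```

Under this reading, /inc/gallery/ stays crawlable despite the broader /inc/ block, and a PDF under /i/photos/Gallery/ is also allowed because that Allow pattern is longer than /*.pdf$.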

Other Records

Field Value
sitemap https://horizontrust.greenhousecms.co.uk/sitemap.xml