manorgreenprimaryacademy.co.uk
robots.txt

Robots Exclusion Standard data for manorgreenprimaryacademy.co.uk

Resource Scan

Scan Details

Site Domain manorgreenprimaryacademy.co.uk
Base Domain manorgreenprimaryacademy.co.uk
Scan Status Ok
Last Scan2025-09-22T22:28:08+00:00
Next Scan 2025-10-06T22:28:08+00:00

Last Scan

Scanned2025-09-22T22:28:08+00:00
URL https://manorgreenprimaryacademy.co.uk/robots.txt
Redirect https://www.manorgreenprimaryacademy.co.uk/robots.txt
Redirect Domain www.manorgreenprimaryacademy.co.uk
Redirect Base manorgreenprimaryacademy.co.uk
Domain IPs 88.208.240.47
Redirect IPs 88.208.240.47
Response IP 88.208.240.47
Found Yes
Hash 6cf736b3f7acc64f401590f15974f284107a84b3d56b28e7ef0be23ee88837b8
SimHash 39155c13cf01

Groups

*

Rule Path
Disallow /admin/
Disallow /bin/
Disallow /Connections/
Allow /i/
Disallow /inc/
Disallow /docs/
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.xls$
Disallow /*.docx$
Allow /inc/gallery/
Allow /i/photos/Gallery/

Other Records

Field Value
sitemap http://manorgreenpa.greenhousecms.co.uk/sitemap.xml