trinityceprimary.school
robots.txt

Robots Exclusion Standard data for trinityceprimary.school

Resource Scan

Scan Details

Site Domain trinityceprimary.school
Base Domain trinityceprimary.school
Scan Status Ok
Last Scan2025-10-23T08:17:14+00:00
Next Scan 2025-11-06T08:17:14+00:00

Last Scan

Scanned2025-10-23T08:17:14+00:00
URL https://trinityceprimary.school/robots.txt
Redirect https://www.trinityceprimary.school/robots.txt
Redirect Domain www.trinityceprimary.school
Redirect Base trinityceprimary.school
Domain IPs 109.228.40.216
Redirect IPs 109.228.40.216
Response IP 109.228.40.216
Found Yes
Hash f5241431de2e1227f9be1df6aaefacefa634d7f135ac24bade87cad541a887f4
SimHash 79155c13cf11

Groups

*

Rule Path
Disallow /admin/
Disallow /bin/
Disallow /Connections/
Allow /i/
Disallow /inc/
Disallow /docs/
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.xls$
Disallow /*.docx$
Allow /inc/gallery/
Allow /i/photos/Gallery/

Other Records

Field Value
sitemap http://trinitychurch.greenhousecms.co.uk/sitemap.xml