wlfs-sixthform.org
robots.txt

Robots Exclusion Standard data for wlfs-sixthform.org

Resource Scan

Scan Details

Site Domain wlfs-sixthform.org
Base Domain wlfs-sixthform.org
Scan Status Ok
Last Scan2025-09-13T10:53:47+00:00
Next Scan 2025-09-27T10:53:47+00:00

Last Scan

Scanned2025-09-13T10:53:47+00:00
URL https://wlfs-sixthform.org/robots.txt
Redirect https://www.wlfs-sixthform.org/robots.txt
Redirect Domain www.wlfs-sixthform.org
Redirect Base wlfs-sixthform.org
Domain IPs 109.228.48.205
Redirect IPs 109.228.48.205
Response IP 109.228.48.205
Found Yes
Hash 33f575781a116e800020335b24e452a4ea31acf111e8aae3eec7918e28517f31
SimHash 39155c13c711

Groups

*

Rule Path
Disallow /admin/
Disallow /bin/
Disallow /Connections/
Allow /i/
Disallow /inc/
Disallow /docs/
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.xls$
Disallow /*.docx$
Allow /inc/gallery/
Allow /i/photos/Gallery/

Other Records

Field Value
sitemap http://wlfssixth.greenhousecms.co.uk/sitemap.xml