wlfs.org
robots.txt

Robots Exclusion Standard data for wlfs.org

Resource Scan

Scan Details

Site Domain wlfs.org
Base Domain wlfs.org
Scan Status Ok
Last Scan 2025-09-15T17:33:10+00:00
Next Scan 2025-09-29T17:33:10+00:00

Last Scan

Scanned 2025-09-15T17:33:10+00:00
URL https://wlfs.org/robots.txt
Redirect https://www.wlfs.org/robots.txt
Redirect Domain www.wlfs.org
Redirect Base wlfs.org
Domain IPs 213.171.204.221
Redirect IPs 213.171.204.221
Response IP 213.171.204.221
Found Yes
Hash 759fa68cc0fba79e228a1e04c2b95f58b61896e3668b1f0aaba590145c8fc310
SimHash 39155c11cf01
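
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the scan does not name the algorithm. As a minimal sketch under that assumption, a fetched copy of the file could be compared against the recorded value as follows. The names `sha256_hex` and `matches_recorded` are hypothetical helpers for illustration, and the exact bytes the scanner hashed (for example, before or after any normalization) are not known.

```python
import hashlib

# Hash value recorded in the scan above.
RECORDED_HASH = "759fa68cc0fba79e228a1e04c2b95f58b61896e3668b1f0aaba590145c8fc310"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """True if the body hashes to the recorded value (SHA-256 is an assumption)."""
    return sha256_hex(body) == recorded.lower()
```

A mismatch would not necessarily mean the file changed; it could also mean the scanner uses a different algorithm or hashes a normalized form of the body.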

Groups

*

Rule Path
Disallow /admin/
Disallow /bin/
Disallow /Connections/
Allow /i/
Disallow /inc/
Disallow /docs/
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.xls$
Disallow /*.docx$
Allow /inc/gallery/
Allow /i/photos/Gallery/
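
Several of the rules above overlap, for example Disallow /inc/ and Allow /inc/gallery/. The scan does not say how a given crawler resolves such overlaps; the sketch below assumes the longest-match precedence described in RFC 9309, where the matching rule with the longest path pattern wins and Allow wins a tie. The helpers `pattern_to_regex` and `is_allowed` are hypothetical names for illustration, and the test paths are made up.

```python
import re

# Rules copied from the "*" group above, as (directive, path pattern) pairs.
RULES = [
    ("Disallow", "/admin/"),
    ("Disallow", "/bin/"),
    ("Disallow", "/Connections/"),
    ("Allow", "/i/"),
    ("Disallow", "/inc/"),
    ("Disallow", "/docs/"),
    ("Disallow", "/*.pdf$"),
    ("Disallow", "/*.doc$"),
    ("Disallow", "/*.xls$"),
    ("Disallow", "/*.docx$"),
    ("Allow", "/inc/gallery/"),
    ("Allow", "/i/photos/Gallery/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Matching is case-sensitive and starts at the beginning of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    allowed = True  # a path that matches no rule is allowed
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "Allow"
            # The longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed
```

Under these assumptions, `is_allowed("/inc/header.php")` is False while `is_allowed("/inc/gallery/a.jpg")` is True, because the longer Allow pattern overrides Disallow /inc/. The same precedence means a PDF under /inc/gallery/ would be allowed, since /inc/gallery/ is a longer pattern than /*.pdf$. Crawlers that apply rules in file order instead would behave differently.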

Other Records

Field Value
sitemap http://wlfs.co.uk.213-171-204-221.greenschoolsonline.co.uk/sitemap.xml