cbswestlandrow.ie
robots.txt

Robots Exclusion Standard data for cbswestlandrow.ie

Resource Scan

Scan Details

Site Domain cbswestlandrow.ie
Base Domain cbswestlandrow.ie
Scan Status Ok
Last Scan2025-09-29T13:49:27+00:00
Next Scan 2025-10-13T13:49:27+00:00

Last Scan

Scanned2025-09-29T13:49:27+00:00
URL https://www.cbswestlandrow.ie/robots.txt
Domain IPs 213.171.204.221
Response IP 213.171.204.221
Found Yes
Hash 6378f1a4650f3aa5c0820ef426ee8974ef8ebd5a3f15464206687f206f0ed133
SimHash 39155c13cf91

Groups

*

Rule Path
Disallow /admin/
Disallow /bin/
Disallow /Connections/
Allow /i/
Disallow /inc/
Disallow /docs/
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.xls$
Disallow /*.docx$
Allow /inc/gallery/
Allow /i/photos/Gallery/

Other Records

Field Value
sitemap http://www.cbswestlandrow.ie/sitemap.xml