ceolas.org
robots.txt

Robots Exclusion Standard data for ceolas.org

Resource Scan

Scan Details

Site Domain ceolas.org
Base Domain ceolas.org
Scan Status Ok
Last Scan2026-01-12T00:15:51+00:00
Next Scan 2026-01-19T00:15:51+00:00

Last Scan

Scanned2026-01-12T00:15:51+00:00
URL https://ceolas.org/robots.txt
Domain IPs 216.92.79.5
Response IP 216.92.79.5
Found Yes
Hash 40e6ef93c598d30f756235bad7489f9ff4933535a332b1b88b7115739bfdb926
SimHash 05002954b527

Groups

*

Product Comment
* i.e. the following applies to all robots
Rule Path Comment
Disallow /virtual/cfc/ -
Disallow /pub/Mail-order -
Disallow /Old -
Disallow /admin -
Disallow /im -
Disallow /s -
Disallow /pub/artists/ -
Disallow /events/by_state/text -
Disallow /events/by_artist/text -
Disallow /pub/IrishNet/ -
Disallow /Irishnet/ These end up duplicating /IrishNet
Disallow /irishnet/ -
Disallow /VL/members/ -

Comments

  • robots.txt for http://ceolas.org/