gwct.org
robots.txt

Robots Exclusion Standard data for gwct.org

Resource Scan

Scan Details

Site Domain gwct.org
Base Domain gwct.org
Scan Status Ok
Last Scan2025-10-03T04:39:44+00:00
Next Scan 2025-11-02T04:39:44+00:00

Last Scan

Scanned2025-10-03T04:39:44+00:00
URL https://gwct.org/robots.txt
Redirect https://www.gwct.org/robots.txt
Redirect Domain www.gwct.org
Redirect Base gwct.org
Domain IPs 70.32.74.195
Redirect IPs 70.32.74.195
Response IP 70.32.74.195
Found Yes
Hash 5ce70a4b58d32396073f2dfe97fb1a4ab4c21f957cdb92891e3ca467fee1629f
SimHash 7e0814c8e194

Groups

*

Rule Path
Disallow /application/attributes
Disallow /application/authentication
Disallow /application/bootstrap
Disallow /application/config
Disallow /application/controllers
Disallow /application/elements
Disallow /application/helpers
Disallow /application/jobs
Disallow /application/languages
Disallow /application/mail
Disallow /application/models
Disallow /application/page_types
Disallow /application/single_pages
Disallow /application/tools
Disallow /application/views