cahc.ca
robots.txt

Robots Exclusion Standard data for cahc.ca

Resource Scan

Scan Details

Site Domain cahc.ca
Base Domain cahc.ca
Scan Status Ok
Last Scan 2024-05-30T22:18:28+00:00
Next Scan 2024-06-13T22:18:28+00:00

Last Scan

Scanned 2024-05-30T22:18:28+00:00
URL https://cahc.ca/robots.txt
Redirect https://www.cahc.ca/robots.txt
Redirect Domain www.cahc.ca
Redirect Base cahc.ca
Domain IPs 13.33.88.51, 13.33.88.69, 13.33.88.88, 13.33.88.99
Redirect IPs 13.33.88.51, 13.33.88.69, 13.33.88.88, 13.33.88.99
Response IP 13.33.88.99
Found Yes
Hash cae44ce3ed7626013c3b2e403c12a7d7c1d01de9dad167feda8f9f72a676b397
SimHash 78599ce4a38b

Groups

*

Rule Path
Disallow /dash/
Disallow /pp*
Disallow /trackback
Disallow /cgi-bin
Disallow /search
Disallow /rss
Disallow /comments/feed
Disallow /*/trackback/$
Disallow /hero_slides
Disallow /banners
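
Two of the rules above use pattern characters (`*` in /pp* and /*/trackback/$, and a trailing `$`), so they are not plain prefix matches. The following is a minimal sketch of how a crawler might evaluate these Disallow paths, assuming the common convention that `*` matches any run of characters and a trailing `$` anchors the end of the URL path. The names `DISALLOW`, `rule_to_regex`, and `is_disallowed` are hypothetical, and the sketch ignores Allow rules and longest-match precedence because the group lists none.

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW = [
    "/dash/", "/pp*", "/trackback", "/cgi-bin", "/search",
    "/rss", "/comments/feed", "/*/trackback/$", "/hero_slides", "/banners",
]

def rule_to_regex(rule):
    """Translate one robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the pattern to the end of the path. Everything else is
    treated as a literal prefix.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))

def is_disallowed(path):
    """Return True if any Disallow rule matches the start of `path`."""
    # re.match anchors at the beginning of the string, which gives
    # the prefix semantics robots.txt rules rely on.
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW)

if __name__ == "__main__":
    for candidate in ["/dash/settings", "/dashboard",
                      "/blog/post/trackback/", "/about"]:
        print(candidate, is_disallowed(candidate))
```

Under these assumptions /dash/settings and /blog/post/trackback/ are blocked, while /dashboard and /about are not, because /dash/ requires the trailing slash and none of the other rules match.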

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.cahc.ca/sitemap.xml
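
The crawl-delay and sitemap records can be read back with Python's standard `urllib.robotparser`. The sketch below is illustrative only: `ROBOTS_LINES` is a hypothetical reconstruction of the file from the records listed in this scan, not the verbatim robots.txt, and only a few Disallow lines are included for brevity. Note that `urllib.robotparser` does not interpret `*` or `$` inside paths, so it is used here only for the crawl-delay and sitemap fields.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of the scanned file from the records above.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /dash/",
    "Disallow: /search",
    "Disallow: /banners",
    "Crawl-delay: 10",
    "Sitemap: https://www.cahc.ca/sitemap.xml",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# Seconds a polite crawler would wait between requests for the "*" group.
delay = parser.crawl_delay("*")

# Sitemap URLs declared in the file (site_maps() requires Python 3.8+).
sitemaps = parser.site_maps()

print(delay)
print(sitemaps)
```

A crawler honoring these records would pause for the reported delay between requests and could seed its URL queue from the listed sitemap.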