intelcom.ca
robots.txt

Robots Exclusion Standard data for intelcom.ca

Resource Scan

Scan Details

Site Domain intelcom.ca
Base Domain intelcom.ca
Scan Status Ok
Last Scan2025-11-24T07:21:12+00:00
Next Scan 2025-12-24T07:21:12+00:00

Last Scan

Scanned2025-11-24T07:21:12+00:00
URL https://intelcom.ca/robots.txt
Domain IPs 104.21.34.172, 172.67.163.78, 2606:4700:3033::ac43:a34e, 2606:4700:3035::6815:22ac
Response IP 172.67.163.78
Found Yes
Hash 26b461bf1bcaaacb8968296df3c5c870397c787dc5341ed6f718808ea3961229
SimHash 694056f15011

Groups

slurp

Rule Path
Disallow

Other Records

Field Value
crawl-delay 100

gsa-crawler-www

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 100

googlebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 100

mediapartners-google

Rule Path
Disallow

yahoo-newscrawler

Rule Path
Disallow

msnbot

Rule Path
Disallow

Other Records

Field Value
crawl-delay 100

*

Rule Path
Disallow /recherche/
Disallow /*?q=
Disallow *?q
Allow /

Other Records

Field Value
sitemap https://intelcom.ca/fr/sitemap.xml