canon.ca
robots.txt

Robots Exclusion Standard data for canon.ca

Resource Scan

Scan Details

Site Domain canon.ca
Base Domain canon.ca
Scan Status Ok
Last Scan2024-11-04T16:35:07+00:00
Next Scan 2024-12-04T16:35:07+00:00

Last Scan

Scanned2024-11-04T16:35:07+00:00
URL https://canon.ca/robots.txt
Redirect https://canon.ca/dam/robots.txt
Domain IPs 146.184.161.61
Response IP 146.184.161.61
Found Yes
Hash 412adca2fc5fb1294a437009e49743f9f13264d241a49a02725ff9481d0078e3
SimHash 09085a52c791

Groups

*

Rule Path
Disallow /en/search?q=*
Disallow /fr/recherche?q=*
Disallow /botdetectcaptcha?get=*

adsbot-google

Rule Path
Allow /

Other Records

Field Value
sitemap https://canon.ca/dam/sitemapjune2024.xml
sitemap https://www.canon.ca/dam/sitemapjune2024.xml