canon.ca
robots.txt
Robots Exclusion Standard data for canon.ca
Resource Scan
Scan Details
Site Domain | canon.ca |
Base Domain | canon.ca |
Scan Status | Ok |
Last Scan | 2024-11-04T16:35:07+00:00 |
Next Scan | 2024-12-04T16:35:07+00:00 |
Last Scan
Scanned | 2024-11-04T16:35:07+00:00 |
URL | https://canon.ca/robots.txt |
Redirect | https://canon.ca/dam/robots.txt |
Domain IPs | 146.184.161.61 |
Response IP | 146.184.161.61 |
Found | Yes |
Hash | 412adca2fc5fb1294a437009e49743f9f13264d241a49a02725ff9481d0078e3 |
SimHash | 09085a52c791 |
Groups
*
Rule | Path |
---|---|
Disallow | /en/search?q=* |
Disallow | /fr/recherche?q=* |
Disallow | /botdetectcaptcha?get=* |
Other Records
Field | Value |
---|---|
sitemap | https://canon.ca/dam/sitemapjune2024.xml |
sitemap | https://www.canon.ca/dam/sitemapjune2024.xml |