www.csai.canon.com
robots.txt
Robots Exclusion Standard data for www.csai.canon.com
Resource Scan
Scan Details
Site Domain | www.csai.canon.com |
Base Domain | canon.com |
Scan Status | Ok |
Last Scan | 2024-11-03T11:40:27+00:00 |
Next Scan | 2024-11-17T11:40:27+00:00 |
Last Scan
Scanned | 2024-11-03T11:40:27+00:00 |
URL | https://www.csai.canon.com/robots.txt |
Domain IPs | 23.215.7.10, 23.215.7.18, 2600:1413:b000:1b::17d7:70a, 2600:1413:b000:1b::17d7:712 |
Response IP | 96.17.180.50 |
Found | Yes |
Hash | 9273ac6bf5ce0cf5022bee4fa6f300a609e144601d37c272542f0b4907df26e5 |
SimHash | 7901cc46ced3 |
Groups
*
Rule | Path |
---|---|
Disallow | */%21ut |
Disallow | */search |
Disallow | */product_compare/ |
Disallow | */consumer-catalog |
Disallow | */catalog/category/index/id/ |
Disallow | */checkout/ |
Disallow | */customer/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.usa.canon.com/sitemap.xml |