my.canon
robots.txt
Robots Exclusion Standard data for my.canon
Resource Scan
Scan Details
Site Domain | my.canon |
Base Domain | my.canon |
Scan Status | Ok |
Last Scan | 2024-10-28T23:38:27+00:00 |
Next Scan | 2024-11-27T23:38:27+00:00 |
Last Scan
Scanned | 2024-10-28T23:38:27+00:00 |
URL | https://my.canon/robots.txt |
Domain IPs | 18.155.68.128, 18.155.68.49, 18.155.68.54, 18.155.68.8 |
Response IP | 18.155.68.8 |
Found | Yes |
Hash | e411abd942afa00bb8b17c50de885722bbdfa52879d364192ddd4ce36b242840 |
SimHash | 3a589587e98b |
Groups
*
Rule | Path |
---|---|
Disallow | *sort%3Daz* |
Disallow | *sort%3Dza* |
Disallow | *sort%3Dnewest* |
Disallow | *sort%3Doldest* |
Disallow | *sort%3DhighestPrice* |
Disallow | *sort%3DlowestPrice* |
Disallow | */business/search?q=* |
Disallow | */consumer/search?q=* |
Disallow | */support/search?q=* |
Disallow | */support/get-search-result-content* |
Disallow | */support/download?* |
Disallow | */admin/* |
Other Records
Field | Value |
---|---|
crawl-delay | 30 |