id.canon
robots.txt
Robots Exclusion Standard data for id.canon
Resource Scan
Scan Details
Site Domain | id.canon |
Base Domain | id.canon |
Scan Status | Ok |
Last Scan | 2024-10-29T14:40:33+00:00 |
Next Scan | 2024-11-28T14:40:33+00:00 |
Last Scan
Scanned | 2024-10-29T14:40:33+00:00 |
URL | https://id.canon/robots.txt |
Domain IPs | 13.227.254.116, 13.227.254.19, 13.227.254.62, 13.227.254.81 |
Response IP | 13.227.254.81 |
Found | Yes |
Hash | e411abd942afa00bb8b17c50de885722bbdfa52879d364192ddd4ce36b242840 |
SimHash | 3a589587e98b |
Groups
*
Rule | Path |
---|---|
Disallow | *sort%3Daz* |
Disallow | *sort%3Dza* |
Disallow | *sort%3Dnewest* |
Disallow | *sort%3Doldest* |
Disallow | *sort%3DhighestPrice* |
Disallow | *sort%3DlowestPrice* |
Disallow | */business/search?q=* |
Disallow | */consumer/search?q=* |
Disallow | */support/search?q=* |
Disallow | */support/get-search-result-content* |
Disallow | */support/download?* |
Disallow | */admin/* |
Other Records
Field | Value |
---|---|
crawl-delay | 30 |