cdn.idc.com
robots.txt
Robots Exclusion Standard data for cdn.idc.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | cdn.idc.com |
Base Domain | idc.com |
Scan Status | Ok |
Last Scan | 2024-09-20T05:19:03+00:00 |
Next Scan | 2024-10-20T05:19:03+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-20T05:19:03+00:00 |
URL | https://cdn.idc.com/robots.txt |
Domain IPs | 54.192.18.102, 54.192.18.121, 54.192.18.127, 54.192.18.34 |
Response IP | 108.156.133.118 |
Found | Yes |
Hash | 98163158f6e85ce4e7118c629e95d33460f3b92259bd6100726be9fbe6b64e7d |
SimHash | ea1dd8128f53 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /getdoc.jsp?containerId=SEV |
Disallow | /search/ |
Disallow | /action/login |
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
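
The rules and crawl-delay listed above can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is reconstructed from the tables in this report, not copied from the live file, and placing `Crawl-delay` inside the `*` group is an assumption. The `is_allowed` helper and the `example-bot` agent name are hypothetical, introduced only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan tables; the real file's layout may differ.
# Assumption: the crawl-delay record belongs to the "*" group.
ROBOTS_TXT = """\
User-agent: *
Disallow: /getdoc.jsp?containerId=SEV
Disallow: /search/
Disallow: /action/login
Crawl-delay: 10
"""

parser = RobotFileParser()
# parse() accepts an iterable of lines, so no network request is made.
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path, agent="example-bot"):
    """Hypothetical helper: True if `agent` may fetch `path` on cdn.idc.com."""
    return parser.can_fetch(agent, "https://cdn.idc.com" + path)


# Seconds a compliant crawler should wait between requests.
crawl_delay = parser.crawl_delay("example-bot")

if __name__ == "__main__":
    for path in ("/search/results", "/action/login", "/images/logo.png"):
        print(path, is_allowed(path))
    print("crawl-delay:", crawl_delay)
```

Disallow rules are matched as path prefixes, so `/search/` covers every URL beneath that directory, and the `getdoc.jsp` rule is meant to cover container IDs that begin with `SEV`.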