on.cc
robots.txt
Robots Exclusion Standard data for on.cc
Resource Scan
Scan Details
Site Domain | on.cc |
Base Domain | on.cc |
Scan Status | Ok |
Last Scan | 2024-05-17T23:10:47+00:00 |
Next Scan | 2024-05-24T23:10:47+00:00 |
Last Scan
Scanned | 2024-05-17T23:10:47+00:00 |
URL | https://on.cc/robots.txt |
Domain IPs | 104.17.160.210, 104.17.255.180 |
Response IP | 104.17.255.180 |
Found | Yes |
Hash | 4a1356d9d5100f1d2df9287fdbfa0ea117ebdff083b64b30312a8bcdc58a3043 |
SimHash | 2970d2374592 |
Groups
*
Rule | Path |
---|---|
Disallow | /entertainment/ |
Disallow | /news/ |
Disallow | /finance/ |
Disallow | /sport/ |
Disallow | /cn/ |
Disallow | /tw/ |
Disallow | /int/ |
Disallow | /cgi-bin/ |
Disallow | /onad/ |
Allow | / |
Other Records
Field | Value |
---|---|
sitemap | https://hk.on.cc/sitemap.xml |