cc.com
robots.txt
Robots Exclusion Standard data for cc.com
Resource Scan
Scan Details
Site Domain | cc.com |
Base Domain | cc.com |
Scan Status | Ok |
Last Scan | 2024-05-03T06:56:57+00:00 |
Next Scan | 2024-05-10T06:56:57+00:00 |
Last Scan
Scanned | 2024-05-03T06:56:57+00:00 |
URL | https://cc.com/robots.txt |
Redirect | https://www.cc.com/robots.txt |
Redirect Domain | www.cc.com |
Redirect Base | cc.com |
Domain IPs | 34.213.106.51, 54.68.182.72 |
Redirect IPs | 23.39.53.72, 2600:1413:b000:38e::2215, 2600:1413:b000:391::2215 |
Response IP | 23.54.58.160 |
Found | Yes |
Hash | 4cd435bf3829189c8979e6a55f909665e8ffbc32f299dc45799ef4ac5d483883 |
SimHash | 6905005ace53 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /api/ |
Disallow | /tve/ |
Disallow | /search |
Disallow | /feeds/latest_results/ |
Disallow | /fragments/ |
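
As a rough illustration of how a crawler might apply the wildcard (`*`) group listed here, the sketch below rebuilds the five Disallow rules in memory and checks paths against them with Python's standard `urllib.robotparser`. The `ExampleBot` agent name and the `is_allowed` helper are hypothetical, and the parser's plain prefix matching may differ from how a given crawler interprets the same rules.

```python
from urllib.robotparser import RobotFileParser

# Disallow paths copied from the wildcard group in the scan results.
DISALLOWED = [
    "/api/",
    "/tve/",
    "/search",
    "/feeds/latest_results/",
    "/fragments/",
]


def build_parser(disallowed):
    """Build a parser from an in-memory rule list (no network access)."""
    lines = ["User-agent: *"] + [f"Disallow: {path}" for path in disallowed]
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


PARSER = build_parser(DISALLOWED)


def is_allowed(path, agent="ExampleBot"):
    """Hypothetical helper: True if `agent` may fetch `path` on www.cc.com."""
    return PARSER.can_fetch(agent, "https://www.cc.com" + path)


if __name__ == "__main__":
    for path in ["/api/v1", "/tve/login", "/search", "/shows"]:
        print(path, is_allowed(path))
```

Because matching is by prefix, `/search` also covers longer paths that begin with the same characters, while paths outside the five listed prefixes remain fetchable.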
Other Records
Field | Value |
---|---|
sitemap | https://www.cc.com/xmlsitemap/video |
sitemap | https://www.cc.com/xmlsitemap/photogallery |
sitemap | https://www.cc.com/xmlsitemap/episode |
sitemap | https://www.cc.com/xmlsitemap/season |
sitemap | https://www.cc.com/xmlsitemap/show |
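
The sitemap records above could be read back programmatically along these lines. This is a minimal sketch that assumes Python 3.8 or later (for `RobotFileParser.site_maps()`); the `RECORDS` list simply mirrors the table, and `sitemap_urls` and `sitemap_kinds` are hypothetical helper names.

```python
from urllib.parse import urlparse
from urllib.robotparser import RobotFileParser

# (field, value) pairs copied from the "Other Records" table.
RECORDS = [
    ("sitemap", "https://www.cc.com/xmlsitemap/video"),
    ("sitemap", "https://www.cc.com/xmlsitemap/photogallery"),
    ("sitemap", "https://www.cc.com/xmlsitemap/episode"),
    ("sitemap", "https://www.cc.com/xmlsitemap/season"),
    ("sitemap", "https://www.cc.com/xmlsitemap/show"),
]


def sitemap_urls(records):
    """Render the records as robots.txt lines and let the parser extract sitemaps."""
    lines = [f"{field}: {value}" for field, value in records]
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no sitemap lines were found.
    return parser.site_maps() or []


def sitemap_kinds(urls):
    """Take the last path segment of each sitemap URL as its content type."""
    return [urlparse(url).path.rsplit("/", 1)[-1] for url in urls]


if __name__ == "__main__":
    for url in sitemap_urls(RECORDS):
        print(url)
```

Treating the last path segment as a content type is only a convenience for this listing; nothing in the scan data states that the site defines it that way.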