cc.com
robots.txt

Robots Exclusion Standard data for cc.com

Resource Scan

Scan Details

Site Domain cc.com
Base Domain cc.com
Scan Status Ok
Last Scan 2024-05-03T06:56:57+00:00
Next Scan 2024-05-10T06:56:57+00:00

Last Scan

Scanned 2024-05-03T06:56:57+00:00
URL https://cc.com/robots.txt
Redirect https://www.cc.com/robots.txt
Redirect Domain www.cc.com
Redirect Base cc.com
Domain IPs 34.213.106.51, 54.68.182.72
Redirect IPs 23.39.53.72, 2600:1413:b000:38e::2215, 2600:1413:b000:391::2215
Response IP 23.54.58.160
Found Yes
Hash 4cd435bf3829189c8979e6a55f909665e8ffbc32f299dc45799ef4ac5d483883
SimHash 6905005ace53
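
The Hash value above is 64 hexadecimal digits, which is the length of a SHA-256 digest, but the scan does not name the algorithm or say which bytes were hashed. The following is a minimal sketch, assuming SHA-256 over the raw response body, of how such a fingerprint could be used to detect a changed robots.txt between scans. The function names body_hash and has_changed are hypothetical.

```python
import hashlib


def body_hash(body: bytes) -> str:
    """Hex digest of the response body (SHA-256 is an assumption)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str) -> bool:
    """True when the freshly fetched body no longer matches the stored hash."""
    return body_hash(body) != previous_hash


if __name__ == "__main__":
    # Hash recorded by the scan on 2024-05-03.
    recorded = "4cd435bf3829189c8979e6a55f909665e8ffbc32f299dc45799ef4ac5d483883"
    # Placeholder body: the real file content is not part of this report.
    sample = b"User-agent: petalbot\nDisallow: /\n"
    print(has_changed(sample, recorded))
```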

Groups

*

Rule Path
Disallow /api/
Disallow /tve/
Disallow /search
Disallow /feeds/latest_results/
Disallow /fragments/

twitterbot

Rule Path
Disallow /
Allow /

petalbot

Rule Path
Disallow /
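
The groups above can be read as a small rule table. The sketch below evaluates them with the longest-match rule described in RFC 9309, where an Allow rule wins when it ties with a Disallow rule of the same length. It is a simplification under stated assumptions: user-agent names are compared exactly (lower-cased), wildcards are not handled, and the GROUPS table and is_allowed function are illustrative names, not part of the scan.

```python
# Groups as listed in the scan: user-agent -> ordered (rule, path) pairs.
GROUPS = {
    "*": [
        ("disallow", "/api/"),
        ("disallow", "/tve/"),
        ("disallow", "/search"),
        ("disallow", "/feeds/latest_results/"),
        ("disallow", "/fragments/"),
    ],
    "twitterbot": [("disallow", "/"), ("allow", "/")],
    "petalbot": [("disallow", "/")],
}


def is_allowed(user_agent: str, path: str) -> bool:
    """Longest-match evaluation; an allow rule wins a tie in length."""
    # Fall back to the "*" group when no group names this user agent.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for rule, prefix in rules:
        if not path.startswith(prefix):
            continue
        is_allow = rule == "allow"
        if len(prefix) > best_len or (len(prefix) == best_len and is_allow):
            best_len, allowed = len(prefix), is_allow
    return allowed


if __name__ == "__main__":
    for agent, path in [
        ("twitterbot", "/api/feed"),
        ("petalbot", "/shows"),
        ("examplebot", "/search?q=x"),
        ("examplebot", "/shows"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under this reading, the twitterbot group permits everything because its Allow / ties with Disallow /, petalbot is blocked from the whole site, and every other crawler is blocked only from the five listed prefixes. A parser that stops at the first matching rule would instead treat twitterbot as blocked, so the result depends on the evaluation strategy.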

Other Records

Field Value
sitemap https://www.cc.com/xmlsitemap/video
sitemap https://www.cc.com/xmlsitemap/photogallery
sitemap https://www.cc.com/xmlsitemap/episode
sitemap https://www.cc.com/xmlsitemap/season
sitemap https://www.cc.com/xmlsitemap/show
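
The split between Groups and Other Records reflects how a robots.txt file is read: Allow and Disallow lines belong to the preceding User-agent lines, while fields such as Sitemap stand outside any group. The sketch below shows one way to produce that split. The SAMPLE text is a reconstruction from the listing in this report, not the verbatim file served by www.cc.com, and parse_robots is a hypothetical helper.

```python
# Reconstructed from the scan listing; the real file may differ in
# ordering, comments, and capitalization.
SAMPLE = """\
User-agent: *
Disallow: /api/
Disallow: /tve/
Disallow: /search
Disallow: /feeds/latest_results/
Disallow: /fragments/

User-agent: twitterbot
Disallow: /
Allow: /

User-agent: petalbot
Disallow: /

Sitemap: https://www.cc.com/xmlsitemap/video
Sitemap: https://www.cc.com/xmlsitemap/photogallery
Sitemap: https://www.cc.com/xmlsitemap/episode
Sitemap: https://www.cc.com/xmlsitemap/season
Sitemap: https://www.cc.com/xmlsitemap/show
"""


def parse_robots(text: str):
    """Split robots.txt text into per-agent rule groups and other records."""
    groups = {}       # user-agent -> list of (rule, path)
    others = []       # (field, value) records outside groups, e.g. sitemap
    agents = []       # user-agents of the group currently being read
    in_rules = False  # True once the current group has at least one rule
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and padding
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if in_rules:  # a User-agent line after rules starts a new group
                agents, in_rules = [], False
            agents.append(value.lower())
            groups.setdefault(value.lower(), [])
        elif field in ("allow", "disallow"):
            in_rules = True
            for agent in agents:
                groups[agent].append((field, value))
        else:
            others.append((field, value))
    return groups, others


if __name__ == "__main__":
    groups, others = parse_robots(SAMPLE)
    print(sorted(groups))
    for field, value in others:
        print(field, value)
```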