thecw.com
robots.txt

Robots Exclusion Standard data for thecw.com

Resource Scan

Scan Details

Site Domain thecw.com
Base Domain thecw.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-08-31T23:09:26+00:00
Next Scan 2024-11-29T23:09:26+00:00

Last Successful Scan

Scanned2024-02-04T22:05:04+00:00
URL https://thecw.com/robots.txt
Domain IPs 104.21.38.120, 172.67.222.151, 2606:4700:3031::6815:2678, 2606:4700:3037::ac43:de97
Response IP 172.67.222.151
Found Yes
Hash ee43e2508e105aa22d9daf4e697f0c41485a2213e119572fea70adf8974c707e
SimHash 200b5b00ed53

Groups

*

Rule Path
Disallow /contest/enter/daily-video-qa1

Other Records

Field Value
sitemap https://www.cwtv.com/sitemap.xml
sitemap https://www.cwtv.com/images/c/xml/videositemap.xml
sitemap https://images.cwtv.com/feed/google-search/

Comments

  • robots.txt for https://www.cwtv.com/