qclife.wbtv.com
robots.txt

Robots Exclusion Standard data for qclife.wbtv.com

Resource Scan

Scan Details

Site Domain qclife.wbtv.com
Base Domain wbtv.com
Scan Status Ok
Last Scan2024-05-15T04:59:57+00:00
Next Scan 2024-05-22T04:59:57+00:00

Last Scan

Scanned2024-05-15T04:59:57+00:00
URL https://qclife.wbtv.com/robots.txt
Domain IPs 2600:1413:b000:14::b857:c151, 2600:1413:b000:14::b857:c155, 72.247.127.227, 72.247.127.242
Response IP 42.99.140.203
Found Yes
Hash 64121f052a5d674d0434cbcd5bf15e6d11f547737716d315252a80d4a1c669c8
SimHash aa341834a933

Groups

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

*

Rule Path
Disallow /*?outputType=apps

Other Records

Field Value
sitemap https://qclife.wbtv.com/arc/outboundfeeds/sitemap-index/?outputType=xml
sitemap https://qclife.wbtv.com/arc/outboundfeeds/news-sitemap-index/?outputType=xml
sitemap https://qclife.wbtv.com/arc/outboundfeeds/video-sitemap/?outputType=xml