qclife.wbtv.com
robots.txt

Robots Exclusion Standard data for qclife.wbtv.com

Resource Scan

Scan Details

Site Domain qclife.wbtv.com
Base Domain wbtv.com
Scan Status Ok
Last Scan2024-11-06T21:28:04+00:00
Next Scan 2024-11-13T21:28:04+00:00

Last Scan

Scanned2024-11-06T21:28:04+00:00
URL https://qclife.wbtv.com/robots.txt
Domain IPs 23.45.207.206, 23.45.207.207, 2600:1413:b000:13::b857:c18e, 2600:1413:b000:13::b857:c194
Response IP 23.45.207.207
Found Yes
Hash 397ac8dc3a74d71a4fb2723ca4e4fa2a37ab89cb76d00f69b6352eda6ab09aad
SimHash 7a105940a010

Groups

gptbot
chatgpt-user
google-extended
ccbot
amazonbot
anthropic-ai
bytespider
claudebot
claude-web
facebookbot
omgili
omgilibot
perplexitybot

Rule Path
Disallow /

*

Rule Path
Disallow /*?outputType=apps

Other Records

Field Value
sitemap https://qclife.wbtv.com/arc/outboundfeeds/sitemap-index/?outputType=xml
sitemap https://qclife.wbtv.com/arc/outboundfeeds/news-sitemap-index/?outputType=xml
sitemap https://qclife.wbtv.com/arc/outboundfeeds/video-sitemap/?outputType=xml