orientaldaily.on.cc
robots.txt

Robots Exclusion Standard data for orientaldaily.on.cc

Resource Scan

Scan Details

Site Domain orientaldaily.on.cc
Base Domain on.cc
Scan Status Ok
Last Scan2025-06-27T05:37:35+00:00
Next Scan 2025-07-27T05:37:35+00:00

Last Scan

Scanned2025-06-27T05:37:35+00:00
URL https://orientaldaily.on.cc/robots.txt
Domain IPs 104.17.160.210, 104.17.255.180
Response IP 104.17.255.180
Found Yes
Hash 1ff87530ac49db157b6ffc54bd55efa9b9487090718bf2d06dfba30a9325b906
SimHash 70242944e0b0

Groups

*

Rule Path
Disallow /*_cn.html
Disallow /*index.html?article_id

anthropic-ai
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
gnowitnewsbot
google-extended
gptbot
leikibot
meta-externalagent
perplexitybot
quora-bot

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://orientaldaily.on.cc/sitemap.xml

Warnings

  • 1 invalid line.