tv-osaka.co.jp
robots.txt

Robots Exclusion Standard data for tv-osaka.co.jp

Resource Scan

Scan Details

Site Domain tv-osaka.co.jp
Base Domain tv-osaka.co.jp
Scan Status Ok
Last Scan 2025-05-12T18:21:14+00:00
Next Scan 2025-05-19T18:21:14+00:00

Last Scan

Scanned 2025-05-12T18:21:14+00:00
URL https://www.tv-osaka.co.jp/robots.txt
Domain IPs 13.112.229.211, 57.182.163.88
Response IP 57.182.163.88
Found Yes
Hash 07207b0b236de4e6f6712cab92aaac59448c9284a6fe5f02cff88fe98ffeebd8
SimHash 3097a105c092
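
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. Assuming it is SHA-256 over the raw response body, a change check between two scans could be sketched as follows. The helper names `sha256_hex` and `has_changed` and the sample body are hypothetical, not taken from the scanner.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hex: str) -> bool:
    """Compare a freshly fetched body against the digest stored from an earlier scan."""
    return sha256_hex(body) != previous_hex


# Hypothetical sample body; this is NOT the real tv-osaka.co.jp robots.txt.
sample = b"User-agent: gptbot\nDisallow: /news/\n"
digest = sha256_hex(sample)

print(len(digest))                  # length of the hex digest
print(has_changed(sample, digest))  # same bytes, so no change is reported
```

An exact digest like this only detects that something changed. The separate SimHash field suggests the scanner also tracks near-duplicate similarity, which this sketch does not attempt.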

Groups

ccbot

Rule Path
Disallow /news/

gptbot

Rule Path
Disallow /news/

chatgpt-user

Rule Path
Disallow /news/

google-extended

Rule Path
Disallow /news/

anthropic-ai

Rule Path
Disallow /news/

cohere-ai

Rule Path
Disallow /news/

omgili

Rule Path
Disallow /news/

omgilibot

Rule Path
Disallow /news/

icc-crawler

Rule Path
Disallow /news/

applebot-extended

Rule Path
Disallow /news/

claudebot

Rule Path
Disallow /news/

claude-web

Rule Path
Disallow /news/

perplexitybot

Rule Path
Disallow /news/

perplexity-ai

Rule Path
Disallow /news/

bytespider

Rule Path
Disallow /news/

diffbot

Rule Path
Disallow /news/

facebookbot

Rule Path
Disallow /news/

oai-searchbot

Rule Path
Disallow /news/

mj12bot

Rule Path
Disallow /news/

piplbot

Rule Path
Disallow /news/

meta-externalagent

Rule Path
Disallow /news/

timpibot

Rule Path
Disallow /news/

webzio

Rule Path
Disallow /news/

webzio-extended

Rule Path
Disallow /news/
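
Every group listed above carries the same single rule, Disallow /news/. The effect of such groups can be checked with Python's standard `urllib.robotparser`, as in the minimal sketch below. It rebuilds a robots.txt from a subset of the listed agents, writing one group per agent; the real file may lay out its groups differently and may contain directives this report does not show. `build_robots_txt` and `can_fetch` are hypothetical helper names.

```python
from urllib.robotparser import RobotFileParser

# A subset of the user agents listed in the groups above, for brevity.
AGENTS = ["ccbot", "gptbot", "claudebot", "bytespider"]


def build_robots_txt(agents, path="/news/"):
    """Emit one group per agent, each with a single Disallow rule.

    Assumption: one group per agent. The layout of the real file is unknown.
    """
    groups = (f"User-agent: {agent}\nDisallow: {path}" for agent in agents)
    return "\n\n".join(groups) + "\n"


def can_fetch(robots_txt, agent, url):
    """Parse the robots.txt text and ask whether `agent` may fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


robots_txt = build_robots_txt(AGENTS)

# A listed agent is blocked under /news/ but not elsewhere on the site.
print(can_fetch(robots_txt, "GPTBot", "https://www.tv-osaka.co.jp/news/"))
print(can_fetch(robots_txt, "GPTBot", "https://www.tv-osaka.co.jp/"))

# An agent with no matching group (and no "*" group in this sketch) is allowed.
print(can_fetch(robots_txt, "Googlebot", "https://www.tv-osaka.co.jp/news/"))
```

Agent names are matched case-insensitively by the parser, so the lowercase names in this report match a crawler that announces itself as `GPTBot`.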