www1.nhk.or.jp
robots.txt

Robots Exclusion Standard data for www1.nhk.or.jp

Resource Scan

Scan Details

Site Domain www1.nhk.or.jp
Base Domain nhk.or.jp
Scan Status Ok
Last Scan 2024-11-03T07:45:49+00:00
Next Scan 2024-12-03T07:45:49+00:00

Last Scan

Scanned 2024-11-03T07:45:49+00:00
URL https://www1.nhk.or.jp/robots.txt
Domain IPs 101.102.235.203, 202.247.51.203, 202.79.241.203, 202.79.241.44
Response IP 101.102.235.203
Found Yes
Hash 20202f108f48c82fbbc5ec6d0500b34bf0418d1f6895547d9dacf5589171b7b1
SimHash 490cf940e611
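
The Hash value above is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The scan does not name the algorithm, so treating it as SHA-256 is an assumption. Under that assumption, a rough sketch of how such a fingerprint could be used to detect changes between scans might look like the following. The function names `fingerprint` and `has_changed` are hypothetical and not part of the scan tool.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt response body.

    Assumption: the scanner's "Hash" field is a SHA-256 digest; the
    report itself does not say which algorithm or byte encoding it uses.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return fingerprint(body) != previous_hash
```

An exact digest changes on any byte difference, which is presumably why the report also lists a SimHash, a fingerprint designed to stay similar when the content changes only slightly.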

Groups

gptbot
chatgpt-user
claudebot
google-extended
applebot-extended
anthropic-ai
cohere-ai
ccbot
icc-crawler
bytespider

Rule Path
Disallow /

petalbot
baiduspider
baiduimagespider
googlebot-video

Rule Path
Disallow /

twitterbot

Rule Path
Disallow /*/r/

*

Rule Path
Disallow /
Allow /robots.txt
Allow /favicon.ico
Allow /asaichi/
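
The groups above can be evaluated programmatically. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The `GROUPS` table is transcribed from the scan. The helpers `pattern_to_regex`, `rules_for`, and `is_allowed` are hypothetical names, and user-agent matching is simplified to an exact, case-insensitive token comparison.

```python
import re

# Groups as listed in the scan: (user-agent tokens, ordered rules).
GROUPS = [
    (["gptbot", "chatgpt-user", "claudebot", "google-extended",
      "applebot-extended", "anthropic-ai", "cohere-ai", "ccbot",
      "icc-crawler", "bytespider"],
     [("disallow", "/")]),
    (["petalbot", "baiduspider", "baiduimagespider", "googlebot-video"],
     [("disallow", "/")]),
    (["twitterbot"],
     [("disallow", "/*/r/")]),
    (["*"],
     [("disallow", "/"), ("allow", "/robots.txt"),
      ("allow", "/favicon.ico"), ("allow", "/asaichi/")]),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern to a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def rules_for(agent):
    """Return the rules of the group naming this agent, else the '*' group."""
    agent = agent.lower()
    fallback = []
    for agents, rules in GROUPS:
        if agent in agents:
            return rules
        if "*" in agents:
            fallback = rules
    return fallback


def is_allowed(agent, path):
    """Apply longest-match precedence; a path with no matching rule is allowed."""
    best_len, allowed = -1, True
    for kind, pattern in rules_for(agent):
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longer pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, allowed = length, kind == "allow"
    return allowed
```

Under these assumptions, `is_allowed("googlebot", "/asaichi/")` is true because `/asaichi/` is a longer match than `/`, while `is_allowed("gptbot", "/asaichi/")` is false because gptbot has its own group and never falls through to `*`. A first-match evaluator that reads rules in file order would instead stop at `Disallow /` in the `*` group, so the choice of precedence rule matters for this file.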

Other Records

Field Value
sitemap https://www.nhk.or.jp/sitemap.xml