www3.nhk.or.jp
robots.txt

Robots Exclusion Standard data for www3.nhk.or.jp

Resource Scan

Scan Details

Site Domain www3.nhk.or.jp
Base Domain nhk.or.jp
Scan Status Ok
Last Scan 2024-11-06T14:46:05+00:00
Next Scan 2024-12-06T14:46:05+00:00
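
The two scan timestamps are exactly 30 days apart, which is consistent with a fixed rescan interval, though the report does not state one. A minimal Python sketch of that arithmetic follows; the name `RESCAN_INTERVAL` and the 30-day value are assumptions inferred from the two dates, not documented scanner settings.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details fields.
last_scan = datetime.fromisoformat("2024-11-06T14:46:05+00:00")
next_scan = datetime.fromisoformat("2024-12-06T14:46:05+00:00")

# Assumption: a fixed interval; 30 days is inferred, not documented.
RESCAN_INTERVAL = timedelta(days=30)

interval = next_scan - last_scan
matches_assumption = (last_scan + RESCAN_INTERVAL == next_scan)
print(interval.days, matches_assumption)
```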

Last Scan

Scanned 2024-11-06T14:46:05+00:00
URL https://www3.nhk.or.jp/robots.txt
Domain IPs 23.36.48.160
Response IP 23.54.56.170
Found Yes
Hash e69b176f4a5424259a79329b2622e4e66e7bd8a4721e380367abfcc2a642b1bf
SimHash 481c4941f2f3
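
The Hash value is 64 hexadecimal digits long, which matches the length of a SHA-256 digest, but the report does not name the algorithm. The sketch below assumes SHA-256 over the raw response body and shows how a later fetch could be compared against the recorded value. The helper names `sha256_hex` and `is_unchanged` are hypothetical.

```python
import hashlib

# Hash value copied from the Last Scan fields.
REPORTED_HASH = "e69b176f4a5424259a79329b2622e4e66e7bd8a4721e380367abfcc2a642b1bf"

def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()

def is_unchanged(body: bytes, reported_hash: str = REPORTED_HASH) -> bool:
    """Assumption: the scanner hashes the unmodified robots.txt body with SHA-256."""
    return sha256_hex(body) == reported_hash.lower()
```

If the scanner normalises the body (for example line endings) before hashing, or uses a different algorithm, the comparison will report a change even when the file is the same.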

Groups

gptbot
chatgpt-user
claudebot
google-extended
applebot-extended
anthropic-ai
cohere-ai
ccbot
icc-crawler
bytespider

Rule Path
Disallow /

googlebot-news

Rule Path
Disallow /*/r/

googlebot-image

Rule Path
Disallow /*/r/

googlebot-video

Rule Path
Disallow /*/r/

googlebot

Rule Path
Disallow /*/r/

applebot

Rule Path
Allow /
Disallow /*/r/

twitterbot

Rule Path
Disallow (empty)

*

Rule Path
Disallow /*/r/
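
The groups above can be evaluated with a small matcher. The sketch below is a simplified reading of the Robots Exclusion Protocol matching rules: `*` matches any character sequence, the longest matching pattern wins, Allow wins a tie, and an empty Disallow restricts nothing. It looks groups up by exact user-agent token, which is a simplification of how real crawlers select a group. The names `GROUPS` and `is_allowed` are hypothetical.

```python
import re

# Groups transcribed from the report above.
AI_AGENTS = [
    "gptbot", "chatgpt-user", "claudebot", "google-extended",
    "applebot-extended", "anthropic-ai", "cohere-ai", "ccbot",
    "icc-crawler", "bytespider",
]
GROUPS = {agent: [("disallow", "/")] for agent in AI_AGENTS}
GROUPS.update({
    "googlebot-news": [("disallow", "/*/r/")],
    "googlebot-image": [("disallow", "/*/r/")],
    "googlebot-video": [("disallow", "/*/r/")],
    "googlebot": [("disallow", "/*/r/")],
    "applebot": [("allow", "/"), ("disallow", "/*/r/")],
    "twitterbot": [("disallow", "")],
    "*": [("disallow", "/*/r/")],
})

def _pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))

def is_allowed(agent: str, path: str) -> bool:
    """Return True if the agent's group permits fetching the path."""
    # Simplification: exact token lookup, falling back to the "*" group.
    rules = GROUPS.get(agent.lower(), GROUPS["*"])
    best_len, allowed = -1, True  # no matching rule means allowed
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty pattern matches nothing
        if _pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed
```

For example, `is_allowed("gptbot", "/news/")` is False because that group disallows everything, while `is_allowed("twitterbot", "/news/r/x")` is True because its only rule is an empty Disallow. The paths used here are illustrative and not taken from the site.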

Other Records

Field Value
sitemap https://www3.nhk.or.jp/news/sitemap-news-index.xml
sitemap https://www3.nhk.or.jp/nhkworld/sitemap.xml
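
Sitemap records sit outside the user-agent groups and apply to every crawler. The following sketch shows one way to pull them out of robots.txt text. The `SAMPLE` string is a hypothetical reconstruction consistent with the records listed above, not the actual file, and `extract_sitemaps` is an illustrative helper name.

```python
from typing import List

def extract_sitemaps(robots_txt: str) -> List[str]:
    """Collect the values of all Sitemap fields, in file order."""
    sitemaps = []
    for raw in robots_txt.splitlines():
        # Drop trailing comments, then split the field name from its value.
        line = raw.split("#", 1)[0].strip()
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            sitemaps.append(value.strip())
    return sitemaps

# Hypothetical reconstruction; the real file's layout is not shown in the report.
SAMPLE = """\
User-agent: *
Disallow: /*/r/

Sitemap: https://www3.nhk.or.jp/news/sitemap-news-index.xml
Sitemap: https://www3.nhk.or.jp/nhkworld/sitemap.xml
"""

print(extract_sitemaps(SAMPLE))
```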