nhk.or.jp
robots.txt

Robots Exclusion Standard data for nhk.or.jp

Resource Scan

Scan Details

Site Domain nhk.or.jp
Base Domain nhk.or.jp
Scan Status Ok
Last Scan2024-11-09T17:38:36+00:00
Next Scan 2024-12-09T17:38:36+00:00

Last Scan

Scanned2024-11-09T17:38:36+00:00
URL https://www.nhk.or.jp/robots.txt
Domain IPs 23.36.48.160
Response IP 23.54.56.170
Found Yes
Hash 9a05f22a15148a07bc5292b5ace47b7ee5aa81d3264ed8359bacfe927d205f40
SimHash d93cf34053f4

Groups

gptbot
chatgpt-user
claudebot
google-extended
applebot-extended
anthropic-ai
cohere-ai
ccbot
icc-crawler
bytespider

Rule Path
Disallow /

petalbot
baiduspider
baiduimagespider

Rule Path
Disallow /

googlebot-image
googlebot-video

Rule Path
Disallow /
Disallow /archives/k/
Disallow /archives/r/
Disallow /das/k/
Disallow /das/r/
Disallow /gendai/k/
Disallow /gendai/r/
Disallow /heart-net/k/
Disallow /heart-net/r/
Disallow /kenko/k/
Disallow /kenko/r/
Disallow /learning/k/
Disallow /learning/r/
Disallow /learning-blog/r/
Disallow /news/k/
Disallow /news/r/
Disallow /nhkworld/k/
Disallow /nhkworld/r/
Disallow /politics/k/
Disallow /politics/r/
Disallow /recipe/k/
Disallow /recipe/r/
Disallow /shutoken/k/
Disallow /shutoken/r/
Allow /robots.txt
Allow /favicon.ico
Allow /*/lreport/
Allow /archives/
Allow /assets/
Allow /common/
Allow /das/
Allow /gendai/
Allow /heart-net/
Allow /kenko/
Allow /learning/
Allow /learning-blog/
Allow /news/
Allow /nhkworld/
Allow /politics/
Allow /prog/
Allow /program/
Allow /recipe/
Allow /shutoken/

twitterbot

Rule Path
Disallow /*/r/

*

Rule Path
Disallow /*.cgi$
Disallow /*.cgi?
Disallow /*/api/
Disallow /*/hc/
Disallow /*/k/
Disallow /*/r/
Disallow /*/shv/
Disallow /cgiblog/
Disallow /cgisearch/
Disallow /chronicle/
Disallow /error/
Disallow /hc-*
Disallow /hensei/
Disallow /saigai/*/dl/
Disallow /saigai/*/dn/
Disallow /saigai/*/ev/
Disallow /saigai/*/ss/
Disallow /saigai/f/
Disallow /topepg/
Disallow /toppage/errors/

Other Records

Field Value
sitemap https://www.nhk.or.jp/sitemap.xml