tv.nrk.no
robots.txt

Robots Exclusion Standard data for tv.nrk.no

Resource Scan

Scan Details

Site Domain tv.nrk.no
Base Domain nrk.no
Scan Status Ok
Last Scan2025-03-22T23:00:05+00:00
Next Scan 2025-04-05T23:00:05+00:00

Last Scan

Scanned2025-03-22T23:00:05+00:00
URL https://tv.nrk.no/robots.txt
Domain IPs 23.52.171.122, 23.52.171.153, 2600:1413:1::48f7:7feb, 2600:1413:1::48f7:7ff2
Response IP 23.52.171.153
Found Yes
Hash f1a0b9e7aff2fff24377e492d067709356284dc57ddc3ab8a3389ab5a3c2e1c9
SimHash 30995a7ad361

Groups

*

Rule Path
Allow /
Disallow /auth*
Disallow /_auth*
Disallow /sok?*
Disallow /utvikler*
Disallow /oppdater*
Disallow /programmer/offline*
Disallow /video/nrk-tv_nrktv*
Disallow /serie/test-nrk-tv*
Disallow /program/TEST*
Disallow /program/*/nrks-testklipp
Disallow /programmer/test-page
Disallow /programmer/static-test-page
Disallow /programmer/cinematic-test-page
Disallow /serie/katastrofen-kielland
Disallow /serie/mordbrannen
Disallow /serie/norge-bak-fasaden
Disallow /serie/tv2-*
Disallow /atomic/*

amazonbot
applebot-extended
bytespider
ccbot
chatgpt-user
claude-web
claudebot
diffbot
facebookbot
gptbot
google-extended
imagesiftbot
meta-externalagent
newsnow
oai-searchbot
perplexitybot
scrapy
timpibot
webzio-extended
anthropic-ai
cohere-ai
meta-externalagent
news-please
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0

Rule Path
Disallow /

Comments

  • disallow auth routes
  • disallow custom routes
  • disallow nrktv4-9
  • disallow test content
  • disallow custom third party content
  • TODO: #4878 Disallowing test urls for atomic API, until it is ready
  • Disallowed user agents
  • NRK does not permit use of our content for the purpose of text and datamining,
  • including training of large language models or other artificial intelligence technology,
  • without express written permission from NRK.