tb.no
robots.txt

Robots Exclusion Standard data for tb.no

Resource Scan

Scan Details

Site Domain tb.no
Base Domain tb.no
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-10-23T05:21:53+00:00
Next Scan 2025-11-22T05:21:53+00:00

Last Successful Scan

Scanned2025-09-23T16:16:38+00:00
URL https://tb.no/robots.txt
Redirect https://www.tb.no/robots.txt
Redirect Domain www.tb.no
Redirect Base tb.no
Domain IPs 2a02:c0:ac::e51:1, 87.238.38.1, 87.238.38.2
Redirect IPs 104.18.22.107, 104.18.23.107, 2606:4700::6812:166b, 2606:4700::6812:176b
Response IP 104.18.23.107
Found Yes
Hash 353abad17ec1bf99fe1c5ab411d3df9a9eb741a182db3cd25389c28faecfad7e
SimHash 4148c851e171

Groups

bizinformasjon

Rule Path
Disallow /

*

Rule Path
Allow /ads.txt
Disallow /eksport
Disallow /popup
Disallow /jsp
Disallow /apps/pbcs.dll
Disallow /kultur/kulturkalender
Disallow /kultur/hva-skjer
Disallow /kultur/ut-guiden
Disallow /kultur/ut-i-dag
Disallow /kulturkalenderen
Disallow /kommentarliste*
Disallow /polopoly/JSON-RPC/
Disallow /polopoly/CM/

Other Records

Field Value
crawl-delay 5

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

Comments

  • Robots.txt
  • Be nice.
  • Agenda
  • Polopoly
  • Start AI crawler block
  • End AI crawler block