tv4.se
robots.txt

Robots Exclusion Standard data for tv4.se

Resource Scan

Scan Details

Site Domain tv4.se
Base Domain tv4.se
Scan Status Ok
Last Scan2024-09-19T13:59:45+00:00
Next Scan 2024-09-26T13:59:45+00:00

Last Scan

Scanned2024-09-19T13:59:45+00:00
URL https://tv4.se/robots.txt
Redirect https://www.tv4.se:443/robots.txt
Redirect Domain www.tv4.se
Redirect Base tv4.se
Domain IPs 13.48.54.138, 13.49.132.72, 16.171.212.184
Redirect IPs 65.9.112.22, 65.9.112.51, 65.9.112.63, 65.9.112.82
Response IP 3.164.206.33
Found Yes
Hash e1cd1f8b5763f2f961468bc2f90b1702fcb5af03a4f3fe2025a7fc9916016172
SimHash a206d864a8a6

Groups

*

Rule Path
Disallow /rss/kb
Disallow /health
Disallow /8cbf4ebb-4570-4351-a0ad-45b19148e4de

grapeshot

Rule Path
Disallow

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.tv4.se/sitemap.xml

Comments

  • OpenAI
  • Google
  • Claude / Anthropic
  • Common Crawl
  • Facebook
  • webz.io