tv4.se
robots.txt

Robots Exclusion Standard data for tv4.se

Resource Scan

Scan Details

Site Domain tv4.se
Base Domain tv4.se
Scan Status Ok
Last Scan 2024-11-14T14:06:51+00:00
Next Scan 2024-11-21T14:06:51+00:00

Last Scan

Scanned 2024-11-14T14:06:51+00:00
URL https://tv4.se/robots.txt
Redirect https://www.tv4.se:443/robots.txt
Redirect Domain www.tv4.se
Redirect Base tv4.se
Domain IPs 13.53.227.90, 16.16.43.47, 51.20.214.34
Redirect IPs 3.164.182.13, 3.164.182.45, 3.164.182.58, 3.164.182.96
Response IP 3.164.206.33
Found Yes
Hash e1cd1f8b5763f2f961468bc2f90b1702fcb5af03a4f3fe2025a7fc9916016172
SimHash a206d864a8a6

Groups

*

Rule Path
Disallow /rss/kb
Disallow /health
Disallow /8cbf4ebb-4570-4351-a0ad-45b19148e4de

grapeshot

Rule Path
Disallow

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /
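The groups above can be checked programmatically. The following is a minimal sketch, assuming the rules are rebuilt from the listing in this report rather than taken from the verbatim file (original casing, ordering, and comments are not reproduced, so the hash would not match). It uses Python's standard urllib.robotparser; the crawler name "ExampleCrawler" is hypothetical and stands in for any agent without its own group. The sitemap record from the report is included so the parser can return it.

```python
from urllib.robotparser import RobotFileParser

# Agents listed in the report with "Disallow /" (blocked from the whole site).
BLOCKED_AGENTS = [
    "gptbot", "chatgpt-user", "google-extended", "anthropic-ai",
    "claude-web", "ccbot", "facebookbot", "omgili", "omgilibot",
]

def build_robots_txt() -> str:
    """Rebuild an approximate robots.txt from the groups in the report.

    This is a reconstruction for illustration, not the original file.
    """
    lines = [
        "User-agent: *",
        "Disallow: /rss/kb",
        "Disallow: /health",
        "Disallow: /8cbf4ebb-4570-4351-a0ad-45b19148e4de",
        "",
        "User-agent: grapeshot",
        "Disallow:",  # empty path: nothing is disallowed for this agent
        "",
    ]
    for agent in BLOCKED_AGENTS:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    lines.append("Sitemap: https://www.tv4.se/sitemap.xml")
    return "\n".join(lines) + "\n"

parser = RobotFileParser()
parser.parse(build_robots_txt().splitlines())

# A blocked AI crawler may not fetch anything.
gptbot_home = parser.can_fetch("GPTBot", "https://www.tv4.se/")

# grapeshot has its own group with an empty Disallow, so the "*" rules
# do not apply to it.
grapeshot_health = parser.can_fetch("grapeshot", "https://www.tv4.se/health")

# An agent without its own group falls back to the "*" group.
other_health = parser.can_fetch("ExampleCrawler", "https://www.tv4.se/health")
other_home = parser.can_fetch("ExampleCrawler", "https://www.tv4.se/")

# site_maps() is available in Python 3.8 and later.
sitemaps = parser.site_maps()

print(gptbot_home, grapeshot_health, other_health, other_home, sitemaps)
```

Note that urllib.robotparser matches agent names by case-insensitive substring, so its answers can differ from crawlers that implement a stricter matching rule.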

Other Records

Field Value
sitemap https://www.tv4.se/sitemap.xml

Comments

  • OpenAI
  • Google
  • Claude / Anthropic
  • Common Crawl
  • Facebook
  • webz.io