swr3.de
robots.txt

Robots Exclusion Standard data for swr3.de

Resource Scan

Scan Details

Site Domain swr3.de
Base Domain swr3.de
Scan Status Ok
Last Scan 2025-08-06T14:26:52+00:00
Next Scan 2025-09-05T14:26:52+00:00

Last Scan

Scanned 2025-08-06T14:26:52+00:00
URL https://swr3.de/robots.txt
Redirect https://www.swr3.de:443/robots.txt
Redirect Domain www.swr3.de
Redirect Base swr3.de
Domain IPs 34.120.237.106
Redirect IPs 2a02:26f0:9c00:1aa::3121, 2a02:26f0:9c00:1b9::3121, 95.100.124.74
Response IP 104.83.87.163
Found Yes
Hash 97970fa3807062c455e4f3aca5cc1abe71fbe0191f0e81e64fb1f1c441a8697a
SimHash 271dc202a6d0

Groups

*

Rule Path
Disallow /api/
Disallow /reactions/
Disallow /search/suggest/
Disallow /cms/
Disallow /*~_currentSlide-*
Disallow /*?_pjax=%23content
Disallow /*?_pjax=%23fragment
Disallow /suche-104.html
Disallow /playlisten/index.html?*
Disallow /sendungen/index.html?*
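
Several paths in the `*` group use the `*` wildcard, which matches any run of characters inside the URL path and query. The following is a minimal sketch of how such patterns could be evaluated, assuming the common convention that a rule matches as a prefix and that a trailing `$` anchors the end. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and the sketch ignores `Allow` rules, longest-match precedence, and percent-encoding normalization.

```python
import re
from typing import Iterable

def rule_to_regex(path_pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex (hypothetical helper).

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, as a prefix of the URL path.
    """
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    # Escape the literal pieces and join them with '.*' where '*' appeared.
    regex = ".*".join(re.escape(piece) for piece in path_pattern.split("*"))
    return re.compile(regex + ("$" if anchored else ""))

def is_disallowed(url_path: str, disallow_rules: Iterable[str]) -> bool:
    """Return True if any Disallow pattern matches the start of url_path."""
    return any(rule_to_regex(rule).match(url_path) for rule in disallow_rules)

# Disallow paths of the '*' group, copied from the listing above.
STAR_GROUP = [
    "/api/",
    "/reactions/",
    "/search/suggest/",
    "/cms/",
    "/*~_currentSlide-*",
    "/*?_pjax=%23content",
    "/*?_pjax=%23fragment",
    "/suche-104.html",
    "/playlisten/index.html?*",
    "/sendungen/index.html?*",
]

# The example request paths below are illustrative, not taken from the scan.
print(is_disallowed("/api/items", STAR_GROUP))
print(is_disallowed("/aktuell/index.html?_pjax=%23content", STAR_GROUP))
print(is_disallowed("/playlisten/index.html", STAR_GROUP))
```

Under these assumptions, `/playlisten/index.html` without a query string is not blocked, because the rule requires a literal `?` after the file name.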

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-user

Rule Path
Disallow /

claude-searchbot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

tiktokspider

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

cohere-training-data-crawler

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

duckassistbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

pangubot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

mistralai-user

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

chatgpt-user/2.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

perplexity-user

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

webzio-extended

Rule Path
Disallow /

youbot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /
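
The groups above give named crawlers a blanket `Disallow /`, while all other agents fall back to the `*` group. A minimal sketch of that behavior, using Python's standard `urllib.robotparser` on a hand-written excerpt: the excerpt is reconstructed from the listing and is an assumption about the file's layout, not the original file, and the example paths are illustrative.

```python
from urllib.robotparser import RobotFileParser

# Excerpt reconstructed from the groups listed above (assumed layout).
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /api/
Disallow: /cms/

User-agent: gptbot
Disallow: /

User-agent: claudebot
Disallow: /
"""

group_parser = RobotFileParser()
group_parser.parse(ROBOTS_EXCERPT.splitlines())

def may_fetch(agent: str, path: str) -> bool:
    """Return True if the excerpt permits `agent` to fetch `path`."""
    return group_parser.can_fetch(agent, path)

# A named AI crawler is blocked everywhere.
print(may_fetch("gptbot", "/aktuell/"))
# An unlisted agent falls back to the '*' group.
print(may_fetch("Mozilla/5.0", "/aktuell/"))
print(may_fetch("Mozilla/5.0", "/api/v1"))
```

Note that `urllib.robotparser` matches rule paths as plain prefixes, so it would not interpret the `*` wildcards used in some paths of the `*` group.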

Other Records

Field Value
sitemap https://www.swr3.de/~sitemap/index.xml
sitemap https://www.swr3.de/~sitemap/aktuell/nachrichten/index.xml
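
The `sitemap` records are independent of any user-agent group. As a small sketch, `RobotFileParser.site_maps()` (available in Python 3.8 and later) can list them; the input lines below are rebuilt from the records above and are an assumption about how they appear in the file.

```python
from urllib.robotparser import RobotFileParser

# Lines rebuilt from the "Other Records" table above (assumed layout).
sitemap_lines = [
    "User-agent: *",
    "Disallow: /api/",
    "",
    "Sitemap: https://www.swr3.de/~sitemap/index.xml",
    "Sitemap: https://www.swr3.de/~sitemap/aktuell/nachrichten/index.xml",
]

sitemap_parser = RobotFileParser()
sitemap_parser.parse(sitemap_lines)

# site_maps() returns a list of sitemap URLs, or None if there are none.
sitemap_urls = sitemap_parser.site_maps() or []
for url in sitemap_urls:
    print(url)
```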

Comments

  • robots.txt for SWR3.de
  • As of: 2025-06-25 10:50 CEST
  • Disallow
  • SWR3
  • AI usage reservation (see https://gitlab.ard.de/modul-12-seo/nutzungsvorbehalt/-/raw/master/robots.txt)
  • Amazon
  • Anthropic
  • Apple
  • ByteDance
  • Cohere
  • Common Crawl
  • Diffbot
  • DuckDuckGo
  • Google
  • Huawei
  • Meta
  • Mistral
  • OpenAI
  • Perplexity
  • Webz.io
  • You.com
  • Zyte
  • Sitemaps

Warnings

  • `host` is not a known field.