mdr.de
robots.txt

Robots Exclusion Standard data for mdr.de

Resource Scan

Scan Details

Site Domain mdr.de
Base Domain mdr.de
Scan Status Ok
Last Scan2025-08-03T15:00:09+00:00
Next Scan 2025-09-02T15:00:09+00:00

Last Scan

Scanned2025-08-03T15:00:09+00:00
URL https://mdr.de/robots.txt
Redirect https://www.mdr.de/robots.txt
Redirect Domain www.mdr.de
Redirect Base mdr.de
Domain IPs 193.22.36.128
Redirect IPs 104.81.108.72
Response IP 184.30.132.193
Found Yes
Hash 51d0dffe41bd2e42dc0935cad6748f3e6c947f970a0126707f7644291f398bef
SimHash 7f5ec91087d4

Groups

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-user

Rule Path
Disallow /

claude-searchbot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

tiktokspider

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

cohere-training-data-crawler

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

duckassistbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

pangubot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

mistralai-user

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

chatgpt-user/2.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

perplexity-user

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

webzio-extended

Rule Path
Disallow /

youbot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

teleport

Rule Path
Disallow /

webwhacker

Rule Path
Disallow /

webzip

Rule Path
Disallow /

net attache

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

httrack

Rule Path
Disallow /

webcapture

Rule Path
Disallow /

websauger

Rule Path
Disallow /

*

Rule Path
Disallow /*.xml$
Disallow /404/
Disallow /administratives/
Disallow /export/
Disallow /forum/
Disallow /kultur/rueckblick/
Disallow /medienproduktion/
Disallow /schulung/
Disallow /test/
Disallow /*-shorturl.json$
Disallow /*-shorturl*.json$
Disallow /*~fbia.html$
Disallow /mediathek/mediathek-suche--100_zc-*.html*
Disallow /mediathek/suche/mediathek-suche--100-default.html?*
Disallow /mediathek/audio-app/
Disallow /video/audio-app/
Disallow /resources/global/player/embed/
Disallow /mediathek/verbreitung/portal/
Disallow /video/verbreitung/portal/
Disallow /yourls/
Disallow /scripts4/
Disallow /chat/login.php
Disallow /CONT/teletext/*
Disallow /quizzes/
Allow /scripts4/tippspiel/$
Allow /*-podcast.xml$
Allow /*-rss.xml$
Allow /*-avFeed.xml$
Allow /*-sitemap.xml$
Allow /video/administratives/newestvideositemap-100-mediaRss.xml$
Allow /video/administratives/newestaudiositemap-100-mediaRss.xml$
Allow /sitemap-index-100.xml
Allow /administratives/*.css
Allow /administratives/*.js
Allow /administratives/*.png
Allow /administratives/*.jpg

Other Records

Field Value
sitemap https://www.mdr.de/news-sitemap.xml
sitemap https://www.mdr.de/sitemap-index-100.xml

Comments

  • Liste der geblockten KI-Bots
  • Stand: 13.05.2025
  • Liste an Crawler (Modul SEO ARD)
  • Amazon
  • Anthropic
  • Apple
  • ByteDance
  • Cohere
  • Common Crawl
  • Diffbot
  • DuckDuckGo
  • Google
  • Huawei
  • Meta
  • Mistral
  • OpenAI
  • Perplexity
  • Webz.io
  • You.com
  • Zyte