sandmann.de
robots.txt

Robots Exclusion Standard data for sandmann.de

Resource Scan

Scan Details

Site Domain sandmann.de
Base Domain sandmann.de
Scan Status Ok
Last Scan2025-12-08T14:59:34+00:00
Next Scan 2026-01-07T14:59:34+00:00

Last Scan

Scanned2025-12-08T14:59:34+00:00
URL https://sandmann.de/robots.txt
Redirect https://www.sandmann.de/robots.txt
Redirect Domain www.sandmann.de
Redirect Base sandmann.de
Domain IPs 192.108.72.56
Redirect IPs 104.90.205.89, 2a02:26f0:9c00:386::4:b55a, 2a02:26f0:9c00:388::4:b55a, 88.221.213.107
Response IP 2.21.12.168
Found Yes
Hash 04c1d8815aa553813b63faf786da3c2b38b7481a886f08d2d1ae6a78bc084dca
SimHash f3954b56b5c6

Groups

*

Rule Path
Disallow /test/
Disallow /content/rbb/
Disallow /av/
Disallow /vorlagen/

googlebot

Rule Path
Disallow /*.zip$
Disallow /*.ics$
Disallow /test/
Disallow /content/rbb/
Disallow /av/

googlebot-image

Rule Path
Disallow /content/dam/temp/
Disallow /content/dam/rbb/rbb/fernsehen/programm/
Disallow /content/dam/rbb/test/

ia_archiver

Rule Path
Disallow /

teleport*

Rule Path
Disallow /

webwhacker*

Rule Path
Disallow /

webzip*

Rule Path
Disallow /

net attache*

Rule Path
Disallow /

sitesnagger*

Rule Path
Disallow /

httrack*

Rule Path
Disallow /

webcapture*

Rule Path
Disallow /

websauger*

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-user

Rule Path
Disallow /

claude-searchbot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

tiktokspider

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

cohere-training-data-crawler

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

duckassistbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

pangubot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

mistralai-user

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

chatgpt-user/2.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

perplexity-user

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

webzio-extended

Rule Path
Disallow /

youbot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

Comments

  • Disallow: /content/rbb/rbb/unternehmen/presse/audio-video/
  • Disallow: /content/rbb/rbb/unternehmen/presse/pressetermine/
  • Disallow: /content/rbb/rbb/av/unternehmen/presse/
  • keine AV-Objekte direkt
  • keine Vorlagen
  • Bilder jetzt zeigen
  • Alexa
  • Auch Sauger wollen wir sperren
  • Liste an Crawler (Modul SEO ARD). ARD-Beschluss. TOMS1-1288
  • Amazon
  • Anthropic
  • Apple
  • ByteDance
  • Cohere
  • Common Crawl
  • Diffbot
  • DuckDuckGo
  • Google
  • Huawei
  • Meta
  • Mistral
  • OpenAI
  • Perplexity
  • Webz.io
  • You.com
  • Zyte