sueddeutsche.me
robots.txt

Robots Exclusion Standard data for sueddeutsche.me

Resource Scan

Scan Details

Site Domain sueddeutsche.me
Base Domain sueddeutsche.me
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-07-12T06:03:19+00:00
Next Scan 2024-10-10T06:03:19+00:00

Last Successful Scan

Scanned 2022-11-27T10:20:28+00:00
URL https://sueddeutsche.me/robots.txt
Response IP 104.21.86.94, 172.67.217.105
Found Yes
Hash 356ac8595d5af998fe9a228c6231975a374ff451528840061a783824ff7ab5d3
SimHash 105d93424a12

Groups

*

Rule Path
Disallow /cre-1.0/tracking/*.js$

*

Rule Path
Disallow /uss/

*

Rule Path
Disallow /text-to-speech/

backlink-check.de

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

fasterfox

Rule Path
Disallow /

linkextractorpro

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

openbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

searchpreview

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

seodat

Rule Path
Disallow /

seoengbot

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

url control

Rule Path
Disallow /

url_spider_pro

Rule Path
Disallow /

xovi

Rule Path
Disallow /

um-ic

Rule Path
Disallow /
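
The groups above can be read as a small rule table. The following is a minimal sketch of how a crawler might evaluate them. It assumes the three `*` groups are merged into one (as RFC 9309 describes for repeated groups), that `*` in a path matches any run of characters, and that a trailing `$` anchors the end of the path. The names `GROUPS`, `pattern_to_regex`, and `is_allowed`, the sample user-agent strings, and the sample paths are hypothetical and are not taken from the scanned file. User-agent matching here is a plain case-insensitive substring test, which is a simplification.

```python
import re

# Agents that the scan lists with "Disallow /".
BLOCKED_AGENTS = [
    "backlink-check.de", "backlinkcrawler", "extractorpro", "fasterfox",
    "linkextractorpro", "linkwalker", "mj12bot", "openbot", "rogerbot",
    "searchpreview", "semrushbot", "seodat", "seoengbot", "seokicks-robot",
    "sistrix", "true_robot", "url control", "url_spider_pro", "xovi", "um-ic",
]

# Rule table: the three "*" groups are merged into a single entry (assumption).
GROUPS = {"*": ["/cre-1.0/tracking/*.js$", "/uss/", "/text-to-speech/"]}
GROUPS.update({agent: ["/"] for agent in BLOCKED_AGENTS})


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex matched from the path start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal parts and let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(user_agent: str, path: str) -> bool:
    """Return False if any Disallow rule for the matching group covers the path."""
    agent = user_agent.lower()
    # Prefer a named group whose token appears in the user-agent string,
    # otherwise fall back to the "*" group.
    rules = next(
        (paths for name, paths in GROUPS.items() if name != "*" and name in agent),
        GROUPS["*"],
    )
    # The scanned file contains only Disallow rules, so any match blocks the path.
    return not any(pattern_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    print(is_allowed("Mozilla/5.0", "/uss/login"))
    print(is_allowed("Mozilla/5.0 (compatible; MJ12bot/v1.4.8)", "/"))
```

Because the file lists no Allow rules, the sketch skips the longest-match tie-breaking that a full implementation would need.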

Comments

  • Robots.txt for sueddeutsche.de
  • www.robotstxt.org/
  • www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449
  • Exclude all other stuff for CRE tracking
  • Exclude tts Files
  • Exclude SEO-Tools & SPAM-Bots
  • Uber Metrics