quiz.sueddeutsche.de
robots.txt

Robots Exclusion Standard data for quiz.sueddeutsche.de

Resource Scan

Scan Details

Site Domain quiz.sueddeutsche.de
Base Domain sueddeutsche.de
Scan Status Ok
Last Scan2024-06-22T19:49:55+00:00
Next Scan 2024-06-29T19:49:55+00:00

Last Scan

Scanned2024-06-22T19:49:55+00:00
URL https://quiz.sueddeutsche.de/robots.txt
Redirect https://www.sueddeutsche.de/robots.txt
Redirect Domain www.sueddeutsche.de
Redirect Base sueddeutsche.de
Domain IPs 116.202.155.172
Redirect IPs 18.238.217.112, 18.238.217.47, 18.238.217.52, 18.238.217.62, 2600:9000:246b:1000:1e:b6b1:7b80:93a1, 2600:9000:246b:6000:1e:b6b1:7b80:93a1, 2600:9000:246b:9400:1e:b6b1:7b80:93a1, 2600:9000:246b:a000:1e:b6b1:7b80:93a1, 2600:9000:246b:a600:1e:b6b1:7b80:93a1, 2600:9000:246b:ae00:1e:b6b1:7b80:93a1, 2600:9000:246b:da00:1e:b6b1:7b80:93a1, 2600:9000:246b:e000:1e:b6b1:7b80:93a1
Response IP 18.165.171.100
Found Yes
Hash 3f66b4330a66fe3231da8ae632a38f2f555f840bb8564cacba76bf81521c92b7
SimHash 32104140689f

Groups

*

Rule Path
Disallow /uss
Disallow /v1/subscriptioninfo
Disallow /cdn_sz_mob/live/iqadcontroller.js.gz
Disallow /cdn_sz/live/iqadcontroller.js.gz
Disallow /cre-1.0/tracking/*.js$
Disallow /text-to-speech/
Disallow /pay/piano/

backlink-check.de

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

fasterfox

Rule Path
Disallow /

linkextractorpro

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

openbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

searchpreview

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

seodat

Rule Path
Disallow /

seoengbot

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

url control

Rule Path
Disallow /

url_spider_pro

Rule Path
Disallow /

xovi

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

um-ic

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

gumgum bot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /20*

Comments

  • Robots.txt for sueddeutsche.de
  • www.robotstxt.org/
  • www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449
  • Exclude all other stuff for CRE tracking
  • Exclude SEO-Tools & SPAM-Bots
  • Uber Metrics
  • Googles generative AI crawlers
  • Legal notice: SZ.de expressly reserves the right to use its content for commercial text and data mining (ยง 44 b UrhG).
  • The use of robots or other automated means to access SZ.de or collect or mine data without
  • the express permission of SZ.de is strictly prohibited.
  • SZ.de may, in its discretion, permit certain automated access to certain SZ.de pages,
  • If you would like to apply for permission to crawl SZ.de, collect or use data, please email syndication@sz.de