sz.de
robots.txt

Robots Exclusion Standard data for sz.de

Resource Scan

Scan Details

Site Domain sz.de
Base Domain sz.de
Scan Status Ok
Last Scan 2024-05-14T13:32:52+00:00
Next Scan 2024-05-21T13:32:52+00:00

Last Scan

Scanned 2024-05-14T13:32:52+00:00
URL https://sz.de/robots.txt
Redirect https://www.sueddeutsche.de/robots.txt
Redirect Domain www.sueddeutsche.de
Redirect Base sueddeutsche.de
Domain IPs 195.50.177.61
Redirect IPs 13.33.21.127, 13.33.21.23, 13.33.21.68, 13.33.21.72, 2600:9000:2363:2000:1e:b6b1:7b80:93a1, 2600:9000:2363:5c00:1e:b6b1:7b80:93a1, 2600:9000:2363:6400:1e:b6b1:7b80:93a1, 2600:9000:2363:7e00:1e:b6b1:7b80:93a1, 2600:9000:2363:8600:1e:b6b1:7b80:93a1, 2600:9000:2363:a00:1e:b6b1:7b80:93a1, 2600:9000:2363:bc00:1e:b6b1:7b80:93a1, 2600:9000:2363:f400:1e:b6b1:7b80:93a1
Response IP 18.165.171.47
Found Yes
Hash 3fd82d8355004822e375d73989d4a7c7fa25be09549339b3c963d2f59db2d6ac
SimHash 32000100689f

Groups

*

Rule Path
Disallow /uss
Disallow /v1/subscriptioninfo
Disallow /cdn_sz_mob/live/iqadcontroller.js.gz
Disallow /cdn_sz/live/iqadcontroller.js.gz
Disallow /cre-1.0/tracking/*.js$
Disallow /text-to-speech/
Disallow /pay/piano/
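
How the paths in the * group above could be evaluated is sketched below. This is a minimal illustration, assuming the common wildcard conventions in which `*` matches any run of characters, a trailing `$` anchors the end of the path, and every other rule is a plain prefix match. The helper names `rule_to_regex` and `is_disallowed` are hypothetical and not part of the scan or of any crawler.

```python
import re

# Disallow paths copied from the "*" group of the scan.
DISALLOW_RULES = [
    "/uss",
    "/v1/subscriptioninfo",
    "/cdn_sz_mob/live/iqadcontroller.js.gz",
    "/cdn_sz/live/iqadcontroller.js.gz",
    "/cre-1.0/tracking/*.js$",
    "/text-to-speech/",
    "/pay/piano/",
]


def rule_to_regex(path_pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any characters, a trailing '$'
    anchors the end of the path, anything else is a prefix match.
    """
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in path_pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any rule of the '*' group matches the path."""
    # re.match only matches at the start, which gives prefix semantics.
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW_RULES)
```

Under these assumptions `/cre-1.0/tracking/foo.js` is disallowed, while `/cre-1.0/tracking/foo.js?x=1` is not, because the `$` requires the path to end in `.js`. Note also that `/uss` is a prefix rule, so it covers every path that merely begins with those characters.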

backlink-check.de

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

fasterfox

Rule Path
Disallow /

linkextractorpro

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

openbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

searchpreview

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

seodat

Rule Path
Disallow /

seoengbot

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

url control

Rule Path
Disallow /

url_spider_pro

Rule Path
Disallow /

xovi

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

um-ic

Rule Path
Disallow /

google-extended

Rule Path
Disallow /
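
The group selection shown above (a named crawler gets its own group, every other crawler falls back to *) can be tried out with Python's standard `urllib.robotparser`. The sketch below is only an illustration: `ROBOTS_TXT` is an abbreviated reconstruction from the scan tables, not the verbatim file, and the helper `allowed` is hypothetical. The wildcard rule is left out because `urllib.robotparser` does plain prefix matching and does not interpret `*` or `$` in paths.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated reconstruction of the scanned groups (assumption: the
# real file lists more groups and more paths than shown here).
ROBOTS_TXT = """\
User-agent: *
Disallow: /uss
Disallow: /text-to-speech/
Disallow: /pay/piano/

User-agent: gptbot
Disallow: /

User-agent: semrushbot
Disallow: /

User-agent: google-extended
Disallow: /
"""

parser = RobotFileParser()
# parse() accepts an iterable of lines, so no network access is needed.
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Ask the parser whether `agent` may fetch `path` on the redirect host."""
    return parser.can_fetch(agent, "https://www.sueddeutsche.de" + path)


if __name__ == "__main__":
    for agent in ("GPTBot", "Google-Extended", "SomeOtherBot"):
        print(agent, allowed(agent, "/politik/"))
```

A crawler with its own group is blocked everywhere, whereas an unlisted crawler is only kept out of the paths in the * group. Agent names are compared case-insensitively by this parser, which is why `GPTBot` matches the `gptbot` group.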

Comments

  • Robots.txt for sueddeutsche.de
  • www.robotstxt.org/
  • www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449
  • Exclude all other stuff for CRE tracking
  • Exclude SEO-Tools & SPAM-Bots
  • Uber Metrics
  • Google's generative AI crawlers
  • Legal notice: SZ.de expressly reserves the right to use its content for commercial text and data mining (§ 44 b UrhG).
  • The use of robots or other automated means to access SZ.de or collect or mine data without the express permission of SZ.de is strictly prohibited.
  • SZ.de may, in its discretion, permit certain automated access to certain SZ.de pages.
  • If you would like to apply for permission to crawl SZ.de, collect or use data, please email syndication@sz.de