wirkochengut.de
robots.txt

Robots Exclusion Standard data for wirkochengut.de

Resource Scan

Scan Details

Site Domain wirkochengut.de
Base Domain wirkochengut.de
Scan Status Ok
Last Scan 2024-05-26T06:17:30+00:00
Next Scan 2024-06-02T06:17:30+00:00

Last Scan

Scanned 2024-05-26T06:17:30+00:00
URL http://wirkochengut.de/robots.txt
Redirect https://www.sueddeutsche.de/robots.txt
Redirect Domain www.sueddeutsche.de
Redirect Base sueddeutsche.de
Domain IPs 2a01:238:20a:202:1082::, 81.169.145.82
Redirect IPs 2600:9000:20a6:200:1e:b6b1:7b80:93a1, 2600:9000:20a6:2200:1e:b6b1:7b80:93a1, 2600:9000:20a6:3a00:1e:b6b1:7b80:93a1, 2600:9000:20a6:400:1e:b6b1:7b80:93a1, 2600:9000:20a6:6200:1e:b6b1:7b80:93a1, 2600:9000:20a6:a000:1e:b6b1:7b80:93a1, 2600:9000:20a6:b200:1e:b6b1:7b80:93a1, 2600:9000:20a6:c000:1e:b6b1:7b80:93a1, 99.84.238.149, 99.84.238.172, 99.84.238.180, 99.84.238.91
Response IP 108.157.52.49
Found Yes
Hash 3f66b4330a66fe3231da8ae632a38f2f555f840bb8564cacba76bf81521c92b7
SimHash 32104140689f
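
The report does not name the algorithm behind the Hash field. Its 64 hexadecimal digits are consistent with a SHA-256 digest of the fetched robots.txt body, so the sketch below assumes SHA-256 purely for illustration. The function name `fingerprint` and the sample bytes are hypothetical, and the snippet makes no attempt to reproduce the value shown above.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: SHA-256. The scan report does not state which hash it uses.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real file content.
sample = b"User-agent: *\nDisallow: /uss\n"
digest = fingerprint(sample)

# A single changed byte yields a different digest, which is how a scanner
# could detect that the file changed between two scans.
changed = fingerprint(sample + b"Disallow: /pay/piano/\n")
```

Comparing the digest from one scan with the digest from the next is a cheap way to tell whether the file changed at all. A similarity hash such as the SimHash field would be needed to judge how much it changed.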

Groups

*

Rule Path
Disallow /uss
Disallow /v1/subscriptioninfo
Disallow /cdn_sz_mob/live/iqadcontroller.js.gz
Disallow /cdn_sz/live/iqadcontroller.js.gz
Disallow /cre-1.0/tracking/*.js$
Disallow /text-to-speech/
Disallow /pay/piano/

backlink-check.de

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

fasterfox

Rule Path
Disallow /

linkextractorpro

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

openbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

searchpreview

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

seodat

Rule Path
Disallow /

seoengbot

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

url control

Rule Path
Disallow /

url_spider_pro

Rule Path
Disallow /

xovi

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

um-ic

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

gumgum bot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /20*
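
Several of the paths above use the `*` wildcard and the `$` end anchor (for example `/cre-1.0/tracking/*.js$` and `/20*`). The sketch below shows one way such patterns could be evaluated. It is a simplified illustration, not the matcher of any particular crawler: `GROUPS` holds only a subset of the listed groups, the names `pattern_to_regex` and `is_disallowed` are hypothetical, and user-agent matching is reduced to an exact lowercase lookup with a fallback to `*`. Since the file lists only Disallow rules, Allow precedence is not modeled.

```python
import re

# Subset of the groups listed above, keyed by lowercase user-agent token.
GROUPS = {
    "*": [
        "/uss",
        "/v1/subscriptioninfo",
        "/cre-1.0/tracking/*.js$",
        "/text-to-speech/",
        "/pay/piano/",
    ],
    "gptbot": ["/"],
    "amazonbot": ["/20*"],
}


def pattern_to_regex(path_pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$' the pattern is a prefix match.
    """
    anchored = path_pattern.endswith("$")
    body = path_pattern[:-1] if anchored else path_pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + regex + ("$" if anchored else ""))


def is_disallowed(user_agent, path):
    """Return True if any Disallow rule of the matching group covers path."""
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    return any(pattern_to_regex(rule).match(path) for rule in rules)


print(is_disallowed("GPTBot", "/politik/"))
print(is_disallowed("Amazonbot", "/2024/artikel"))
print(is_disallowed("SomeBot", "/cre-1.0/tracking/a.js"))
```

Under these assumptions, `amazonbot` is blocked only from paths that start with `/20`, while agents such as `gptbot` are blocked from the whole site. Note that Python's standard `urllib.robotparser` does not interpret `*` or `$` inside paths, which is why the sketch uses its own matcher.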

Comments

  • Robots.txt for sueddeutsche.de
  • www.robotstxt.org/
  • www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449
  • Exclude all other stuff for CRE tracking
  • Exclude SEO-Tools & SPAM-Bots
  • Uber Metrics
  • Google's generative AI crawlers
  • Legal notice: SZ.de expressly reserves the right to use its content for commercial text and data mining (§ 44 b UrhG).
  • The use of robots or other automated means to access SZ.de or collect or mine data without the express permission of SZ.de is strictly prohibited.
  • SZ.de may, in its discretion, permit certain automated access to certain SZ.de pages.
  • If you would like to apply for permission to crawl SZ.de, collect or use data, please email syndication@sz.de