bayernstart.de
robots.txt

Robots Exclusion Standard data for bayernstart.de

Resource Scan

Scan Details

Site Domain bayernstart.de
Base Domain bayernstart.de
Scan Status Ok
Last Scan2025-08-26T17:38:19+00:00
Next Scan 2025-09-02T17:38:19+00:00

Last Scan

Scanned2025-08-26T17:38:19+00:00
URL http://bayernstart.de/robots.txt
Domain IPs 193.218.202.89
Response IP 193.218.202.89
Found Yes
Hash a483c002de69dc5de1899673e6dfad0f9edbb68a5a3e93374cb43136acac2d09
SimHash 63011b588b25

Groups

*

Rule Path
Disallow /sub/paywall/js/
Disallow /lightweight-ajax
Disallow /*?trafficsource
Disallow /suche/
Disallow /*?cmp=defrss
Disallow /test/
Disallow /west/
Disallow /fdn/bootstrap/
Disallow /bi/bootstrap/
Disallow /bi/doop/
Disallow /bi/dev/
Disallow /sso/
Disallow /*?utm_source=
Disallow /*?utm_medium=
Disallow /*?utm_campaign=

xovi

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /test/
Disallow /west/

gptbot

Rule Path
Allow /ueber-uns/
Disallow /

ccbot

Rule Path
Allow /ueber-uns/
Disallow /

msnbot

Rule Path
Disallow /test/
Disallow /west/

Other Records

Field Value
crawl-delay 5

amazonbot
anthropic-ai
applebot-extended
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
dataforseobot
facebookbot
google-extended
imagesiftbot
magpie-crawler
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.merkur.de/news.xml

Comments

  • robots.txt www.merkur.de
  • Legal notice: www.merkur.de expressly reserves the right to use its content for commercial text and data mining (ยง 44b UrhG).
  • The use of robots or other automated means to access www.merkur.de or collect or mine data without the express permission of www.merkur.de is strictly prohibited.