schaumburger-zeitung.de
robots.txt

Robots Exclusion Standard data for schaumburger-zeitung.de

Resource Scan

Scan Details

Site Domain schaumburger-zeitung.de
Base Domain schaumburger-zeitung.de
Scan Status Ok
Last Scan2024-06-23T18:07:15+00:00
Next Scan 2024-06-30T18:07:15+00:00

Last Scan

Scanned2024-06-23T18:07:15+00:00
URL http://schaumburger-zeitung.de/robots.txt
Redirect https://www.szlz.de/robots.txt
Redirect Domain www.szlz.de
Redirect Base szlz.de
Domain IPs 2a01:488:42:1000:50ed:850a:5a:4b9a, 80.237.133.10
Redirect IPs 184.87.193.74, 184.87.193.83, 2600:1413:b000:13::b857:c18b, 2600:1413:b000:13::b857:c18f
Response IP 42.99.140.139
Found Yes
Hash 9c0b55ad474cb569b6cc70d68ebc63e1b701a8aaa8c0393d95ae353949db76f2
SimHash a334176c8da1

Groups

*

Rule Path
Disallow /disabledFunctionsForCrawlers.chunk.js
Disallow /mandanten/
Disallow /mediabox/
Disallow /politik/politik-extern/
Disallow /wirtschaft/wirtschaft-extern
Disallow /suche/
Disallow /ellipsis-preview/
Disallow /pf/api/v3/
Disallow /zeitung/
Disallow /metaseiten/
Disallow /var/storage
Disallow /var/storage/*
Disallow /bundles/
Disallow /cms/
Disallow /security/
Disallow /newsletter/abmeldung/
Disallow /angebot/

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

Comments

  • Legal notice: szlz.de expressly reserves the right to use its content for commercialtext and data mining (ยง 44b UrhG).
  • The use of robots or other automated means to access szlz.de or collect or minedata without the express permission of szlz.de is strictly prohibited.
  • If you would like to apply for permission to crawl szlz.de, collect or use data, please contact lizenzen@rnd.de