nwz-online.de
robots.txt

Robots Exclusion Standard data for nwz-online.de

Resource Scan

Scan Details

Site Domain nwz-online.de
Base Domain nwz-online.de
Scan Status Ok
Last Scan2024-12-21T14:48:07+00:00
Next Scan 2025-01-20T14:48:07+00:00

Last Scan

Scanned2024-12-21T14:48:07+00:00
URL http://nwz-online.de/robots.txt
Redirect https://www.nwzonline.de/robots.txt
Redirect Domain www.nwzonline.de
Redirect Base nwzonline.de
Domain IPs 80.228.114.65
Redirect IPs 80.228.115.12
Response IP 80.228.115.12
Found Yes
Hash 63ee018443faceade7b36ab0ea0d7d474916d9c2797331b22b4cfd7f3e00b2ea
SimHash a2240b6249ff

Groups

*

Rule Path
Allow /

*

Rule Path
Disallow /applications/
Disallow /guide/
Disallow /newsapp/
Disallow /newsapp-api/
Disallow /politik-meldungen/
Disallow /wichtige-meldungen/
Disallow /kurzmeldungen/
Disallow /wirtschaft-meldungen/
Disallow /wissenschaft-meldungen/
Disallow /sport-meldungen/
Disallow /panorama-meldungen/
Disallow /kultur-meldungen/
Disallow /wohnen-meldungen/
Disallow /geld-meldungen/
Disallow /beruf-meldungen/
Disallow /ernaehrung-meldungen/
Disallow /gesundheit-meldungen/
Disallow /reise-meldungen/
Disallow /politikkeywords/

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Comments

  • Legal notice: nwzonline.de expressly reserves the right to use its content for commercial text and data mining (ยง 44 b UrhG).
  • The use of robots or other automated means to access nwzonline.de or collect or mine data without the express permission of nwzonline.de is strictly prohibited.
  • nwzonline.de may, in its discretion, permit certain automated access to certain nwzonline.de pages.
  • If you would like to apply for permission to crawl nwzonline.de, collect or use data, please email online@nwzmedien.de.