t.kn-online.de
robots.txt

Robots Exclusion Standard data for t.kn-online.de

Resource Scan

Scan Details

Site Domain t.kn-online.de
Base Domain kn-online.de
Scan Status Ok
Last Scan 2024-05-07T19:25:16+00:00
Next Scan 2024-06-06T19:25:16+00:00

Last Scan

Scanned 2024-05-07T19:25:16+00:00
URL https://t.kn-online.de/robots.txt
Redirect https://www.kn-online.de/robots.txt
Redirect Domain www.kn-online.de
Redirect Base kn-online.de
Domain IPs 193.30.60.245
Redirect IPs 23.211.140.147, 23.211.140.83, 2600:1413:b000:14::b857:c150, 2600:1413:b000:14::b857:c153
Response IP 42.99.140.154
Found Yes
Hash cb587c19079c3c49dd5c7e018214c8535fee559fe4ca62284d4ae96ac0d9725c
SimHash a330577cc9a1

Groups

*

Rule Path
Disallow /disabledFunctionsForCrawlers.chunk.js
Disallow /mandanten/
Disallow /mediabox/
Disallow /politik/politik-extern/
Disallow /wirtschaft/wirtschaft-extern
Disallow /suche/
Disallow /ellipsis-preview/
Disallow /pf/api/v3/
Disallow /zeitung/
Disallow /metaseiten/
Disallow /bundles/
Disallow /cms/
Disallow /security/
Disallow /newsletter/abmeldung/
Disallow /angebot/

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /
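
The groups above can be checked against concrete URLs with Python's standard `urllib.robotparser`. The sketch below rebuilds a robots.txt from the listed rules; the header casing and group order are assumptions, since the report shows only the scanner's normalised view, and `ExampleBot` and the `/lokales/` path are hypothetical names used for illustration.

```python
from urllib.robotparser import RobotFileParser

# Rules transcribed from the report; the original file's exact layout is
# an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /disabledFunctionsForCrawlers.chunk.js
Disallow: /mandanten/
Disallow: /mediabox/
Disallow: /politik/politik-extern/
Disallow: /wirtschaft/wirtschaft-extern
Disallow: /suche/
Disallow: /ellipsis-preview/
Disallow: /pf/api/v3/
Disallow: /zeitung/
Disallow: /metaseiten/
Disallow: /bundles/
Disallow: /cms/
Disallow: /security/
Disallow: /newsletter/abmeldung/
Disallow: /angebot/

User-agent: gptbot
Disallow: /

User-agent: ccbot
Disallow: /

User-agent: chatgpt-user
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(agent: str, url: str) -> bool:
    """Return True if the reconstructed rules permit `agent` to fetch `url`."""
    return parser.can_fetch(agent, url)


# Named AI crawlers are blocked site-wide; other agents fall back to the
# "*" group and are blocked only on the listed path prefixes.
for agent, url in [
    ("GPTBot", "https://www.kn-online.de/"),
    ("ExampleBot", "https://www.kn-online.de/suche/"),
    ("ExampleBot", "https://www.kn-online.de/lokales/"),
]:
    print(agent, url, is_allowed(agent, url))
```

This reflects only what the robots.txt rules express to a standards-following parser; the legal notice in the comments below applies independently of these rules.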

Comments

  • Legal notice: kn-online.de expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
  • The use of robots or other automated means to access kn-online.de or collect or mine data without the express permission of kn-online.de is strictly prohibited.
  • If you would like to apply for permission to crawl kn-online.de, collect or use data, please contact lizenzen@rnd.de