dk-online.de
robots.txt

Robots Exclusion Standard data for dk-online.de

Resource Scan

Scan Details

Site Domain dk-online.de
Base Domain dk-online.de
Scan Status Ok
Last Scan2024-06-07T12:42:12+00:00
Next Scan 2024-06-14T12:42:12+00:00

Last Scan

Scanned2024-06-07T12:42:12+00:00
URL https://dk-online.de/robots.txt
Redirect https://www.dk-online.de/robots.txt
Redirect Domain www.dk-online.de
Redirect Base dk-online.de
Domain IPs 18.159.179.202, 18.193.59.2, 3.127.34.154
Redirect IPs 18.164.154.15, 18.164.154.56, 18.164.154.74, 18.164.154.88, 2600:9000:24b9:1c00:d:e9a5:af80:93a1, 2600:9000:24b9:2c00:d:e9a5:af80:93a1, 2600:9000:24b9:5c00:d:e9a5:af80:93a1, 2600:9000:24b9:8600:d:e9a5:af80:93a1, 2600:9000:24b9:9200:d:e9a5:af80:93a1, 2600:9000:24b9:9c00:d:e9a5:af80:93a1, 2600:9000:24b9:b200:d:e9a5:af80:93a1, 2600:9000:24b9:c200:d:e9a5:af80:93a1
Response IP 3.160.246.114
Found Yes
Hash 76523087d33e3c8f5e0a107d0dc0ef9cbb8dc6afa697608ad7a766c491085537
SimHash 23327d00ede3

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /pagebuilding/
Disallow /new-articles/
Disallow /files/
Disallow /cre-1.0/tracking/
Disallow

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.dk-online.de/sitemap.xml
sitemap https://www.dk-online.de/sitemap/googleNewsList.xml
sitemap https://www.dk-online.de/sitemap/artikel/sitemap-current.xml

Comments

  • Legal notice: dk-online.de expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
  • The use of robots or other automated means to access dk-online.de or collect or mine data without the express permission of dk-online.de is strictly prohibited.
  • If you would like to apply for permission to crawl dk-online.de, collect or use data, please contact info+nutzungsrecht@noz-digital.de