diewirtschaft.noz.de
robots.txt

Robots Exclusion Standard data for diewirtschaft.noz.de

Resource Scan

Scan Details

Site Domain diewirtschaft.noz.de
Base Domain noz.de
Scan Status Ok
Last Scan2024-05-21T12:22:41+00:00
Next Scan 2024-05-28T12:22:41+00:00

Last Scan

Scanned2024-05-21T12:22:41+00:00
URL https://diewirtschaft.noz.de/robots.txt
Redirect https://www.dk-online.de/robots.txt
Redirect Domain www.dk-online.de
Redirect Base dk-online.de
Domain IPs 18.159.179.202, 18.193.59.2, 3.127.34.154
Redirect IPs 2600:9000:260f:0:d:e9a5:af80:93a1, 2600:9000:260f:4e00:d:e9a5:af80:93a1, 2600:9000:260f:8a00:d:e9a5:af80:93a1, 2600:9000:260f:c800:d:e9a5:af80:93a1, 2600:9000:260f:cc00:d:e9a5:af80:93a1, 2600:9000:260f:e00:d:e9a5:af80:93a1, 2600:9000:260f:e200:d:e9a5:af80:93a1, 2600:9000:260f:ec00:d:e9a5:af80:93a1, 65.9.112.32, 65.9.112.36, 65.9.112.4, 65.9.112.65
Response IP 108.157.52.95
Found Yes
Hash 76523087d33e3c8f5e0a107d0dc0ef9cbb8dc6afa697608ad7a766c491085537
SimHash 23327d00ede3

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /pagebuilding/
Disallow /new-articles/
Disallow /files/
Disallow /cre-1.0/tracking/
Disallow

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.dk-online.de/sitemap.xml
sitemap https://www.dk-online.de/sitemap/googleNewsList.xml
sitemap https://www.dk-online.de/sitemap/artikel/sitemap-current.xml

Comments

  • Legal notice: dk-online.de expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
  • The use of robots or other automated means to access dk-online.de or collect or mine data without the express permission of dk-online.de is strictly prohibited.
  • If you would like to apply for permission to crawl dk-online.de, collect or use data, please contact info+nutzungsrecht@noz-digital.de