noz.de
robots.txt

Robots Exclusion Standard data for noz.de

Resource Scan

Scan Details

Site Domain noz.de
Base Domain noz.de
Scan Status Ok
Last Scan2024-05-30T05:49:46+00:00
Next Scan 2024-06-06T05:49:46+00:00

Last Scan

Scanned2024-05-30T05:49:46+00:00
URL https://noz.de/robots.txt
Redirect https://www.noz.de/robots.txt
Redirect Domain www.noz.de
Redirect Base noz.de
Domain IPs 18.159.179.202, 18.193.59.2, 3.127.34.154
Redirect IPs 18.65.3.110, 18.65.3.58, 18.65.3.82, 18.65.3.90, 2600:9000:25f2:1c00:19:82c2:c040:93a1, 2600:9000:25f2:4000:19:82c2:c040:93a1, 2600:9000:25f2:4e00:19:82c2:c040:93a1, 2600:9000:25f2:5000:19:82c2:c040:93a1, 2600:9000:25f2:5e00:19:82c2:c040:93a1, 2600:9000:25f2:8400:19:82c2:c040:93a1, 2600:9000:25f2:8800:19:82c2:c040:93a1, 2600:9000:25f2:a200:19:82c2:c040:93a1
Response IP 108.157.60.56
Found Yes
Hash 899caf1a66175d8230082536e7ca3d55fcbfd7944e3cb726f2991449066cdb45
SimHash 22305d004df1

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /pagebuilding/
Disallow /new-articles/
Disallow /files/
Disallow /cre-1.0/tracking/
Disallow

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.noz.de/sitemap.xml
sitemap https://www.noz.de/sitemap/googleNewsList.xml
sitemap https://www.noz.de/sitemap/artikel/sitemap-current.xml

Comments

  • Legal notice: noz.de expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
  • The use of robots or other automated means to access noz.de or collect or mine data without the express permission of noz.de is strictly prohibited.
  • If you would like to apply for permission to crawl noz.de, collect or use data, please contact info+nutzungsrecht@noz-digital.de