neue-oz.de
robots.txt

Robots Exclusion Standard data for neue-oz.de

Resource Scan

Scan Details

Site Domain neue-oz.de
Base Domain neue-oz.de
Scan Status Ok
Last Scan2024-08-24T00:00:18+00:00
Next Scan 2024-09-23T00:00:18+00:00

Last Scan

Scanned2024-08-24T00:00:18+00:00
URL http://neue-oz.de/robots.txt
Redirect https://www.noz.de/robots.txt
Redirect Domain www.noz.de
Redirect Base noz.de
Domain IPs 62.116.130.8
Redirect IPs 13.226.2.126, 13.226.2.19, 13.226.2.43, 13.226.2.95, 2600:9000:23d1:2200:19:82c2:c040:93a1, 2600:9000:23d1:2400:19:82c2:c040:93a1, 2600:9000:23d1:3000:19:82c2:c040:93a1, 2600:9000:23d1:5800:19:82c2:c040:93a1, 2600:9000:23d1:7c00:19:82c2:c040:93a1, 2600:9000:23d1:ba00:19:82c2:c040:93a1, 2600:9000:23d1:ea00:19:82c2:c040:93a1, 2600:9000:23d1:ee00:19:82c2:c040:93a1
Response IP 13.226.2.126
Found Yes
Hash 899caf1a66175d8230082536e7ca3d55fcbfd7944e3cb726f2991449066cdb45
SimHash 22305d004df1

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /pagebuilding/
Disallow /new-articles/
Disallow /files/
Disallow /cre-1.0/tracking/
Disallow

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.noz.de/sitemap.xml
sitemap https://www.noz.de/sitemap/googleNewsList.xml
sitemap https://www.noz.de/sitemap/artikel/sitemap-current.xml

Comments

  • Legal notice: noz.de expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
  • The use of robots or other automated means to access noz.de or collect or mine data without the express permission of noz.de is strictly prohibited.
  • If you would like to apply for permission to crawl noz.de, collect or use data, please contact info+nutzungsrecht@noz-digital.de