nw.de
robots.txt

Robots Exclusion Standard data for nw.de

Resource Scan

Scan Details

Site Domain nw.de
Base Domain nw.de
Scan Status Ok
Last Scan 2024-06-23T05:31:43+00:00
Next Scan 2024-06-30T05:31:43+00:00

Last Scan

Scanned 2024-06-23T05:31:43+00:00
URL https://nw.de/robots.txt
Redirect https://www.nw.de/robots.txt
Redirect Domain www.nw.de
Redirect Base nw.de
Domain IPs 77.235.162.173
Redirect IPs 77.235.162.173
Response IP 77.235.162.173
Found Yes
Hash 7fdaaab075ab2b90b454721bd50e40ea1085f8ec285cc4d6854397c6737b9785
SimHash 1ac45604d517

Groups

ccbot
gptbot
chatgpt-user
google-extended

Rule Path
Disallow /
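
The group above can be checked with Python's standard urllib.robotparser. This is a minimal sketch: the ROBOTS_LINES text is an assumed reconstruction of how this group might appear in the file (the scan lists only the parsed agents and rule), and the single "Disallow: /suche" rule under "*" is a shortened stand-in for the full wildcard group.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the scanned groups; the real file's exact
# layout, casing, and comments are not shown in the scan.
ROBOTS_LINES = [
    "User-agent: ccbot",
    "User-agent: gptbot",
    "User-agent: chatgpt-user",
    "User-agent: google-extended",
    "Disallow: /",
    "",
    "User-agent: *",
    "Disallow: /suche",  # shortened stand-in for the wildcard group
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# Hypothetical example URL on the scanned host.
url = "https://www.nw.de/nachrichten/"

# Agents named in the first group are blocked from every path.
blocked = {
    agent: parser.can_fetch(agent, url)
    for agent in ("CCBot", "GPTBot", "ChatGPT-User", "Google-Extended")
}

# Any other agent falls through to the "*" group.
other_allowed = parser.can_fetch("ExampleBot", url)
```

Agent names are compared case-insensitively, so "GPTBot" matches the "gptbot" entry listed in the scan.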

*

Rule Path
Disallow /tagsuche
Disallow /_em_cms/
Disallow /cms7/
Disallow /frage/
Disallow /microsites/
Disallow /suche
Disallow /profil/
Disallow /nachrichten/wirtschaft/wirtschaftsnewsletter/
Disallow /microsites/mediabox/
Disallow /anzeigen/native_ads/
Disallow /anzeigen/native_ads_oms/
Allow /_em_cms/globals/csslibs.php
Allow /_em_cms/globals/jslibs.php
Allow /_em_cms/globals/acon.php
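
The three Allow rules only take effect if a crawler resolves conflicts by the most specific (longest) matching path, as RFC 9309 describes, because each of them sits under the broader "Disallow /_em_cms/". The sketch below illustrates that precedence for the "*" group. The function name is_allowed and the example paths are hypothetical, and plain prefix matching is sufficient only because none of these rules uses wildcards.

```python
# Rules of the "*" group, copied from the scan as (kind, path) pairs.
RULES = [
    ("disallow", "/tagsuche"),
    ("disallow", "/_em_cms/"),
    ("disallow", "/cms7/"),
    ("disallow", "/frage/"),
    ("disallow", "/microsites/"),
    ("disallow", "/suche"),
    ("disallow", "/profil/"),
    ("disallow", "/nachrichten/wirtschaft/wirtschaftsnewsletter/"),
    ("disallow", "/microsites/mediabox/"),
    ("disallow", "/anzeigen/native_ads/"),
    ("disallow", "/anzeigen/native_ads_oms/"),
    ("allow", "/_em_cms/globals/csslibs.php"),
    ("allow", "/_em_cms/globals/jslibs.php"),
    ("allow", "/_em_cms/globals/acon.php"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        is_allow = kind == "allow"
        # The longest matching path wins; on a tie, Allow wins.
        if len(prefix) > best_len or (len(prefix) == best_len and is_allow):
            best_len = len(prefix)
            allowed = is_allow
    return allowed


# Hypothetical paths used for illustration.
css_ok = is_allowed("/_em_cms/globals/csslibs.php")
cms_ok = is_allowed("/_em_cms/admin")
news_ok = is_allowed("/nachrichten/")
```

Parsers that apply rules in file order instead would stop at "Disallow /_em_cms/" and block the three allowed scripts, so the result depends on the crawler's matching strategy.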

Other Records

Field Value
sitemap https://www.nw.de/sitemap_nw_index.xml.gz
sitemap https://www.nw.de/sitemap_nw_index_news.xml.gz
sitemap https://www.nw.de/_retresco/sitemap/index.xml
sitemap https://www.nw.de/_events/sitemap/sitemap_event.xml
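
The sitemap records can be read back with the same standard-library parser. This is a sketch that assumes the records appear as ordinary "Sitemap:" lines in the file; site_maps() requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumed line format; the URLs are copied from the records above.
ROBOTS_LINES = [
    "Sitemap: https://www.nw.de/sitemap_nw_index.xml.gz",
    "Sitemap: https://www.nw.de/sitemap_nw_index_news.xml.gz",
    "Sitemap: https://www.nw.de/_retresco/sitemap/index.xml",
    "Sitemap: https://www.nw.de/_events/sitemap/sitemap_event.xml",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# site_maps() returns None when no sitemap record is present.
sitemaps = parser.site_maps() or []

for sitemap_url in sitemaps:
    print(sitemap_url)
```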