noz-cdn.de
robots.txt

Robots Exclusion Standard data for noz-cdn.de

Resource Scan

Scan Details

Site Domain noz-cdn.de
Base Domain noz-cdn.de
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-09-29T03:04:47+00:00
Next Scan 2024-12-28T03:04:47+00:00

Last Successful Scan

Scanned2023-03-09T20:32:57+00:00
URL https://noz-cdn.de/robots.txt
Redirect https://www.noz.de/robots.txt
Redirect Domain www.noz.de
Redirect Base noz.de
Domain IPs 178.15.48.197
Redirect IPs 18.161.111.18, 18.161.111.31, 18.161.111.35, 18.161.111.36, 2600:9000:2246:1800:19:82c2:c040:93a1, 2600:9000:2246:400:19:82c2:c040:93a1, 2600:9000:2246:5600:19:82c2:c040:93a1, 2600:9000:2246:5800:19:82c2:c040:93a1, 2600:9000:2246:600:19:82c2:c040:93a1, 2600:9000:2246:a200:19:82c2:c040:93a1, 2600:9000:2246:be00:19:82c2:c040:93a1, 2600:9000:2246:fe00:19:82c2:c040:93a1
Response IP 18.65.3.110
Found Yes
Hash ee28a30519ae97de2d00d5e48bdcada0d22837775cda6a51a5a69db5a2ccb73f
SimHash 2810dc04c551

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /pagebuilding/
Disallow /new-articles/
Disallow /files/
Disallow /cre-1.0/tracking/
Disallow

Other Records

Field Value
sitemap https://www.noz.de/sitemap.xml
sitemap https://www.noz.de/sitemap/googleNewsList.xml
sitemap https://www.noz.de/sitemap/artikel/sitemap-current.xml