nd-aktuell.de
robots.txt

Robots Exclusion Standard data for nd-aktuell.de

Resource Scan

Scan Details

Site Domain nd-aktuell.de
Base Domain nd-aktuell.de
Scan Status Ok
Last Scan2024-09-16T06:39:36+00:00
Next Scan 2024-09-23T06:39:36+00:00

Last Scan

Scanned2024-09-16T06:39:36+00:00
URL https://nd-aktuell.de/robots.txt
Domain IPs 2a01:30:0:505:2f4:75ff:fe1e:376f, 83.223.86.50
Response IP 83.223.86.50
Found Yes
Hash 06288f20b2401d1db4847516ab44798aea933ed51313742b4ec149928afb8263
SimHash 2f34dcf2c5d8

Groups

*

Rule Path
Disallow /artikel.asp
Disallow /bannerdeliver.php
Disallow /blobs/
Disallow /epub/
Disallow /leserbrief/
Disallow /profile/
Disallow /suche
Disallow /suche/
Disallow /tag-rss/
Disallow /tmp/
Disallow /user/
Disallow /weiteres/
Disallow /503.php

yahoo-newscrawler

Rule Path
Disallow

ia_archiver

Rule Path
Disallow /

Other Records

Field Value
crawl-delay 2

Other Records

Field Value
sitemap https://www.nd-aktuell.de/news-sitemap.xml.php
sitemap https://www.nd-aktuell.de/google-sitemap/index.xml

Comments

  • Alle robots
  • Extrawurst fuer Yahoo