derpatriot.de
robots.txt

Robots Exclusion Standard data for derpatriot.de

Resource Scan

Scan Details

Site Domain derpatriot.de
Base Domain derpatriot.de
Scan Status Ok
Last Scan2024-11-05T21:28:27+00:00
Next Scan 2024-11-12T21:28:27+00:00

Last Scan

Scanned2024-11-05T21:28:27+00:00
URL https://derpatriot.de/robots.txt
Redirect https://www.derpatriot.de/robots.txt
Redirect Domain www.derpatriot.de
Redirect Base derpatriot.de
Domain IPs 193.158.241.16, 2003:63:e019:8241:250:56ff:fe84:5832
Redirect IPs 193.158.241.16, 2003:63:e019:8241:250:56ff:fe84:5832
Response IP 193.158.241.16
Found Yes
Hash dbb737df0f693083e600027596ef690ec72d9afc05ec429fc9ad254e76e5dea1
SimHash 81015002eb33

Groups

*
mozilla/5.0 (compatible; ogdwctxcrawler)

Rule Path
Allow /fileadmin/*pdf$
Allow /fileadmin/*PDF$
Allow /fileadmin/_processed_/
Allow /fileadmin/images/
Allow /fileadmin/templates/
Allow /typo3conf/ext/*/Resources/Public/
Allow /fileadmin/patriot_edv/
Disallow /fileadmin/
Disallow /typo3/
Disallow /typo3conf/
Disallow /service/index.php
Disallow /verlag/agb.html?type=1
Disallow /verlag/impressum.html?type=1
Disallow /onlinepass-kaufen.html
Disallow /seite-nicht-gefunden.html

webreaper
webcopier
offline explorer
httrack
microsoft.url.control
emailcollector
penthesilea

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.derpatriot.de/sitemap.xml
sitemap https://www.derpatriot.de/sitemap-news.xml
sitemap https://www.derpatriot.de/sitemap-article.xml

Comments

  • ===================================
  • Generator: http://pixelfolk.net/tools/robots
  • Erstellt am: 02.09.2021, 10:11
  • Webseite: https://www.derpatriot.de
  • ===================================
  • ===================================
  • Folgende Seiten sollen nicht indexiert werden:
  • ===================================
  • ===================================
  • Schließe folgende Spider komplett aus:
  • ===================================