Robots Exclusion Standard data for teufelseinhorn.de

Resource Scan

Scan Details

Site Domain teufelseinhorn.de
Base Domain teufelseinhorn.de
Scan Status Ok
Last Scan 2026-01-12T04:26:26+00:00
Next Scan 2026-02-11T04:26:26+00:00

Last Scan

Scanned 2026-01-12T04:26:26+00:00
URL https://teufelseinhorn.de/robots.txt
Domain IPs 37.218.254.104
Response IP 37.218.254.104
Found Yes
Hash 3913fd431f63c2b772223a00312e8d24f34030155393ace70e7802292a3bfad5
SimHash d11c7481f4c3
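
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. A minimal sketch of computing such a digest, assuming SHA-256 over the raw response body; `body_digest` and `example_body` are hypothetical names, and the example body is not the real file:

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


# Placeholder content, NOT the actual teufelseinhorn.de robots.txt.
example_body = b"User-agent: *\nDisallow: /\n"
digest = body_digest(example_body)
print(len(digest), digest)
```

Comparing the digest of two scans is a cheap way to tell whether the file changed at all; the SimHash is presumably meant for near-duplicate detection instead.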

Groups

*

Rule Path
Disallow /
Disallow /guestbook/guestbook.php
Disallow /gb
Disallow /4images
Disallow /Galerie
Disallow /wbblite
Disallow /wbb2
Disallow /kontakt.html
Disallow /contact.html
Disallow /shoutbox
Disallow /images
Disallow /kontakt
Disallow /phpMyAdmin/
Disallow /cgi-bin/
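
Because the `*` group begins with `Disallow /`, the more specific paths that follow it are already covered by that rule. A minimal sketch of checking this with Python's standard `urllib.robotparser`, assuming the group is written with an ordinary `User-agent: *` line; the fragment below is reconstructed from the table above, is abbreviated, and is not the raw file, and `examplebot` is a hypothetical crawler name:

```python
from urllib.robotparser import RobotFileParser

# Reconstructed, abbreviated fragment of the "*" group (assumption, not the raw file).
ROBOTS_FRAGMENT = """\
User-agent: *
Disallow: /
Disallow: /gb
Disallow: /kontakt.html
Disallow: /cgi-bin/
"""


def is_allowed(agent: str, url: str) -> bool:
    """Parse the fragment and report whether `agent` may fetch `url`."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_FRAGMENT.splitlines())
    return parser.can_fetch(agent, url)


# "examplebot" is a hypothetical agent that falls back to the "*" group.
for path in ("/", "/gb", "/index.html"):
    print(path, is_allowed("examplebot", "https://teufelseinhorn.de" + path))
```

Under this parser every check is refused, including paths such as `/index.html` that are not listed individually.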

googlebot-image

Rule Path
Disallow /

gagarobot

Rule Path
Disallow /

webwhacker*

Rule Path
Disallow /

extractorpro*

Rule Path
Disallow /

webzip*

Rule Path
Disallow /

webstripper*

Rule Path
Disallow /

teleport*

Rule Path
Disallow /

net attache*

Rule Path
Disallow /

offline explorer*

Rule Path
Disallow /

sitesnagger*

Rule Path
Disallow /

webcopier*

Rule Path
Disallow /

httrack*

Rule Path
Disallow /

webcapture*

Rule Path
Disallow /

websauger*

Rule Path
Disallow /

emailcollector*

Rule Path
Disallow /

roverbot*

Rule Path
Disallow /

extractorpro*

Rule Path
Disallow /

wx_mail/2.000*

Rule Path
Disallow /
Disallow /

whowhere*

Rule Path
Disallow /

activeagent*

Rule Path
Disallow /

emailsiphon*

Rule Path
Disallow /

mozilla.*newt

Rule Path
Disallow /

crescent*

Rule Path
Disallow /

cherrypicker*

Rule Path
Disallow /

Comments

  • OLR
  • E-Mail Collector raus (German: "e-mail collectors out")

Warnings

  • 3 invalid lines.
  • `useragent` is not a known field.
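
Warnings like these can be reproduced with a small line checker. The sketch below is only an illustration: the scanner's real rules are not shown in this report, the set of known fields is an assumption based on commonly used robots.txt directives, and `lint_robots` and `sample` are hypothetical names.

```python
# Assumed set of recognised field names; the scanner's actual list is unknown.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def lint_robots(text: str) -> list[tuple[int, str]]:
    """Return (line number, message) pairs for lines that look malformed."""
    warnings = []
    for number, raw in enumerate(text.splitlines(), start=1):
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        if ":" not in line:
            warnings.append((number, "invalid line"))
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS:
            warnings.append((number, f"`{field}` is not a known field"))
    return warnings


# Hypothetical sample, not the real file: a misspelled field and a stray line.
sample = "useragent: cherrypicker*\nDisallow: /\nthis line has no colon\n"
for number, message in lint_robots(sample):
    print(number, message)
```

A misspelling such as `useragent` (without the hyphen) matters because a parser that ignores the line will not start a new group there, so the rules that follow may be attached to a different group than intended.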