Robots Exclusion Standard data for helmstedt.de

Resource Scan

Scan Details

Site Domain helmstedt.de
Base Domain helmstedt.de
Scan Status Ok
Last Scan 2026-01-20T02:19:20+00:00
Next Scan 2026-02-19T02:19:20+00:00

Last Scan

Scanned 2026-01-20T02:19:20+00:00
URL https://helmstedt.de/robots.txt
Redirect https://www.helmstedt.de/robots.txt
Redirect Domain www.helmstedt.de
Redirect Base helmstedt.de
Domain IPs 2a07:aa00:0:12:162::1, 62.67.46.162
Redirect IPs 2a07:aa00:0:12:162::1, 62.67.46.162
Response IP 62.67.46.162
Found Yes
Hash 7bab5acc3518f53009b31a7080820a29bfc7fc115b47181f003f31ec4483685f
SimHash c215d5241757
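
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the scan does not name the algorithm. The following is a minimal sketch under that assumption; `sha256_hex` and the `sample` body are hypothetical and are not the actual file content, so the digest printed here will not match the value listed above.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a robots.txt body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, NOT the real content of the scanned file.
sample = b"User-agent: psbot\nDisallow: /\n"
digest = sha256_hex(sample)
print(len(digest), digest)
```

To compare against the scan, the function would be applied to the raw bytes returned from the redirect target, assuming the scanner hashes the unmodified response body.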

Groups

emailcollector

Rule Path
Disallow /

gagarobot

Rule Path
Disallow /

vscooter

Rule Path
Disallow /

roverbot*

Rule Path
Disallow /

mirago

Rule Path
Disallow /

psbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

webcapture*

Rule Path
Disallow /

websauger*

Rule Path
Disallow /

teleport*

Rule Path
Disallow /

webwhacker*

Rule Path
Disallow /

webzip*

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

net attache*

Rule Path
Disallow /

webreaper*

Rule Path
Disallow /

sitesnagger*

Rule Path
Disallow /

httrack*

Rule Path
Disallow /
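
The groups above can be sketched as a robots.txt file and checked with Python's standard `urllib.robotparser`. This is an illustration built only from the listed groups, not the actual file served by the site, which may contain comments or directives the scan does not show. `AGENTS`, `build_robots_txt`, `make_parser`, and the `examplebot` user agent are hypothetical names introduced for this example.

```python
from urllib.robotparser import RobotFileParser

# User-agent tokens as listed in the scan; each group has one "Disallow: /" rule.
AGENTS = [
    "emailcollector", "gagarobot", "vscooter", "roverbot*", "mirago",
    "psbot", "msiecrawler", "webcapture*", "websauger*", "teleport*",
    "webwhacker*", "webzip*", "webcopier", "net attache*", "webreaper*",
    "sitesnagger*", "httrack*",
]


def build_robots_txt(agents):
    """Render one 'User-agent' group with 'Disallow: /' per agent token."""
    return "\n\n".join(f"User-agent: {a}\nDisallow: /" for a in agents) + "\n"


def make_parser(text):
    """Parse robots.txt text without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


robots_txt = build_robots_txt(AGENTS)
parser = make_parser(robots_txt)

# A listed agent is blocked from the whole site; an unlisted one is not,
# because the listed groups include no catch-all "*" group.
psbot_allowed = parser.can_fetch("psbot", "https://www.helmstedt.de/")
other_allowed = parser.can_fetch("examplebot", "https://www.helmstedt.de/")
print(psbot_allowed, other_allowed)
```

How a trailing `*` in a token such as `httrack*` is interpreted depends on the crawler. `urllib.robotparser` compares tokens by plain substring match and does not treat it as a wildcard, so the example only checks a token without one.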