luise-berlin.com
robots.txt

Robots Exclusion Standard data for luise-berlin.com

Resource Scan

Scan Details

Site Domain luise-berlin.com
Base Domain luise-berlin.com
Scan Status Ok
Last Scan 2025-05-06T03:51:30+00:00
Next Scan 2025-06-05T03:51:30+00:00

Last Scan

Scanned 2025-05-06T03:51:30+00:00
URL https://luise-berlin.com/robots.txt
Redirect https://www.luise-berlin.com/robots.txt
Redirect Domain www.luise-berlin.com
Redirect Base luise-berlin.com
Domain IPs 85.13.140.49
Redirect IPs 85.13.140.49
Response IP 85.13.140.49
Found Yes
Hash ce86fb612b575de832f436b80daf368d92181af8214caaacf75973354c817cd2
SimHash 05201ca28a37

Groups

*

Rule Path
Disallow /matrix_engine/
Disallow /layout/
Disallow /cache/
Disallow /fr
Disallow /es
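
Disallow paths are matched as prefixes, so the rules in the * group without a trailing slash (/fr, /es) cover more than the language directories themselves. The sketch below illustrates this with a plain prefix check. The helper name is_disallowed and the sample paths are hypothetical, and wildcard handling (* and $) is deliberately left out because none of the rules above use it.

```python
# Disallow paths copied from the "*" group listed above.
STAR_GROUP_DISALLOW = ["/matrix_engine/", "/layout/", "/cache/", "/fr", "/es"]


def is_disallowed(path: str, rules=STAR_GROUP_DISALLOW) -> bool:
    """Return True if any Disallow rule is a prefix of the URL path.

    Simplified sketch: plain prefix matching only, no Allow rules and
    no wildcard support.
    """
    return any(path.startswith(rule) for rule in rules)


# Hypothetical sample paths, chosen only to show the prefix behaviour.
for sample in ["/fr/index.html", "/essay.html", "/layout", "/en/index.html"]:
    print(sample, is_disallowed(sample))
```

Note that /essay.html is caught by the /es rule, while /layout is not caught by /layout/ because the rule's trailing slash is part of the prefix.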

npbot

Rule Path
Disallow /

psbot

Rule Path
Disallow /

larbin

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

superbot

Rule Path
Disallow /

gaisbot

Rule Path
Disallow /

henrythemiragorobot

Rule Path
Disallow /

bumblebee@relevare.com

Rule Path
Disallow /

true_robot/1.0ll

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

szukacz/1.4

Rule Path
Disallow /

zeus

Rule Path
Disallow /

zeus 32297 webster pro v2.9 win32

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

voyager/1.0

Rule Path
Disallow /

mj12bot/v1.0.7 (http://majestic12.co.uk/bot.php?+) [^] [^]

Rule Path
Disallow /

aspider/0.09

Rule Path
Disallow /

appie 1.1 (www.walhello.com)

Rule Path
Disallow /
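
The groups above can be checked with Python's standard urllib.robotparser module. The following is a minimal sketch, not the scanner's own method: the robots.txt text is reconstructed by hand from a few of the groups listed in this report (the raw file is not shown here, so its exact layout is an assumption), and ExampleBot is a hypothetical crawler name that matches no named group.

```python
from urllib.robotparser import RobotFileParser

# Assumption: hand-reconstructed excerpt based on the groups in this report,
# not the raw file served by the site.
ROBOTS_TXT = """\
User-agent: *
Disallow: /matrix_engine/
Disallow: /layout/
Disallow: /cache/
Disallow: /fr
Disallow: /es

User-agent: psbot
Disallow: /

User-agent: httrack
Disallow: /
"""

BASE = "https://www.luise-berlin.com"


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text from memory instead of fetching it."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser()

# A crawler with its own group is blocked everywhere.
print(parser.can_fetch("psbot", BASE + "/en/"))
# A crawler without a named group falls back to the "*" group.
print(parser.can_fetch("ExampleBot", BASE + "/en/"))
print(parser.can_fetch("ExampleBot", BASE + "/cache/page.html"))
```

Parsing from a string keeps the example offline. To check the live file instead, RobotFileParser also offers set_url() followed by read().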

Other Records

Field Value
sitemap https://www.luise-berlin.com/sitemap.xml
sitemap https://www.luise-berlin.com/en/sitemap.xml