luckau.de
robots.txt

Robots Exclusion Standard data for luckau.de

Resource Scan

Scan Details

Site Domain luckau.de
Base Domain luckau.de
Scan Status Ok
Last Scan2025-11-02T17:43:48+00:00
Next Scan 2025-12-02T17:43:48+00:00

Last Scan

Scanned2025-11-02T17:43:48+00:00
URL https://luckau.de/robots.txt
Domain IPs 178.250.10.88
Response IP 178.250.10.88
Found Yes
Hash dbf0c37b144727b48449c7771b4cee450d807c38263b42b0cda6bef3e2cf9ed5
SimHash 69540b126333

Groups

*

Rule Path
Disallow /visioncontent/

surveybot

Rule Path
Disallow /

websitewiki

Rule Path
Disallow /

shopwiki

Rule Path
Disallow /

jakarta commons-httpclient

Rule Path
Disallow /

aboutusbot

Rule Path
Disallow /

semager

Rule Path
Disallow /

Comments

  • www.robotstxt.org/
  • Allow crawling of all content
  • D2