wallake.org
robots.txt

Robots Exclusion Standard data for wallake.org

Resource Scan

Scan Details

Site Domain wallake.org
Base Domain wallake.org
Scan Status Ok
Last Scan2024-09-30T20:26:02+00:00
Next Scan 2024-10-07T20:26:02+00:00

Last Scan

Scanned2024-09-30T20:26:02+00:00
URL https://wallake.org/robots.txt
Domain IPs 91.222.237.62
Response IP 91.222.237.62
Found Yes
Hash 782e636589e97c65b9290ec2e7e1159f2c97683ff8d0a4e721be271fed2ddd20
SimHash 4c40ea50cf33

Groups

yandex
*

Rule Path
Disallow /*embed/
Disallow /*%40
Disallow /*%21
Disallow /*?
Disallow /*%26
Allow /engine/classes/min/index.php?*
Allow /templates/*?
Allow /uploads/*?

yandex

Rule Path
Disallow /en/
Disallow /de/
Disallow /es/
Disallow /fr/

Other Records

Field Value
sitemap https://wallake.org/sitemap.xml

Warnings

  • `host` is not a known field.