amahaarden.com
robots.txt

Robots Exclusion Standard data for amahaarden.com

Resource Scan

Scan Details

Site Domain amahaarden.com
Base Domain amahaarden.com
Scan Status Ok
Last Scan2026-01-27T04:38:15+00:00
Next Scan 2026-02-03T04:38:15+00:00

Last Scan

Scanned2026-01-27T04:38:15+00:00
URL https://amahaarden.com/robots.txt
Redirect https://www.amahaarden.com/robots.txt
Redirect Domain www.amahaarden.com
Redirect Base amahaarden.com
Domain IPs 2a00:1c98:1000:1172:0:2:af2:ea0d, 5.134.4.110
Redirect IPs 2a00:1c98:1000:1172:0:2:af2:ea0d, 5.134.4.110
Response IP 5.134.4.110
Found Yes
Hash d420c14c5b76066bae67d5a699b872408a41bc700afc66f4f058190d9d3770d0
SimHash 41381d5637b3

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

Other Records

Field Value
sitemap https://www.amahaarden.com/nl/sitemaps-2-sitemap.xml

Comments

  • robots.txt for https://www.amahaarden.com/nl/
  • live - don't allow web crawlers to index cpresources/ or vendor/