Robots Exclusion Standard data for lhacc.org

Resource Scan

Scan Details

Site Domain lhacc.org
Base Domain lhacc.org
Scan Status Ok
Last Scan 2026-02-06T12:35:29+00:00
Next Scan 2026-03-08T12:35:29+00:00

Last Scan

Scanned 2026-02-06T12:35:29+00:00
URL https://lhacc.org/robots.txt
Redirect https://www.lhacc.org/robots.txt
Redirect Domain www.lhacc.org
Redirect Base lhacc.org
Domain IPs 199.34.228.164
Redirect IPs 199.34.228.164
Response IP 199.34.228.164
Found Yes
Hash 0863351e55ebd8b11d2aa741618790d5efd0dc40c100c12595804742a60045b7
SimHash ec289804fa93

Groups

*

Rule Path
Disallow /s/search
Disallow /s/cart/
Disallow /s/checkout/
Disallow /store/checkout
Disallow /store/status
Disallow /product/*/*/leave-review

Other Records

Field Value
crawl-delay 5
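
The `*` group's last rule, `/product/*/*/leave-review`, uses the `*` wildcard. The sketch below shows one way such a pattern could be matched against a request path, assuming the common convention that `*` matches any run of characters and a trailing `$` anchors the end of the path. The helper names `rule_to_regex` and `is_blocked` and the sample product paths are hypothetical, not taken from the scan.

```python
import re


def rule_to_regex(rule_path: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any run of characters (including '/'),
    and a trailing '$' anchors the pattern to the end of the path.
    """
    anchored = rule_path.endswith("$")
    if anchored:
        rule_path = rule_path[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    pattern = ".*".join(re.escape(piece) for piece in rule_path.split("*"))
    if anchored:
        pattern += "$"
    return re.compile(pattern)


def is_blocked(path: str, rule_path: str) -> bool:
    """Return True if the rule pattern matches the path from its start."""
    return rule_to_regex(rule_path).match(path) is not None


# Rule taken from the '*' group above.
REVIEW_RULE = "/product/*/*/leave-review"

# Hypothetical paths, for illustration only.
print(is_blocked("/product/widget/123/leave-review", REVIEW_RULE))
print(is_blocked("/product/widget/123", REVIEW_RULE))
```

Because rules match by prefix, a path that continues past `leave-review` would also be blocked under this reading.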

googlebot

Rule Path
Disallow (empty)

googlebot-image

Rule Path
Disallow (empty)

Other Records

Field Value
sitemap https://www.lhacc.org/sitemap.xml
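
To see how a crawler might read these groups, here is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is an assumed reconstruction from the groups and records listed above, since the scan does not show the file verbatim. The `SomeBot` user agent and the `/about` path are hypothetical. The standard-library parser does not expand `*` wildcards in paths, so the `leave-review` rule is not exercised here.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the scanned file; the exact original text,
# ordering, and whitespace are not shown in the scan report.
ROBOTS_TXT = """\
User-agent: *
Disallow: /s/search
Disallow: /s/cart/
Disallow: /s/checkout/
Disallow: /store/checkout
Disallow: /store/status
Disallow: /product/*/*/leave-review
Crawl-delay: 5

User-agent: googlebot
Disallow:

User-agent: googlebot-image
Disallow:

Sitemap: https://www.lhacc.org/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://www.lhacc.org"

# A generic crawler (hypothetical name) falls under the '*' group.
print(parser.can_fetch("SomeBot", BASE + "/s/search"))
print(parser.can_fetch("SomeBot", BASE + "/about"))

# An empty Disallow places no restriction on the named agent.
print(parser.can_fetch("googlebot", BASE + "/s/search"))

# crawl-delay is recorded only in the '*' group.
print(parser.crawl_delay("SomeBot"))
print(parser.crawl_delay("googlebot"))

# site_maps() requires Python 3.8 or later.
print(parser.site_maps())
```

The empty `Disallow` lines explain why the googlebot groups list a rule with no path: they exempt those agents from the restrictions in the `*` group.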