thewarhaus.com
robots.txt

Robots Exclusion Standard data for thewarhaus.com

Resource Scan

Scan Details

Site Domain thewarhaus.com
Base Domain thewarhaus.com
Scan Status Ok
Last Scan2026-02-22T09:33:08+00:00
Next Scan 2026-03-24T09:33:08+00:00

Last Scan

Scanned2026-02-22T09:33:08+00:00
URL https://thewarhaus.com/robots.txt
Redirect https://www.thewarhaus.com/robots.txt
Redirect Domain www.thewarhaus.com
Redirect Base thewarhaus.com
Domain IPs 199.34.228.162
Redirect IPs 199.34.228.162
Response IP 199.34.228.162
Found Yes
Hash 482a98c47fed8c47e79ec347529457a73e83a46409cd68f2bdf687aaeddcc116
SimHash ec281804f292

Groups

*

Rule Path
Disallow /s/search
Disallow /s/cart/
Disallow /s/checkout/
Disallow /store/checkout
Disallow /store/status
Disallow /product/*/*/leave-review

Other Records

Field Value
crawl-delay 5

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.thewarhaus.com/sitemap.xml