bouldergardens.com
robots.txt

Robots Exclusion Standard data for bouldergardens.com

Resource Scan

Scan Details

Site Domain bouldergardens.com
Base Domain bouldergardens.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-05-28T13:29:29+00:00
Next Scan 2024-06-11T13:29:29+00:00

Last Successful Scan

Scanned2024-04-20T08:33:32+00:00
URL https://bouldergardens.com/robots.txt
Domain IPs 216.137.39.105, 216.137.39.41, 216.137.39.47, 216.137.39.49
Response IP 18.165.171.35
Found Yes
Hash 6bc1062acc455157d4b2fb6435ff93ca364e29904e3bb8c0de1a8478628348c3
SimHash cc3dd132ce31

Groups

blexbot
seznambot
ccbot
spbot
semrushbot
mj12bot
baiduspider
yandex
mauibot
linguee

Rule Path
Disallow /

*

Rule Path
Disallow /catalogsearch
Disallow /api/
Disallow /checkout/
Disallow /customer/
Disallow /dashboard/
Disallow /index.php/
Disallow /fcc/
Allow /customer/account/login/
Disallow *%26amp%3B*
Allow /*?p=
Disallow /*?

storebot-google

Rule Path
Disallow /catalogsearch
Disallow /api/
Disallow /customer/
Disallow /dashboard/
Disallow /index.php/
Disallow /fcc/
Allow /customer/account/login/
Disallow *%26amp%3B*
Allow /*?p=
Disallow /*?

Other Records

Field Value
sitemap https://bouldergardens.com/sitemap_index.xml

Comments

  • Disallow SalesForce Marketing Cloud links that appear to escape URLs
  • Disallow platform-specific URLs that come across as a query string,
  • but allow pagination (`?p=[d]`)