missbeehavenflorist.com
robots.txt

Robots Exclusion Standard data for missbeehavenflorist.com

Resource Scan

Scan Details

Site Domain missbeehavenflorist.com
Base Domain missbeehavenflorist.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-05-28T11:48:16+00:00
Next Scan 2024-06-11T11:48:16+00:00

Last Successful Scan

Scanned2024-04-20T08:33:34+00:00
URL https://missbeehavenflorist.com/robots.txt
Domain IPs 18.155.192.117, 18.155.192.44, 18.155.192.54, 18.155.192.56
Response IP 18.165.171.35
Found Yes
Hash c1bcc0b9c2252ceb1d11c13113de40cce061798f246261625b434f649cea3fd8
SimHash cc3cd032ce71

Groups

blexbot
seznambot
ccbot
spbot
semrushbot
mj12bot
baiduspider
yandex
mauibot
linguee

Rule Path
Disallow /

*

Rule Path
Disallow /catalogsearch
Disallow /api/
Disallow /checkout/
Disallow /customer/
Disallow /dashboard/
Disallow /index.php/
Disallow /fcc/
Allow /customer/account/login/
Disallow *%26amp%3B*
Allow /*?p=
Disallow /*?

storebot-google

Rule Path
Disallow /catalogsearch
Disallow /api/
Disallow /customer/
Disallow /dashboard/
Disallow /index.php/
Disallow /fcc/
Allow /customer/account/login/
Disallow *%26amp%3B*
Allow /*?p=
Disallow /*?

Other Records

Field Value
sitemap https://missbeehavenflorist.com/sitemap_index.xml

Comments

  • Disallow SalesForce Marketing Cloud links that appear to escape URLs
  • Disallow platform-specific URLs that come across as a query string,
  • but allow pagination (`?p=[d]`)