honeylove.com
robots.txt

Robots Exclusion Standard data for honeylove.com

Resource Scan

Scan Details

Site Domain honeylove.com
Base Domain honeylove.com
Scan Status Ok
Last Scan2024-11-15T07:47:49+00:00
Next Scan 2024-11-29T07:47:49+00:00

Last Scan

Scanned2024-11-15T07:47:49+00:00
URL https://honeylove.com/robots.txt
Redirect https://www.honeylove.com/robots.txt
Redirect Domain www.honeylove.com
Redirect Base honeylove.com
Domain IPs 23.227.38.32
Redirect IPs 23.227.38.74, 2620:127:f00f:e::
Response IP 23.227.38.74
Found Yes
Hash 784e51520274e1be986ea753488b32226cc236c9c51c8cf16f10947971eab469
SimHash 6515bd62f512

Groups

*

Rule Path
Disallow /admin
Disallow /cart
Disallow /checkout
Disallow /carts
Disallow /account
Disallow /profile
Disallow */search
Disallow */track-your-order
Disallow /*?size=
Disallow /*?bundleColor0
Disallow /*?okeReviewsNextUrl=
Disallow /stores
Disallow /hc/en-us/search
Disallow /en-*
Allow /en-us
Allow /en-gb
Allow /en-ca
Allow /en-au

adsbot-google

Rule Path
Disallow /checkout
Disallow /carts
Disallow /orders
Disallow /*/src/tree
Disallow /*/traverser

pinterest

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.honeylove.com/sitemap.xml

Comments

  • Google adsbot ignores robots.txt unless specifically named!