luluslly.com
robots.txt

Robots Exclusion Standard data for luluslly.com

Resource Scan

Scan Details

Site Domain luluslly.com
Base Domain luluslly.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-08-04T15:46:38+00:00
Next Scan 2025-11-02T15:46:38+00:00

Last Successful Scan

Scanned 2023-06-24T12:45:26+00:00
URL https://luluslly.com/robots.txt
Redirect https://www.luluslly.com/robots.txt
Redirect Domain www.luluslly.com
Redirect Base luluslly.com
Domain IPs 35.165.136.90
Redirect IPs 104.18.128.14, 104.18.129.14, 2606:4700::6812:800e, 2606:4700::6812:810e
Response IP 104.18.129.14
Found Yes
Hash 7624671b4e3ddcfdd1c92c464a0368cfd0c025f01ba869225faa62016f8c4c23
SimHash 6d149d6a71d0

Groups

*

Rule Path
Disallow /admin
Disallow /cart
Disallow /checkout
Disallow /orders
Disallow /search
Disallow /openapi
Disallow /*preview_theme_id*
Disallow /cdn-cgi
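
As a rough illustration of how the plain-prefix rules in this group might be evaluated, the sketch below rebuilds them in standard robots.txt syntax and checks a few paths with Python's urllib.robotparser. The agent name "examplebot", the helper name allowed, and the sample paths are assumptions made for the example. The /*preview_theme_id* rule is left out on purpose, because urllib.robotparser matches by simple prefix and does not expand wildcards.

```python
from urllib.robotparser import RobotFileParser

# Reconstruction of the "*" group listed above (assumed syntax; the
# wildcard rule is omitted because this parser only does prefix matching).
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin
Disallow: /cart
Disallow: /checkout
Disallow: /orders
Disallow: /search
Disallow: /openapi
Disallow: /cdn-cgi
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(path, agent="examplebot"):
    """Return True if the hypothetical agent may fetch the given path."""
    return parser.can_fetch(agent, "https://www.luluslly.com" + path)


if __name__ == "__main__":
    for sample in ("/products/dress", "/cart", "/checkout/step1"):
        print(sample, allowed(sample))
```

Because matching is by prefix, a rule such as /cart also covers any longer path that begins with the same characters.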

adsbot-google

Rule Path
Disallow /checkout
Disallow /cart
Disallow /orders
Disallow /*preview_theme_id*
Disallow /cdn-cgi
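
The /*preview_theme_id* rule relies on wildcard matching, which simple prefix matchers do not handle. A minimal sketch of one way to evaluate such a pattern follows, assuming the common convention that * matches any run of characters and a trailing $ anchors the end of the path. The function name rule_matches and the sample paths are hypothetical.

```python
import re


def rule_matches(pattern, path):
    """Return True if a robots.txt path pattern matches the given path.

    Assumes '*' matches any sequence of characters and a trailing '$'
    anchors the match at the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with a "match anything" token.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start, so the pattern behaves like a prefix rule.
    return re.match(regex, path) is not None


if __name__ == "__main__":
    print(rule_matches("/*preview_theme_id*", "/?preview_theme_id=123"))
    print(rule_matches("/*preview_theme_id*", "/collections/new"))
```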

pinterest

Rule Path
Disallow /api/cart*

Other Records

Field Value
crawl-delay 1

pinterestbot

Rule Path
Disallow /api/cart*

Other Records

Field Value
crawl-delay 1
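
A crawler that honors the crawl-delay record might read it as sketched below. The group is rebuilt in standard robots.txt syntax as an assumption, and "examplebot" is a hypothetical agent with no matching group. Note that crawl-delay is a widely used extension rather than part of the core Robots Exclusion Protocol, so support varies between crawlers.

```python
from urllib.robotparser import RobotFileParser

# Reconstruction of the two Pinterest groups listed above (assumed syntax).
ROBOTS_TXT = """\
User-agent: pinterest
Disallow: /api/cart*
Crawl-delay: 1

User-agent: pinterestbot
Disallow: /api/cart*
Crawl-delay: 1
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Delay in seconds for a named agent, or None when no group applies.
pinterest_delay = parser.crawl_delay("pinterestbot")
other_delay = parser.crawl_delay("examplebot")

if __name__ == "__main__":
    print(pinterest_delay, other_delay)
```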

Other Records

Field Value
sitemap https://www.luluslly.com/sitemap.xml.gz
sitemap https://www.luluslly.com/sitemap.xml.gz

Comments

  • Google adsbot ignores robots.txt unless specifically named!