anicetee.com
robots.txt

Robots Exclusion Standard data for anicetee.com

Resource Scan

Scan Details

Site Domain anicetee.com
Base Domain anicetee.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2025-08-06T13:12:32+00:00
Next Scan 2025-11-04T13:12:32+00:00

Last Successful Scan

Scanned 2023-03-23T08:28:32+00:00
URL https://anicetee.com/robots.txt
Redirect https://www.anicetee.com/robots.txt
Redirect Domain www.anicetee.com
Redirect Base anicetee.com
Domain IPs 103.172.191.1
Redirect IPs 104.18.128.14, 104.18.129.14, 2606:4700::6812:800e, 2606:4700::6812:810e
Response IP 104.18.129.14
Found Yes
Hash 2cfc658824780512c470f92a01d517b016f7149444eb97ba7062512455272cc2
SimHash 2d141d6a51c1

Groups

*

Rule Path
Disallow /admin
Disallow /cart
Disallow /checkout
Disallow /orders
Disallow /search
Disallow /openapi
Disallow /*preview_theme_id*
Disallow /cdn-cgi

adsbot-google

Rule Path
Disallow /checkout
Disallow /cart
Disallow /orders
Disallow /*preview_theme_id*
Disallow /cdn-cgi

pinterest

Rule Path
Disallow /api/cart*

Other Records

Field Value
crawl-delay 1

pinterestbot

Rule Path
Disallow /api/cart*

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.anicetee.com/sitemap.xml.gz
sitemap https://www.anicetee.com/sitemap.xml.gz

Comments

  • Google adsbot ignores robots.txt unless specifically named!
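
Example

The groups above can be checked programmatically. The following is a minimal sketch, assuming the original file looked roughly like the ROBOTS_TXT string below: it is reconstructed from the Groups and Other Records listed in this report, so directive order, casing, and spacing are guesses. The helper name allowed() is hypothetical. It uses Python's standard urllib.robotparser, which does plain prefix matching and does not interpret * wildcards inside paths, so only the plain-prefix rules are exercised here. It illustrates how a generic parser selects a group by user agent, not how Google's or Pinterest's crawlers actually behave.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report (assumption: not the verbatim file).
# The report lists the sitemap record twice; it appears once here.
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin
Disallow: /cart
Disallow: /checkout
Disallow: /orders
Disallow: /search
Disallow: /openapi
Disallow: /*preview_theme_id*
Disallow: /cdn-cgi

User-agent: adsbot-google
Disallow: /checkout
Disallow: /cart
Disallow: /orders
Disallow: /*preview_theme_id*
Disallow: /cdn-cgi

User-agent: pinterest
Crawl-delay: 1
Disallow: /api/cart*

User-agent: pinterestbot
Crawl-delay: 1
Disallow: /api/cart*

Sitemap: https://www.anicetee.com/sitemap.xml.gz
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` on the redirect domain?"""
    return parser.can_fetch(agent, "https://www.anicetee.com" + path)


if __name__ == "__main__":
    # A crawler with no named group falls back to the "*" group.
    print("SomeBot        /admin ->", allowed("SomeBot", "/admin"))
    # adsbot-google has its own group, which does not list /admin.
    print("adsbot-google  /admin ->", allowed("adsbot-google", "/admin"))
    print("adsbot-google  /cart  ->", allowed("adsbot-google", "/cart"))
    print("pinterest crawl-delay ->", parser.crawl_delay("pinterest"))
```

Because a named group replaces the * group rather than adding to it, the adsbot-google group has to repeat every path it wants blocked. In this sketch /admin is blocked for an unnamed crawler but not for adsbot-google, which matches the rule lists shown above.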