thesimplepiece.com
robots.txt

Robots Exclusion Standard data for thesimplepiece.com

Resource Scan

Scan Details

Site Domain thesimplepiece.com
Base Domain thesimplepiece.com
Scan Status Ok
Last Scan2025-09-21T23:54:05+00:00
Next Scan 2025-10-21T23:54:05+00:00

Last Scan

Scanned2025-09-21T23:54:05+00:00
URL https://thesimplepiece.com/robots.txt
Redirect https://thesimplepiece.easy.co/robots.txt
Redirect Domain thesimplepiece.easy.co
Redirect Base easy.co
Domain IPs 104.21.63.189, 172.67.171.178, 2606:4700:3035::6815:3fbd, 2606:4700:3037::ac43:abb2
Redirect IPs 13.35.37.64, 13.35.37.90, 13.35.37.92, 13.35.37.99
Response IP 13.35.37.90
Found Yes
Hash 39ae6e2f9f89f5ae61fa9cb0574f4df5724244cf0771c60ff8c13bc1afffa764
SimHash 46149d767451

Groups

*

Rule Path
Disallow /a/
Disallow /account
Disallow /api
Disallow /apps/
Disallow /cart
Disallow /checkout
Disallow /community/
Disallow /orders
Disallow /payments
Disallow /search
Disallow /sf/cart
Disallow /sf/checkout
Disallow /tools/
Disallow /*preview_script_id*
Disallow /*preview_theme_id*
Disallow /apple-app-site-association

adsbot-google
googlebot
googlebot-image

Rule Path
Disallow /api
Disallow /cart
Disallow /checkout
Disallow /orders
Disallow /payments
Disallow /search
Disallow /sf/cart
Disallow /sf/checkout
Disallow /*preview_theme_id*
Disallow /*preview_script_id*

nutch

Rule Path
Disallow /

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

blexbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ahrefssiteaudit

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://thesimplepiece.easy.co/sitemap.xml

Comments

  • Google adsbot ignores robots.txt unless specifically named!
  • Explicitly state Googlebot & Googlebot-Image to try Google Shopping