simplyearth.com
robots.txt

Robots Exclusion Standard data for simplyearth.com

Resource Scan

Scan Details

Site Domain simplyearth.com
Base Domain simplyearth.com
Scan Status Ok
Last Scan 2026-02-05T04:46:15+00:00
Next Scan 2026-03-07T04:46:15+00:00

Last Scan

Scanned 2026-02-05T04:46:15+00:00
URL https://simplyearth.com/robots.txt
Domain IPs 104.26.12.95, 104.26.13.95, 172.67.71.113, 2606:4700:20::681a:c5f, 2606:4700:20::681a:d5f, 2606:4700:20::ac43:4771
Response IP 172.67.71.113
Found Yes
Hash 875a0104ff429bd1a4722854dee2d3ef3c64ebbb9af3f8a23e712473948a580d
SimHash 6917ebf70f17
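
The Hash value above is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. The report does not say which algorithm or input the scanner uses, so the sketch below assumes SHA-256 over the raw response body; the function name body_digest is hypothetical.

```python
import hashlib

# Value copied from the "Hash" row of this report.
REPORTED_HASH = "875a0104ff429bd1a4722854dee2d3ef3c64ebbb9af3f8a23e712473948a580d"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes with SHA-256. The report
    does not confirm this; only the digest length is consistent with it.
    """
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes) -> bool:
    """Compare a freshly fetched body against the hash stored in the report."""
    return body_digest(body) == REPORTED_HASH
```

If the assumption holds, a mismatch from matches_report on a later fetch would indicate that the file changed since the scan.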

Groups

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-news

Rule Path
Disallow

googlebot-video

Rule Path
Disallow

*

Rule Path
Disallow /products/retail-starter-pack-2
Disallow /products/retail-starter-pack-1
Disallow /products/display-box-promotional-materials
Disallow /products/honest-price-card-promotional-materials
Disallow /products/big-bonus-box
Disallow /products/starter-retail
Disallow /products/wholesale-welcome-packet
Disallow /products/essential-oils-for-kids
Disallow /products/stone-diffuser-plus-blend
Disallow /products/essential-oils-for-diffusing
Disallow /products/wholesale-car-diffuser
Disallow /products/essential-oils-for-cleaning
Disallow /products/massage-kit
Disallow /products/essential-oil-safety-chart
Disallow /products/essential-oil-benefits-chart
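
The groups above can be checked with Python's standard urllib.robotparser. This is a minimal sketch: the robots.txt text is reconstructed from the listed groups (the file's exact text is not shown in this report), only three of the fifteen "*" paths are repeated, and the agent name ExampleBot and the path /products/lavender are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstruction from the groups listed in this report, not the original file.
# An empty "Disallow:" value means nothing is disallowed for that group.
ROBOTS_TXT = """\
User-agent: googlebot
Disallow:

User-agent: googlebot-image
Disallow:

User-agent: *
Disallow: /products/big-bonus-box
Disallow: /products/massage-kit
Disallow: /products/essential-oil-safety-chart
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if the given agent may fetch the given path."""
    return parser.can_fetch(agent, "https://simplyearth.com" + path)


# Googlebot matches its own group, whose empty Disallow permits everything.
print(allowed("googlebot", "/products/massage-kit"))
# Other crawlers fall back to the "*" group and its path-prefix rules.
print(allowed("ExampleBot", "/products/massage-kit"))
print(allowed("ExampleBot", "/products/lavender"))
```

Matching is by path prefix, so a rule such as /products/massage-kit also covers any longer path that begins with the same string.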

Other Records

Field Value
sitemap https://simplyearth.com/sitemap.xml
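
The sitemap record can be read with the same standard-library parser. The sketch below assumes the record appears in the file as a single "Sitemap:" line, which is how the field is normally written; RobotFileParser.site_maps() requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumed form of the record in the file; the report shows only field and value.
lines = ["Sitemap: https://simplyearth.com/sitemap.xml"]

parser = RobotFileParser()
parser.parse(lines)

# site_maps() returns a list of sitemap URLs, or None when none are declared.
sitemaps = parser.site_maps()
print(sitemaps)
```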