doterra.com
robots.txt

Robots Exclusion Standard data for doterra.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	doterra.com
Base Domain	doterra.com
Scan Status	Ok
Last Scan	2024-11-07T01:16:37+00:00
Next Scan	2024-12-07T01:16:37+00:00

Last Scan

Scanned	2024-11-07T01:16:37+00:00
URL	https://doterra.com/robots.txt
Redirect	https://www.doterra.com/robots.txt
Redirect Domain	www.doterra.com
Redirect Base	doterra.com
Domain IPs	45.60.102.13, 45.60.12.13
Redirect IPs	45.60.16.13
Response IP	45.60.16.13
Found	Yes
Hash	f11b7c0e6a22e1b10250b9a5e8533424d1e1bf697bb6fc794ccdabc0dec66d85
SimHash	ec405f9cefe0

Groups

*

Rule	Path
Disallow
Disallow	/US/en/cart
Disallow	/US/en/checkout
Disallow	/US/en/my-account

Rule

Path

Disallow

/US/en/cart

Disallow

/US/en/checkout

Disallow

/US/en/my-account

cazoodlebot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

dotbot/1.0

Rule	Path
Disallow	/

Rule

Path

Disallow

/

gigabot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Other Records

Field	Value
sitemap	http://www.doterra.com/sitemap.xml

Field

Value

sitemap

http://www.doterra.com/sitemap.xml

Back to top

Comments

For all robots
Block access to specific groups of pages
Request-rate: 1/10 # maximum rate is one page every 10 seconds
Crawl-delay: 10 # 10 seconds between page requests
Visit-time: 0400-0845 # only visit between 04:00 and 08:45 UTC
Allow search crawlers to discover the sitemap
Sitemap: /US/en/sitemap.xml
Block CazoodleBot as it does not present correct accept content headers
Block MJ12bot as it is just noise
Block dotbot as it cannot parse base urls properly
Block Gigabot

Back to top

doterra.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

cazoodlebot

mj12bot

dotbot/1.0

gigabot

Other Records

Comments

doterra.com
robots.txt