diodedrive.com
robots.txt

Robots Exclusion Standard data for diodedrive.com

Resource Scan

Scan Details

Site Domain diodedrive.com
Base Domain diodedrive.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-10-11T00:28:28+00:00
Next Scan 2025-11-10T00:28:28+00:00

Last Successful Scan

Scanned2025-08-19T15:54:16+00:00
URL https://diodedrive.com/robots.txt
Redirect https://www.diodedrive.com/robots.txt
Redirect Domain www.diodedrive.com
Redirect Base diodedrive.com
Domain IPs 146.75.45.124
Redirect IPs 151.101.1.124, 151.101.129.124, 151.101.193.124, 151.101.65.124
Response IP 199.232.45.124
Found Yes
Hash 78828bfc3c2d2b1f73ebad24e631d22b47c76d47c5089a02144989e5aeadcfbb
SimHash 6d35d102cbef

Groups

gptbot

Rule Path
Disallow /checkout/

bingbot

Rule Path
Disallow /checkout/

applebot

Rule Path
Disallow /checkout/

ahrefsbot

Rule Path
Disallow /checkout/

claudebot

Rule Path
Disallow /checkout/

Other Records

Field Value
crawl-delay 1

*

Rule Path
Disallow /index.php/
Disallow /catalog/product_compare/
Disallow /catalog/category/view/
Disallow /catalogsearch/
Disallow /control/
Disallow /contacts/
Disallow /customer/
Disallow /customize/
Disallow /newsletter/
Disallow /sendfriend/
Disallow /wishlist/
Disallow /review/product/listAjax/id/
Disallow /*.php$
Disallow /*?SID=
Disallow /home2
Disallow /jacks-sandbox
Disallow /jakes-sandbox

Other Records

Field Value
sitemap https://www.diodedrive.com/media/google_sitemap_2.xml

Comments

  • Paths (clean URLs)
  • Disallowing this path is not recommended as many sitemaps will contain this path instead of a custom URL key.
  • Disallow: /catalog/product/view/
  • Do not index session ID
  • CMS Pages we don't want index
  • Content sandboxes so we can preview stuff before adding blocks to many pages but don't want index.