superbrightleds.com
robots.txt

Robots Exclusion Standard data for superbrightleds.com

Resource Scan

Scan Details

Site Domain superbrightleds.com
Base Domain superbrightleds.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-10-12T04:49:32+00:00
Next Scan 2025-10-26T04:49:32+00:00

Last Successful Scan

Scanned2025-09-03T01:42:23+00:00
URL https://superbrightleds.com/robots.txt
Redirect https://www.superbrightleds.com/robots.txt
Redirect Domain www.superbrightleds.com
Redirect Base superbrightleds.com
Domain IPs 146.75.45.124
Redirect IPs 151.101.1.124, 151.101.129.124, 151.101.193.124, 151.101.65.124
Response IP 146.75.45.124
Found Yes
Hash f17f4586a37aaf7a7829abcb3321c96e924675372198329670f12a3eeeb36ceb
SimHash 6d34d902cbcf

Groups

gptbot

Rule Path
Disallow /checkout/

bingbot

Rule Path
Disallow /checkout/

applebot

Rule Path
Disallow /checkout/

ahrefsbot

Rule Path
Disallow /checkout/

claudebot

Rule Path
Disallow /checkout/

Other Records

Field Value
crawl-delay 1

*

Rule Path
Disallow /index.php/
Disallow /catalog/product_compare/
Disallow /catalog/category/view/
Disallow /catalogsearch/
Disallow /control/
Disallow /contacts/
Disallow /customer/
Disallow /customize/
Disallow /newsletter/
Disallow /sendfriend/
Disallow /wishlist/
Disallow /review/product/listAjax/id/
Disallow /*.php$
Disallow /*?SID=
Disallow /home2
Disallow /jacks-sandbox
Disallow /jakes-sandbox

Other Records

Field Value
sitemap https://www.superbrightleds.com/media/google_sitemap_1.xml

Comments

  • Paths (clean URLs)
  • Disallowing this path is not recommended as many sitemaps will contain this path instead of a custom URL key.
  • Disallow: /catalog/product/view/
  • Do not index session ID
  • CMS Pages we don't want index
  • Content sandboxes so we can preview stuff before adding blocks to many pages but don't want index.