commercialledlights.com
robots.txt

Robots Exclusion Standard data for commercialledlights.com

Resource Scan

Scan Details

Site Domain commercialledlights.com
Base Domain commercialledlights.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-12-17T12:08:42+00:00
Next Scan 2026-03-17T12:08:42+00:00

Last Successful Scan

Scanned2025-01-28T23:39:56+00:00
URL https://commercialledlights.com/robots.txt
Domain IPs 104.26.6.59, 104.26.7.59, 172.67.71.11, 2606:4700:20::681a:63b, 2606:4700:20::681a:73b, 2606:4700:20::ac43:470b
Response IP 172.67.71.11
Found Yes
Hash df532871c5eba850cfdf023357b4d7df385abe6d9b18e6b64df99ba9b914b0fa
SimHash 4135d912c3d0

Groups

amazonbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

seekportbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

geedoproductsearch

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

*

Rule Path Comment
Allow /*?p= -
Allow /*?%2B -
Allow /*?utm_source= -
Allow *.css -
Allow *.js global
Disallow /index.php/ -
Disallow /app/ -
Disallow /lib/ -
Disallow /*.php$ -
Disallow /pkginfo/ -
Disallow /report/ -
Disallow /var/ -
Disallow /catalog/ -
Disallow /catalogsearch/ -
Disallow /customer/ -
Disallow /sendfriend/ -
Disallow /review/ -
Disallow /*SID%3D -

Other Records

Field Value
crawl-delay 10

Comments

  • Disallow: /*?* # Disallow all URLs with a query string

Warnings

  • `noindex` is not a known field.