lightart.com
robots.txt

Robots Exclusion Standard data for lightart.com

Resource Scan

Scan Details

Site Domain lightart.com
Base Domain lightart.com
Scan Status Ok
Last Scan 2026-02-15T18:50:13+00:00
Next Scan 2026-02-22T18:50:13+00:00

Last Scan

Scanned 2026-02-15T18:50:13+00:00
URL https://lightart.com/robots.txt
Domain IPs 35.95.54.49
Response IP 35.95.54.49
Found Yes
Hash 0ebeebc6722711d0ecd254937bd0ce780131853e248d691ddf5562d02208a0fc
SimHash 41681d523d13
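
The Hash value above is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. The report does not name the algorithm, so the sketch below assumes SHA-256 over the raw response body. It shows how a fetched robots.txt could be compared against the recorded value to detect a change between scans. The function names fingerprint and has_changed are hypothetical.

```python
import hashlib

# Hash value recorded in the scan report above.
RECORDED_HASH = "0ebeebc6722711d0ecd254937bd0ce780131853e248d691ddf5562d02208a0fc"


def fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of a response body.

    Assumption: the report's Hash field is SHA-256 of the raw bytes.
    The report itself does not state the algorithm.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, recorded_hash: str = RECORDED_HASH) -> bool:
    """Report whether a freshly fetched body differs from the recorded hash."""
    return fingerprint(body) != recorded_hash.lower()
```

If the assumption holds, has_changed returns False only when the fetched bytes are identical to what the scanner saw, so even a whitespace change in the file would register as a difference.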

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/
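
The effect of these rules can be checked with Python's standard urllib.robotparser module. The sketch below rebuilds the group from the table above as a robots.txt string; the exact bytes of the live file are an assumption, as are the user agent name ExampleBot and the sample paths in the usage note.

```python
from urllib.robotparser import RobotFileParser

# Reconstruction of the "*" group from the rule table above.
# Assumption: the live file may differ in comments, order, or whitespace.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cpresources/
Disallow: /vendor/
Disallow: /.env
Disallow: /cache/
"""


def is_allowed(path: str, user_agent: str = "ExampleBot") -> bool:
    """Return True if the rules permit user_agent to fetch the given path."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, "https://lightart.com" + path)
```

Under this reconstruction, is_allowed("/") returns True, while is_allowed("/vendor/autoload.php") and is_allowed("/.env") return False. The standard library parser matches rules by path prefix, so every path beneath a disallowed directory is blocked as well.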

Other Records

Field Value
sitemap https://lightart.com/sitemaps-1-sitemap.xml
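
A crawler can read the sitemap record with the same standard library parser. This is a minimal sketch, assuming the record appears in the file as a single Sitemap line; the function name sitemap_urls is hypothetical, and RobotFileParser.site_maps() requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumption: the sitemap record is written as one "Sitemap:" line.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cache/

Sitemap: https://lightart.com/sitemaps-1-sitemap.xml
"""


def sitemap_urls(robots_txt: str) -> list:
    """Return every sitemap URL declared in a robots.txt body."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    # site_maps() returns None when no sitemap record is present.
    return parser.site_maps() or []
```

Sitemap records are independent of user agent groups, so the parser reports them regardless of where they appear in the file.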

Comments

  • robots.txt for https://lightart.com/
  • live - don't allow web crawlers to index cpresources/ or vendor/