root.irobot.com
robots.txt

Robots Exclusion Standard data for root.irobot.com

Resource Scan

Scan Details

Site Domain root.irobot.com
Base Domain irobot.com
Scan Status Ok
Last Scan2025-07-02T02:12:13+00:00
Next Scan 2025-07-16T02:12:13+00:00

Last Scan

Scanned2025-07-02T02:12:13+00:00
URL https://root.irobot.com/robots.txt
Redirect https://www.root.irobot.com/robots.txt
Redirect Domain www.root.irobot.com
Redirect Base irobot.com
Domain IPs 50.16.35.210
Redirect IPs 54.227.64.114
Response IP 54.227.64.114
Found Yes
Hash b12ea9477e4f641eee13b5f8585566be6958d118cb924334e9bc36f5952cfc6e
SimHash 43389f122fb2

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/
Disallow /assets/materials/

Other Records

Field Value
sitemap https://edu.irobot.com/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://edu.irobot.com/
  • live - don't allow web crawlers to index cpresources/ or vendor/