sugarscience.org
robots.txt

Robots Exclusion Standard data for sugarscience.org

Resource Scan

Scan Details

Site Domain sugarscience.org
Base Domain sugarscience.org
Scan Status Ok
Last Scan2025-08-06T00:50:17+00:00
Next Scan 2025-09-05T00:50:17+00:00

Last Scan

Scanned2025-08-06T00:50:17+00:00
URL https://www.sugarscience.org/robots.txt
Redirect http://sugarscience.ucsf.edu/robots.txt
Redirect Domain sugarscience.ucsf.edu
Redirect Base ucsf.edu
Domain IPs 104.21.37.253, 172.67.216.175, 2606:4700:3033::ac43:d8af, 2606:4700:3037::6815:25fd
Redirect IPs 68.66.224.31
Response IP 68.66.224.31
Found Yes
Hash 15584c1b064426ff035bcefc803b23ec91b2b5c43353473dc044b350e0258fd7
SimHash 3348abc64e9a

Groups

*

Rule Path
Disallow /assets/backup/
Disallow /assets/cache/
Disallow /assets/docs/
Disallow /assets/export/
Disallow /assets/import/
Disallow /assets/modules/
Disallow /assets/plugins/
Disallow /assets/snippets/
Disallow /assets/packages/
Disallow /assets/tvs/
Disallow /install/
Allow /assets/cache/images/
Allow /assets/modules/*.css
Allow /assets/modules/*.js
Allow /assets/plugins/*.css
Allow /assets/plugins/*.js
Allow /assets/snippets/*.css
Allow /assets/snippets/*.js

Comments

  • Default modx exclusions
  • Host: example.com
  • For sitemaps.xml autodiscovery. Uncomment if you have one:
  • Sitemap: http://example.com/sitemap.xml