hmc.edu
robots.txt

Robots Exclusion Standard data for hmc.edu

Resource Scan

Scan Details

Site Domain hmc.edu
Base Domain hmc.edu
Scan Status Ok
Last Scan2025-08-04T18:04:48+00:00
Next Scan 2025-09-03T18:04:48+00:00

Last Scan

Scanned2025-08-04T18:04:48+00:00
URL https://hmc.edu/robots.txt
Redirect https://www.hmc.edu/robots.txt
Redirect Domain www.hmc.edu
Redirect Base hmc.edu
Domain IPs 66.33.202.112
Redirect IPs 66.33.202.112
Response IP 66.33.202.112
Found Yes
Hash e5a3bca71b90665eff14c7c1ef8a6d46bd2c8d4840fc8b2d71f96785756ebc10
SimHash d904c090c283

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /admin/
Disallow /admission/alumni-ambassadors/
Disallow /parents/messages-from-president-klawe/
Disallow /non-wp-sites/old-news/

googlebot-image

Rule Path
Disallow /wp-content/themes/hmc-core/images/seal.png

amazonbot

Rule Path
Disallow /