execonline.hms.harvard.edu
robots.txt

Robots Exclusion Standard data for execonline.hms.harvard.edu

Resource Scan

Scan Details

Site Domain execonline.hms.harvard.edu
Base Domain harvard.edu
Scan Status Ok
Last Scan 2025-07-02T07:07:22+00:00
Next Scan 2025-08-01T07:07:22+00:00

Last Scan

Scanned 2025-07-02T07:07:22+00:00
URL https://execonline.hms.harvard.edu/robots.txt
Domain IPs 104.18.14.216, 104.18.15.216, 2606:4700::6812:ed8, 2606:4700::6812:fd8
Response IP 104.18.14.216
Found Yes
Hash 3c33029cb47e150a57ad4f81d1bf772b13c78bf04530d430d19ccee525b293c8
SimHash b9569d87de72

Groups

*

Rule Path
Disallow /admin/
Disallow /admin_users/
Disallow /users/
Disallow /saml/
Disallow /enterprise_admin/
Disallow /revision_previews/
Disallow /comparison/
Disallow /rails/active_storage/blobs/*
Disallow /programs/*/brochure
Disallow /$
Disallow /users/sign_in
Disallow /users/sign_up

triplecheckerrobot

Rule Path
Disallow /
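The two groups above can be checked against sample paths with a small matcher. The following is a minimal sketch, not the scanner's or any crawler's actual implementation. It assumes the common wildcard semantics, where `*` matches any run of characters and a trailing `$` anchors the end of the path. User-agent selection is simplified to an exact, case-insensitive name lookup with a fallback to the `*` group. The names `GROUPS`, `pattern_to_regex`, `is_disallowed`, and the agent `examplebot` are hypothetical and chosen for illustration only.

```python
import re

# Rules copied from the Groups listing above; every rule is a Disallow.
GROUPS = {
    "*": [
        "/admin/",
        "/admin_users/",
        "/users/",
        "/saml/",
        "/enterprise_admin/",
        "/revision_previews/",
        "/comparison/",
        "/rails/active_storage/blobs/*",
        "/programs/*/brochure",
        "/$",
        "/users/sign_in",
        "/users/sign_up",
    ],
    "triplecheckerrobot": ["/"],
}


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    Assumed semantics: '*' matches any characters, a trailing '$'
    anchors the end of the path, everything else is a literal prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_disallowed(user_agent: str, path: str) -> bool:
    """Return True if any Disallow rule in the selected group matches path.

    Simplification: the group is chosen by exact lowercase name,
    falling back to the '*' group when there is no named group.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    # re.match anchors at the start of the path, giving prefix matching.
    return any(pattern_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for agent, path in [
        ("examplebot", "/"),
        ("examplebot", "/programs"),
        ("examplebot", "/programs/abc/brochure"),
        ("triplecheckerrobot", "/programs"),
    ]:
        print(agent, path, is_disallowed(agent, path))
```

Under these assumptions, `/$` blocks only the bare root path, so other paths such as `/programs` remain crawlable for agents in the `*` group, while `triplecheckerrobot` is blocked from every path. The `/users/sign_in` and `/users/sign_up` rules are already covered by the broader `/users/` prefix.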

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file