hir.harvard.edu
robots.txt

Robots Exclusion Standard data for hir.harvard.edu

Resource Scan

Scan Details

Site Domain hir.harvard.edu
Base Domain harvard.edu
Scan Status Ok
Last Scan2024-10-28T11:26:11+00:00
Next Scan 2024-11-27T11:26:11+00:00

Last Scan

Scanned2024-10-28T11:26:11+00:00
URL https://hir.harvard.edu/robots.txt
Domain IPs 134.209.72.237
Response IP 134.209.72.237
Found Yes
Hash 8c92bfa6a988dd9a261fdd891ada4a06c9533d11fa97e89831032c6502277138
SimHash e01c7505af93

Groups

*

Rule Path
Disallow /ghost/
Disallow /p/
Disallow /email/
Disallow /r/

Other Records

Field Value
sitemap https://hir.harvard.edu/sitemap.xml