health.harvard.edu
robots.txt

Robots Exclusion Standard data for health.harvard.edu

Resource Scan

Scan Details

Site Domain health.harvard.edu
Base Domain harvard.edu
Scan Status Ok
Last Scan 2024-05-22T22:21:52+00:00
Next Scan 2024-06-21T22:21:52+00:00

Last Scan

Scanned 2024-05-22T22:21:52+00:00
URL https://health.harvard.edu/robots.txt
Redirect https://www.health.harvard.edu/robots.txt
Redirect Domain www.health.harvard.edu
Redirect Base harvard.edu
Domain IPs 54.165.240.143
Redirect IPs 54.165.240.143
Response IP 54.165.240.143
Found Yes
Hash ac076899b174da1f4a38091b39e4f3dca059a2c105066bd63f544ae52eb17205
SimHash a014d820e153

Groups

gptbot
google-extended

Rule Path
Disallow /
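
The group above pairs two user agents with a single Disallow rule. As a minimal sketch of how a crawler might evaluate it, the snippet below feeds a reconstructed file to Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is an assumption rebuilt from the scan's Groups data, not the verbatim file, and `is_allowed` and `ExampleBot` are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the single group reported by the scan.
# The live file's exact text (comments, casing, ordering) is not shown
# in the scan data, so treat this as illustrative only.
ROBOTS_TXT = """\
User-agent: gptbot
User-agent: google-extended
Disallow: /
"""


def is_allowed(user_agent: str, url: str) -> bool:
    """Return True if the reconstructed rules permit user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    url = "https://www.health.harvard.edu/"
    # ExampleBot is a hypothetical agent that matches neither listed group.
    for agent in ("GPTBot", "Google-Extended", "ExampleBot"):
        print(agent, is_allowed(agent, url))
```

Under these assumptions, both listed agents are refused every path, while an agent that matches no group is permitted because the reconstructed file defines no catch-all `*` group.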