environment.research.yale.edu
robots.txt

Robots Exclusion Standard data for environment.research.yale.edu

Resource Scan

Scan Details

Site Domain environment.research.yale.edu
Base Domain yale.edu
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-03-03T04:17:48+00:00
Next Scan 2024-06-01T04:17:48+00:00

Last Successful Scan

Scanned2022-10-07T12:50:48+00:00
URL http://environment.research.yale.edu/robots.txt
Response IP 23.185.0.4
Found Yes
Hash 885ece1835d3d7106496ced8aab5b58cd83465ed3d8b7ee8a5dab687d79622e8
SimHash 8808db22a6a2

Groups

*

Rule Path
Disallow /

ravencrawler
rogerbot
dotbot
semrushbot
semrushbot-sa
powermapper
swiftbot

Rule Path
Allow /

Comments

  • Pantheon's documentation on robots.txt: https://pantheon.io/docs/bots-and-indexing/