4d.gsm.cornell.edu
robots.txt

Robots Exclusion Standard data for 4d.gsm.cornell.edu

Resource Scan

Scan Details

Site Domain 4d.gsm.cornell.edu
Base Domain cornell.edu
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-04-20T05:52:13+00:00
Next Scan 2024-05-20T05:52:13+00:00

Last Successful Scan

Scanned2024-02-28T05:51:20+00:00
URL https://4d.gsm.cornell.edu/robots.txt
Domain IPs 23.185.0.3, 2620:12a:8000::3, 2620:12a:8001::3
Response IP 23.185.0.3
Found Yes
Hash b098941f91fefc0d00012abede2e6459b9fab7d46d16e88ad14468bb0ac6cf5a
SimHash 9848d322b662

Groups

*

Rule Path
Disallow /

ravencrawler
rogerbot
dotbot
semrushbot
siteauditbot
splitsignalbot
powermapper
swiftbot
lyticsbot
dubbotbot

Rule Path
Allow /

Comments

  • Pantheon's documentation on robots.txt: https://pantheon.io/docs/bots-and-indexing/