web.mit.edu
robots.txt

Robots Exclusion Standard data for web.mit.edu

Resource Scan

Scan Details

Site Domain web.mit.edu
Base Domain mit.edu
Scan Status Ok
Last Scan2024-10-30T23:17:17+00:00
Next Scan 2024-11-29T23:17:17+00:00

Last Scan

Scanned2024-10-30T23:17:17+00:00
URL https://web.mit.edu/robots.txt
Domain IPs 173.222.144.77, 2600:1413:1:482::255e, 2600:1413:1:49a::255e
Response IP 104.69.152.71
Found Yes
Hash ca2d90cb204493d02cca95f997ecf03c181ea5b306756c4fb759d7cebfc5c6a5
SimHash 3c164745c793

Groups

*

Rule Path
Disallow /afs/
Disallow /cgi-bin/
Disallow /user/
Disallow /org/
Disallow /activity/
Disallow /contrib/
Disallow /dept/
Disallow /software/
Disallow /bin/