mit.edu
robots.txt

Robots Exclusion Standard data for mit.edu

Resource Scan

Scan Details

Site Domain mit.edu
Base Domain mit.edu
Scan Status Ok
Last Scan2024-11-13T12:09:04+00:00
Next Scan 2024-11-20T12:09:04+00:00

Last Scan

Scanned2024-11-13T12:09:04+00:00
URL https://mit.edu/robots.txt
Redirect https://web.mit.edu/robots.txt
Redirect Domain web.mit.edu
Redirect Base mit.edu
Domain IPs 173.222.144.77, 2600:1413:b000:797::255e, 2600:1413:b000:798::255e
Redirect IPs 173.222.144.77, 2600:1413:b000:797::255e, 2600:1413:b000:798::255e
Response IP 104.69.40.155
Found Yes
Hash ca2d90cb204493d02cca95f997ecf03c181ea5b306756c4fb759d7cebfc5c6a5
SimHash 3c164745c793

Groups

*

Rule Path
Disallow /afs/
Disallow /cgi-bin/
Disallow /user/
Disallow /org/
Disallow /activity/
Disallow /contrib/
Disallow /dept/
Disallow /software/
Disallow /bin/