inst.eecs.berkeley.edu
robots.txt

Robots Exclusion Standard data for inst.eecs.berkeley.edu

Resource Scan

Scan Details

Site Domain inst.eecs.berkeley.edu
Base Domain berkeley.edu
Scan Status Ok
Last Scan 2025-06-01T21:05:20+00:00
Next Scan 2025-07-01T21:05:20+00:00

Last Scan

Scanned 2025-06-01T21:05:20+00:00
URL https://inst.eecs.berkeley.edu/robots.txt
Domain IPs 128.32.42.199
Response IP 128.32.42.199
Found Yes
Hash f20df6ced3132c7e65bac535e1c938e4146c5520637504d3bf72a0ff2e302e20
SimHash 2580416c8117

Groups

*

Rule      Path                       Comment
Disallow  /images/                   -
Disallow  /roster/                   -
Disallow  /roster2/                  -
Disallow  /testphp/                  -
Disallow  /php_manual/               -
Disallow  /postgres/                 -
Disallow  /manual/                   -
Disallow  /logs/                     -
Disallow  /cgi-bin/                  -
Disallow  /usr/local/apache2/manual  added by kevinm 2/1/06
Disallow  /~scheme                   -
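
As a rough illustration of how a crawler might apply the group above, the sketch below rebuilds the rules as robots.txt text and checks a few paths with Python's standard urllib.robotparser module. The robots.txt text is reconstructed from the table, not copied from the live file, so its exact layout is an assumption. The helper name is_allowed, the user agent "ExampleBot", and the sample paths are hypothetical and chosen only for the example.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule table above; the live file's exact
# formatting (spacing, comment placement) is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /images/
Disallow: /roster/
Disallow: /roster2/
Disallow: /testphp/
Disallow: /php_manual/
Disallow: /postgres/
Disallow: /manual/
Disallow: /logs/
Disallow: /cgi-bin/
Disallow: /usr/local/apache2/manual  # added by kevinm 2/1/06
Disallow: /~scheme
"""

BASE_URL = "https://inst.eecs.berkeley.edu"

# Parse the rules from memory instead of fetching them over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path, agent="ExampleBot"):
    """Return True if `agent` may fetch `path` under the rules above.

    Both the helper and the default agent name are hypothetical.
    """
    return parser.can_fetch(agent, BASE_URL + path)


if __name__ == "__main__":
    # Sample paths are made up for illustration.
    for path in ("/index.html", "/roster/list.html", "/cgi-bin/test.cgi"):
        print(path, "allowed" if is_allowed(path) else "disallowed")
```

Because every rule sits in the * group, the result is the same for any user agent name. Matching is by path prefix, so a rule without a trailing slash, such as /~scheme, also covers any longer path that begins with the same characters.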