www.ocean.washington.edu
robots.txt

Robots Exclusion Standard data for www.ocean.washington.edu

Resource Scan

Scan Details

Site Domain www.ocean.washington.edu
Base Domain washington.edu
Scan Status Ok
Last Scan2025-10-23T15:36:40+00:00
Next Scan 2025-11-22T15:36:40+00:00

Last Scan

Scanned2025-10-23T15:36:40+00:00
URL https://www.ocean.washington.edu/robots.txt
Domain IPs 128.208.60.155
Response IP 128.208.60.155
Found Yes
Hash b64e3f74d0cc010e2d1d2496b7b551f611a42e823db32cc603c7c90fa441053f
SimHash c9c94d4f4e27

Groups

googlebot

Rule Path
Allow /

*

Rule Path Comment
Disallow / -
Disallow /cgi-bin/ script files
Disallow /usage/ not worth indexing
Disallow /test/ not worth indexing
Disallow /bin/ web management tools
Disallow /index.html.9* backup copies
Disallow /news/ changes too frequently
Disallow /people/grads/vbhat/noindex/ -
Disallow /computing/ not accessible off campus
Disallow /analog.images -
Disallow /construction -
Disallow /cosee -
Disallow /dabob -
Disallow /data -
Disallow /docs -
Disallow /employment -
Disallow /education -
Disallow /exploraquarium -
Disallow /fluids -
Disallow /gec -
Disallow /general -
Disallow /gifs -
Disallow /gis -
Disallow /htbook -
Disallow /hydrothermalvents -
Disallow /info -
Disallow /lab -
Disallow /manual -
Disallow /mcduff -
Disallow /neptune -
Disallow /new -
Disallow /new_building -
Disallow /news -
Disallow /nsf -
Disallow /ocean_news -
Disallow /ocean_web -
Disallow /ocean_web2 -
Disallow /old -
Disallow /oldeducation -
Disallow /oldfacilities -
Disallow /openhouse2001 -
Disallow /orgs -
Disallow /osb -
Disallow /ots -
Disallow /outreach -
Disallow /ow -
Disallow /pcc -
Disallow /phys -
Disallow /piranha -
Disallow /prchs -
Disallow /puget -
Disallow /research -
Disallow /revelhide -
Disallow /rise -
Disallow /scigifs -
Disallow /services -
Disallow /ships -
Disallow /sos -
Disallow /ssd -
Disallow /temp -
Disallow /test -
Disallow /tsunamissh2_files -
Disallow /ugforum -
Disallow /www -
Disallow /uwlinks -

Comments

  • robots.txt for http://cerveza.ocean.washington.edu/
  • see <http://web.nexor.co.uk/mak/doc/robots/norobots.html> for an explanation

Warnings

  • 3 invalid lines.