lib.washington.edu
robots.txt

Robots Exclusion Standard data for lib.washington.edu

Resource Scan

Scan Details

Site Domain lib.washington.edu
Base Domain washington.edu
Scan Status Ok
Last Scan2024-05-22T19:00:10+00:00
Next Scan 2024-06-21T19:00:10+00:00

Last Scan

Scanned2024-05-22T19:00:10+00:00
URL https://lib.washington.edu/robots.txt
Domain IPs 128.95.104.140
Response IP 128.95.104.140
Found Yes
Hash 1dbaf791f11b25dc8ff622ba43870bc5a02cb66c20718747b2a8fe8e46312096
SimHash 9a98532f8f56

Groups

*

Rule Path
Disallow /test/
Disallow /archive/
Disallow /scripts/
Disallow *.bak
Disallow /css/
Disallow /inc/
Disallow /resource/
Disallow /digitalregistry/
Disallow /cproxy.pac

ultraseek

Rule Path
Disallow /test/
Disallow /scripts/
Disallow /isapi/
Disallow /samples/
Disallow /srchadm/
Disallow /Archive/
Disallow *.bak
Disallow /css/
Disallow /inc/
Disallow /resource/

browsershots

Rule Path
Disallow

bubing

Rule Path
Disallow /

Comments

  • robots.txt for www.lib.washington.edu
  • stay out of test space
  • uncommented next line 5-7-99 to see if it stops the ssi
  • crashes on saturday
  • recommented 8-30-99
  • Disallow: /