www.cgl.ucsf.edu
robots.txt

Robots Exclusion Standard data for www.cgl.ucsf.edu

Resource Scan

Scan Details

Site Domain www.cgl.ucsf.edu
Base Domain ucsf.edu
Scan Status Ok
Last Scan2025-07-02T16:40:41+00:00
Next Scan 2025-08-01T16:40:41+00:00

Last Scan

Scanned2025-07-02T16:40:41+00:00
URL https://www.cgl.ucsf.edu/robots.txt
Domain IPs 169.230.27.29
Response IP 169.230.27.29
Found Yes
Hash c3e4fb6f0584624cadd80b1b2d9e0f9b18931ada8f4153c85380a2070a461776
SimHash d01191e495b1

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /icons
Disallow /reports
Disallow /home/gregc/ggsp
Disallow /home/sparky/python-doc
Disallow /trac
Disallow /stats
Disallow /tef
Disallow /gregc
Disallow /bmi206
Disallow /Outreach/bmi206