web.cs.ucla.edu
robots.txt

Robots Exclusion Standard data for web.cs.ucla.edu

Resource Scan

Scan Details

Site Domain web.cs.ucla.edu
Base Domain ucla.edu
Scan Status Ok
Last Scan2025-10-15T09:00:34+00:00
Next Scan 2025-11-14T09:00:34+00:00

Last Scan

Scanned2025-10-15T09:00:34+00:00
URL https://web.cs.ucla.edu/robots.txt
Domain IPs 131.179.128.29
Response IP 131.179.128.29
Found Yes
Hash ec65d0aa35aa5461a826b2664598929ac44c61b7fa0d3cc2344839c6689d2af2
SimHash d100194407d6

Groups

*

Rule Path
Disallow /r_share1
Disallow /cgi-bin

Comments

  • robots.txt for http://www.cs.ucla.edu/