library.ucsd.edu
robots.txt

Robots Exclusion Standard data for library.ucsd.edu

Resource Scan

Scan Details

Site Domain library.ucsd.edu
Base Domain ucsd.edu
Scan Status Ok
Last Scan2024-06-01T08:12:51+00:00
Next Scan 2024-07-01T08:12:51+00:00

Last Scan

Scanned2024-06-01T08:12:51+00:00
URL https://library.ucsd.edu/robots.txt
Domain IPs 132.239.119.5
Response IP 132.239.119.5
Found Yes
Hash 406320336e830d1cbd4d11b9981c9d5ed87586b6c51db3f55bb82db3773cb6ce
SimHash 2109eef0af1d

Groups

mauibot

Rule Path
Disallow /dc/

*

Rule Path
Disallow /chronopolis-staging/
Disallow /sdta-staging/
Disallow /lpw-staging/
Disallow /dc/search

Other Records

Field Value
sitemap https://library.ucsd.edu/sitemap.xml