cdip.ucsd.edu
robots.txt

Robots Exclusion Standard data for cdip.ucsd.edu

Resource Scan

Scan Details

Site Domain cdip.ucsd.edu
Base Domain ucsd.edu
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-07-24T15:58:27+00:00
Next Scan 2025-10-22T15:58:27+00:00

Last Successful Scan

Scanned2025-03-03T18:44:02+00:00
URL https://cdip.ucsd.edu/robots.txt
Domain IPs 169.228.225.137
Response IP 169.228.225.137
Found Yes
Hash 78749da7bb0f17bfccfdacdf741933410f5f31d87d5dae649edc2a2977ee9890
SimHash 61428033a5d0

Groups

*

Rule Path
Disallow /

googlebot

Rule Path
Allow /
Disallow /cgi-bin/
Disallow /sand/
Disallow /gifs/
Disallow /jpgs/
Disallow /pngs/
Disallow /offline/
Disallow /models/
Disallow /cdip_htmls/
Disallow /elnino_htmls/
Disallow /themes_dev/
Disallow /themes/data/download/
Disallow /*?*end=
Disallow /*?*d999=
Disallow /*?*%3Adt%3A

Other Records

Field Value
crawl-delay 30

bingbot

Rule Path
Allow /
Disallow /cgi-bin/
Disallow /sand/
Disallow /gifs/
Disallow /jpgs/
Disallow /pngs/
Disallow /offline/
Disallow /models/
Disallow /cdip_htmls/
Disallow /elnino_htmls/
Disallow /themes_dev/
Disallow /themes/data/download/
Disallow /*?*end=
Disallow /*?*d999=
Disallow /*?*%3Adt%3A

Other Records

Field Value
crawl-delay 30

Comments

  • Allow Googlebot but restrict certain directories
  • Allow Bingbot but restrict certain directories