blink.ucsd.edu
robots.txt

Robots Exclusion Standard data for blink.ucsd.edu

Resource Scan

Scan Details

Site Domain blink.ucsd.edu
Base Domain ucsd.edu
Scan Status Ok
Last Scan2025-08-31T04:15:25+00:00
Next Scan 2025-09-30T04:15:25+00:00

Last Scan

Scanned2025-08-31T04:15:25+00:00
URL https://blink.ucsd.edu/robots.txt
Domain IPs 44.242.35.176, 52.26.221.131
Response IP 44.242.35.176
Found Yes
Hash 691a52c61e739b9fb76dc90f752e54fd7b3ffd69f5d3b22c2f3d013460c41e57
SimHash c143f1296c83

Groups

*

Rule Path
Disallow /_files
Disallow /_draft
Disallow /_archive
Disallow /_misc
Disallow /search
Disallow /technology/network/access/ad/post-change-instructions.html
Disallow *.pdf
Disallow *.xls
Disallow *.doc

Other Records

Field Value
sitemap https://blink.ucsd.edu/sitemap.xml
sitemap https://blink.ucsd.edu/_sitemaps/buy-pay.xml
sitemap https://blink.ucsd.edu/_sitemaps/facilities.xml
sitemap https://blink.ucsd.edu/_sitemaps/faculty.xml
sitemap https://blink.ucsd.edu/_sitemaps/finance.xml
sitemap https://blink.ucsd.edu/_sitemaps/hr.xml
sitemap https://blink.ucsd.edu/_sitemaps/instructors.xml
sitemap https://blink.ucsd.edu/_sitemaps/research.xml
sitemap https://blink.ucsd.edu/_sitemaps/safety.xml
sitemap https://blink.ucsd.edu/_sitemaps/sponsor.xml
sitemap https://blink.ucsd.edu/_sitemaps/technology.xml
sitemap https://blink.ucsd.edu/_sitemaps/travel.xml

Comments

  • robots.txt for http://blink.ucsd.edu
  • Block pdf files - Non-standard but works for major search engines