digicoll.lib.berkeley.edu
robots.txt
Robots Exclusion Standard data for digicoll.lib.berkeley.edu
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | digicoll.lib.berkeley.edu |
| Base Domain | berkeley.edu |
| Scan Status | Ok |
| Last Scan | 2025-12-11T00:10:55+00:00 |
| Next Scan | 2026-01-10T00:10:55+00:00 |
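The two timestamps above imply a rescan interval. The sketch below works it out, assuming the scanner schedules the next scan by adding a fixed interval to the last scan time. The variable names are illustrative and do not come from the scanner itself.

```python
from datetime import datetime

# Timestamps copied from the Scan Details table (ISO 8601, UTC offset included).
last_scan = datetime.fromisoformat("2025-12-11T00:10:55+00:00")
next_scan = datetime.fromisoformat("2026-01-10T00:10:55+00:00")

# Subtracting two timezone-aware datetimes yields a timedelta.
interval = next_scan - last_scan
print(interval.days)  # 30
```

The difference is exactly 30 days with no leftover seconds, which is consistent with a monthly rescan schedule.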
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-12-11T00:10:55+00:00 |
| URL | https://digicoll.lib.berkeley.edu/robots.txt |
| Domain IPs | 54.170.61.187, 54.229.193.130 |
| Response IP | 54.229.193.130 |
| Found | Yes |
| Hash | 68b2b1af1ea21038997a6cf7b9c35823afabb488a52af0733fdbc5f812c4f59d |
| SimHash | 6d2d9c50c199 |
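The Hash value is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. The report does not name the algorithm, so the sketch below assumes SHA-256 over the raw response body. The function name `robots_fingerprint` is hypothetical.

```python
import hashlib


def robots_fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


# Illustrative input only; this is not the file the scanner fetched.
sample_body = b"User-agent: *\nDisallow: /search\n"
digest = robots_fingerprint(sample_body)
print(len(digest))  # 64
```

Comparing the digest of a freshly fetched file with the stored Hash would be one way to detect whether the file changed between scans, if the assumption about the algorithm holds.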
Groups
*
| Rule | Path |
|---|---|
| Disallow | /rss |
| Disallow | /search |
Other Records
| Field | Value |
|---|---|
| crawl-delay | 5 |
Other Records
| Field | Value |
|---|---|
| sitemap | https://digicoll.lib.berkeley.edu/sitemap_index.xml.gz |
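The group and records above can be checked against a crawler's view of the site with Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text below is a reconstruction from the parsed records, not the fetched file: the original formatting and the one invalid line reported under Warnings are unknown and are left out. The user agent name `ExampleBot` and the `/record/123` path are made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report (assumption: not the verbatim file).
ROBOTS_TXT = """\
User-agent: *
Disallow: /rss
Disallow: /search
Crawl-delay: 5

Sitemap: https://digicoll.lib.berkeley.edu/sitemap_index.xml.gz
"""

BASE = "https://digicoll.lib.berkeley.edu"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Disallow rules are prefix matches on the URL path.
print(parser.can_fetch("ExampleBot", BASE + "/search?q=maps"))  # False
print(parser.can_fetch("ExampleBot", BASE + "/rss"))            # False
print(parser.can_fetch("ExampleBot", BASE + "/record/123"))     # True

# Crawl-delay from the matching group, in seconds.
print(parser.crawl_delay("ExampleBot"))  # 5

# Sitemap records apply to the whole file (site_maps needs Python 3.8+).
print(parser.site_maps())
```

Because the only group is `*`, any user agent name gets the same answers here. A crawler honoring these records would skip `/rss` and `/search`, wait 5 seconds between requests, and discover URLs through the sitemap index.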
Warnings
- 1 invalid line.