digicoll.lib.berkeley.edu
robots.txt

Robots Exclusion Standard data for digicoll.lib.berkeley.edu

Resource Scan

Scan Details

Site Domain digicoll.lib.berkeley.edu
Base Domain berkeley.edu
Scan Status Ok
Last Scan 2025-12-11T00:10:55+00:00
Next Scan 2026-01-10T00:10:55+00:00

Last Scan

Scanned 2025-12-11T00:10:55+00:00
URL https://digicoll.lib.berkeley.edu/robots.txt
Domain IPs 54.170.61.187, 54.229.193.130
Response IP 54.229.193.130
Found Yes
Hash 68b2b1af1ea21038997a6cf7b9c35823afabb488a52af0733fdbc5f812c4f59d
SimHash 6d2d9c50c199

Groups

*

Rule Path
Disallow /rss
Disallow /search

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap https://digicoll.lib.berkeley.edu/sitemap_index.xml.gz
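As a rough illustration of how a crawler might interpret the records listed above, the sketch below feeds a reconstructed copy of the file to Python's standard `urllib.robotparser`. The reconstructed text is an assumption assembled from the scan's Groups and Other Records tables; the actual file also contains one invalid line that the report does not show. The agent name `ExampleBot` and the path `/record/1` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the scanned file from the records in this report.
# The real file also contains one invalid line that the report does not show.
RECONSTRUCTED_ROBOTS_TXT = """\
User-agent: *
Disallow: /rss
Disallow: /search
Crawl-delay: 5
Sitemap: https://digicoll.lib.berkeley.edu/sitemap_index.xml.gz
"""

BASE = "https://digicoll.lib.berkeley.edu"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text offline, without fetching anything."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(RECONSTRUCTED_ROBOTS_TXT)

if __name__ == "__main__":
    agent = "ExampleBot"  # hypothetical crawler name; falls under the "*" group
    for path in ("/rss", "/search?q=maps", "/record/1"):  # /record/1 is hypothetical
        print(path, "allowed" if parser.can_fetch(agent, BASE + path) else "disallowed")
    print("crawl delay (seconds):", parser.crawl_delay(agent))
    print("sitemaps:", parser.site_maps())  # site_maps() requires Python 3.8+
```

Under these assumptions, paths beginning with `/rss` or `/search` are refused for every agent, while other paths are permitted subject to the 5-second crawl delay.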

Warnings

  • 1 invalid line.