cdn.loc.gov
robots.txt

Robots Exclusion Standard data for cdn.loc.gov

Resource Scan

Scan Details

Site Domain cdn.loc.gov
Base Domain loc.gov
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2026-01-07T05:04:35+00:00
Next Scan 2026-04-07T05:04:35+00:00

Last Successful Scan

Scanned2025-08-16T20:09:35+00:00
URL https://cdn.loc.gov/robots.txt
Domain IPs 104.17.6.58, 104.18.64.82, 2606:4700::6811:63a, 2606:4700::6812:4052
Response IP 104.18.64.82
Found Yes
Hash 0a9d8412d50745749b0d6aa3a184138408108b2ccffd9a321103bcf048a078c6
SimHash 6a10d8106382

Groups

*

Rule Path
Disallow /master
Disallow /service

Other Records

Field Value
crawl-delay 10