www.courts.ca.gov
robots.txt

Robots Exclusion Standard data for www.courts.ca.gov

Resource Scan

Scan Details

Site Domain www.courts.ca.gov
Base Domain ca.gov
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-10-28T07:32:38+00:00
Next Scan 2024-12-27T07:32:38+00:00

Last Successful Scan

Scanned2024-08-07T06:13:19+00:00
URL https://www.courts.ca.gov/robots.txt
Domain IPs 54.191.57.30
Response IP 54.191.57.30
Found Yes
Hash 018ad31bbbada6b33a7a4d152f6db1ca5ac61c4f526fe9cb0e5a98c558339f36
SimHash a823c82406c3

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /opinions/nonpub/
Disallow /opinions/revnppub/
Disallow /documents/jbcl-manual.pdf
Disallow /documents/jbcl-faq.pdf
Disallow /cms/archive/
Disallow /cms/courtinterpreters/
Disallow /documents/fl301.pdf
Disallow /documents/itac-20190402-materials.pdf
Disallow /41185.htm
Disallow /documents/dcacs_vendor.xsd
Disallow /archive/D072019.DOC
Disallow /archive/D072019.PDF
Disallow /cms/rules/printfriendly.cfm

Other Records

Field Value
sitemap https://www.courts.ca.gov/sitemap.xml