treasurer.ca.gov
robots.txt

Robots Exclusion Standard data for treasurer.ca.gov

Resource Scan

Scan Details

Site Domain treasurer.ca.gov
Base Domain ca.gov
Scan Status Ok
Last Scan2024-08-28T18:42:19+00:00
Next Scan 2024-09-27T18:42:19+00:00

Last Scan

Scanned2024-08-28T18:42:19+00:00
URL https://treasurer.ca.gov/robots.txt
Redirect https://www.treasurer.ca.gov/robots.txt
Redirect Domain www.treasurer.ca.gov
Redirect Base ca.gov
Domain IPs 45.223.157.127, 45.223.58.127
Redirect IPs 45.223.148.127
Response IP 45.223.148.127
Found Yes
Hash 78ce78be1fe8e020a66e76d993b2bafdfe1ba4e093583c4a00e8260a752881a2
SimHash 2b5a5144e993

Groups

*

Rule Path
Disallow /_private/
Disallow /cgi-bin/
Disallow /js/
Disallow /style/
Disallow /ssi/
Disallow /common/
Disallow /link_checker.txt
Disallow /contact_post1.asp
Disallow /contact_post2.asp
Disallow /path.asp
Disallow /search.asp
Disallow /webcomments.asp
Disallow /comments.asp
Disallow /ctcac/arra_apps/questions.pdf
Disallow /aup.pdf
Disallow /chffa/bondbuyer.pdf
Disallow /inside/divisions/cashmanagement.asp
Disallow /dms/library.asp

Other Records

Field Value
sitemap http://www.treasurer.ca.gov/sitemap.xml