www.gov.ca.gov
robots.txt

Robots Exclusion Standard data for www.gov.ca.gov

Resource Scan

Scan Details

Site Domain www.gov.ca.gov
Base Domain ca.gov
Scan Status Ok
Last Scan2024-10-18T21:50:17+00:00
Next Scan 2024-11-17T21:50:17+00:00

Last Scan

Scanned2024-10-18T21:50:17+00:00
URL https://www.gov.ca.gov/robots.txt
Domain IPs 141.193.213.10, 141.193.213.11
Response IP 141.193.213.11
Found Yes
Hash 0c60a999230ace0235bafd0d3332c0923f7d0c0627a86146310e6ffb0b0b2a3b
SimHash 690898008bb2

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://www.gov.ca.gov/sitemap.xml
sitemap https://www.gov.ca.gov/sitemap.html