www.cac.ca.gov
robots.txt

Robots Exclusion Standard data for www.cac.ca.gov

Resource Scan

Scan Details

Site Domain www.cac.ca.gov
Base Domain ca.gov
Scan Status Ok
Last Scan2024-10-29T01:23:39+00:00
Next Scan 2024-11-28T01:23:39+00:00

Last Scan

Scanned2024-10-29T01:23:39+00:00
URL https://www.cac.ca.gov/robots.txt
Redirect https://arts.ca.gov/robots.txt
Redirect Domain arts.ca.gov
Redirect Base ca.gov
Domain IPs 23.185.0.4, 2620:12a:8000::4, 2620:12a:8001::4
Redirect IPs 23.185.0.4, 2620:12a:8000::4, 2620:12a:8001::4
Response IP 23.185.0.4
Found Yes
Hash ddfd66a35ecd9162a7c0d7d911b781c36651dd380cf81aa7a3670f34d367f37c
SimHash 61004c408f93

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://dev-cacgov.pantheonsite.io/sitemap_index.xml