waterboards.ca.gov
robots.txt

Robots Exclusion Standard data for waterboards.ca.gov

Resource Scan

Scan Details

Site Domain waterboards.ca.gov
Base Domain ca.gov
Scan Status Ok
Last Scan2024-11-11T18:40:17+00:00
Next Scan 2024-12-11T18:40:17+00:00

Last Scan

Scanned2024-11-11T18:40:17+00:00
URL https://waterboards.ca.gov/robots.txt
Domain IPs 199.83.129.36, 199.83.131.36
Response IP 199.83.131.36
Found Yes
Hash 8ba9d743be5ebf60b6b5510c88bbe5a3349be2cff892b95dfd2f5171d9c55860
SimHash 0905750b7710

Groups

*

Rule Path
Disallow /images/
Disallow /css/
Disallow /javascript/
Disallow /water_issues/programs/ewrims/statements/
Disallow /water_issues/programs/ewrims/statements/docs
Disallow /water_issues/programs/ewrims/wrims-data/
Disallow /water_issues/programs/ewrims/wrims-permits/
Disallow /ewrims/statements/
Disallow /ewrims/statements/docs/