waterboards.ca.gov
robots.txt
Robots Exclusion Standard data for waterboards.ca.gov
Resource Scan
Scan Details
Site Domain | waterboards.ca.gov |
Base Domain | ca.gov |
Scan Status | Ok |
Last Scan | 2024-11-11T18:40:17+00:00 |
Next Scan | 2024-12-11T18:40:17+00:00 |
Last Scan
Scanned | 2024-11-11T18:40:17+00:00 |
URL | https://waterboards.ca.gov/robots.txt |
Domain IPs | 199.83.129.36, 199.83.131.36 |
Response IP | 199.83.131.36 |
Found | Yes |
Hash | 8ba9d743be5ebf60b6b5510c88bbe5a3349be2cff892b95dfd2f5171d9c55860 |
SimHash | 0905750b7710 |
Groups
*
Rule | Path |
---|---|
Disallow | /images/ |
Disallow | /css/ |
Disallow | /javascript/ |
Disallow | /water_issues/programs/ewrims/statements/ |
Disallow | /water_issues/programs/ewrims/statements/docs |
Disallow | /water_issues/programs/ewrims/wrims-data/ |
Disallow | /water_issues/programs/ewrims/wrims-permits/ |
Disallow | /ewrims/statements/ |
Disallow | /ewrims/statements/docs/ |