weblaws.org
robots.txt
Robots Exclusion Standard data for weblaws.org
Resource Scan
Scan Details
Site Domain | weblaws.org |
Base Domain | weblaws.org |
Scan Status | Ok |
Last Scan | 2024-05-08T14:28:18+00:00 |
Next Scan | 2024-05-15T14:28:18+00:00 |
Last Scan
Scanned | 2024-05-08T14:28:18+00:00 |
URL | https://weblaws.org/robots.txt |
Redirect | https://oregon.public.law/robots.txt |
Redirect Domain | oregon.public.law |
Redirect Base | public.law |
Domain IPs | 104.21.16.146, 172.67.213.62, 2606:4700:3033::ac43:d53e, 2606:4700:3036::6815:1092 |
Redirect IPs | 172.66.40.101, 172.66.43.155, 2606:4700:3108::ac42:2865, 2606:4700:3108::ac42:2b9b |
Response IP | 172.66.40.101 |
Found | Yes |
Hash | f04ff2f37e77c5ec98cf76b38e75a8b70de695ee35fd4806a543f3ca44f9d051 |
SimHash | ed094d100913 |
Groups
*
Rule | Path |
---|---|
Disallow | /admin |
Disallow | /api |
Disallow | /data |
Disallow | /demo |
Disallow | /images |
Other Records
Field | Value |
---|---|
sitemap | https://oregon.public.law/sitemaps/sitemap.xml.gz |