world.public.law
robots.txt
Robots Exclusion Standard data for world.public.law
Resource Scan
Scan Details
Site Domain | world.public.law |
Base Domain | public.law |
Scan Status | Ok |
Last Scan | 2024-05-08T16:40:47+00:00 |
Next Scan | 2024-05-22T16:40:47+00:00 |
Last Scan
Scanned | 2024-05-08T16:40:47+00:00 |
URL | https://world.public.law/robots.txt |
Redirect | https://www.public.law/robots.txt |
Redirect Domain | www.public.law |
Redirect Base | public.law |
Domain IPs | 172.66.40.101, 172.66.43.155, 2606:4700:3108::ac42:2865, 2606:4700:3108::ac42:2b9b |
Redirect IPs | 172.66.40.101, 172.66.43.155, 2606:4700:3108::ac42:2865, 2606:4700:3108::ac42:2b9b |
Response IP | 172.66.43.155 |
Found | Yes |
Hash | e84bf1c57153984ec027824045085d9da189b73664c6c7384364a2e667ccea16 |
SimHash | 8d00c5100911 |
Groups
*
Rule | Path |
---|---|
Disallow | /admin |
Disallow | /api |
Disallow | /data |
Disallow | /demo |
Disallow | /images |
Other Records
Field | Value |
---|---|
sitemap | https://www.public.law/sitemaps/sitemap.xml.gz |