regulations.justia.com
robots.txt

Robots Exclusion Standard data for regulations.justia.com

Resource Scan

Scan Details

Site Domain regulations.justia.com
Base Domain justia.com
Scan Status Ok
Last Scan2026-02-01T12:24:16+00:00
Next Scan 2026-03-03T12:24:16+00:00

Last Scan

Scanned2026-02-01T12:24:16+00:00
URL https://regulations.justia.com/robots.txt
Domain IPs 104.18.12.16, 104.18.13.16, 2606:4700::6812:c10, 2606:4700::6812:d10
Response IP 104.18.13.16
Found Yes
Hash aee2238ef5f196aef25e023a10e488811880b20cebc56e0191510a476973f197
SimHash d14cc132c315

Groups

*

Rule Path
Disallow /cfr/
Disallow /cfr/*
Disallow /cfr
Disallow /cases/federal/district-courts/BR/207/151/1536870/
Disallow /cases/federal/district-courts/BR/207/151/1536870
Disallow /cases/federal/district-courts/FSupp2/297/840/2326765/
Disallow /cases/federal/district-courts/FSupp2/297/840/2326765
Disallow /cases/federal/district-courts/FSupp/684/46/1896419/
Disallow /cases/federal/district-courts/FSupp/684/46/1896419
Disallow /cases/federal/district-courts/FSupp/919/1141/1580784/
Disallow /cases/federal/district-courts/FSupp/919/1141/1580784
Allow /