de.gov
robots.txt

Robots Exclusion Standard data for de.gov

Resource Scan

Scan Details

Site Domain de.gov
Base Domain de.gov
Scan Status Ok
Last Scan2026-01-01T19:04:26+00:00
Next Scan 2026-01-31T19:04:26+00:00

Last Scan

Scanned2026-01-01T19:04:26+00:00
URL https://de.gov/robots.txt
Response IP 167.21.84.89
Found Yes
Hash 2f5d0fd06a788d5debe3239357cd6eaf78328c5adeab4fbf9dfabf1a53889af4
SimHash 3ca04c228993

Groups

archive.org_bot

Rule Path
Disallow

*

Rule Path
Disallow /contact
Disallow /admin
Disallow /pages
Disallow /sso
Disallow /courtsearch