army.mil
robots.txt

Robots Exclusion Standard data for army.mil

Resource Scan

Scan Details

Site Domain army.mil
Base Domain army.mil
Scan Status Ok
Last Scan2025-07-09T13:18:59+00:00
Next Scan 2025-08-08T13:18:59+00:00

Last Scan

Scanned2025-07-09T13:18:59+00:00
URL https://www.army.mil/robots.txt
Domain IPs 23.209.46.133, 23.209.46.160, 2600:1413:5000:12::1737:27e9, 2600:1413:5000:12::1737:27f9
Response IP 23.54.118.43
Found Yes
Hash 3b74deff710bd6328d05efec733c0f09b935d00c2c4b33c38a44623779fcd820
SimHash 2b5888300693

Groups

archive.org_bot

Rule Path
Disallow /

*

Rule Path
Disallow /usapa/
Disallow /USAPA/
Disallow /contact/
Disallow /contactus/
Disallow /e2/-images/
Disallow /mobile/article/
Disallow /aps/
Allow /e2/-images/**/*.jpg$

Other Records

Field Value
sitemap https://www.army.mil/sitemap.xml