pentagon.mil
robots.txt
Robots Exclusion Standard data for pentagon.mil
Resource Scan
Scan Details
Site Domain | pentagon.mil |
Base Domain | pentagon.mil |
Scan Status | Ok |
Last Scan | 2024-10-11T03:08:20+00:00 |
Next Scan | 2024-11-10T03:08:20+00:00 |
Last Scan
Scanned | 2024-10-11T03:08:20+00:00 |
URL | https://www.pentagon.mil/robots.txt |
Redirect | https://www.defense.gov/robots.txt |
Redirect Domain | www.defense.gov |
Redirect Base | defense.gov |
Domain IPs | 23.44.4.153, 23.44.4.154, 2600:1413:b000:6::17d5:2bca, 2600:1413:b000:6::17d5:2bd4 |
Redirect IPs | 104.110.130.214, 2600:1409:9800:1a3::3a30, 2600:1409:9800:1b1::3a30 |
Response IP | 23.198.122.203 |
Found | Yes |
Hash | 435e6773683d828b7bfc77f1e7a61565c2c9bfa309126a148090abf1662faa55 |
SimHash | 7b0551018f06 |
Groups
*
Rule | Path |
---|---|
Disallow | *captcha* |
Disallow | /*Print.aspx |
Disallow | /*.axd$ |
Disallow | /*.exe$ |
Disallow | /bin/ |
Disallow | /Bin/ |
Disallow | /*.bin$ |
Disallow | /*.dll$ |
Disallow | /*.ssi$ |
Disallow | /Error/ |
Disallow | /Controls/ |
Disallow | /controls/ |
Disallow | /Utility/ |
Disallow | /install/ |
Disallow | /Admin/ |
Disallow | /App_Browser/ |
Disallow | /App_Code/ |
Disallow | /App_Data/ |
Disallow | /App_GlobalResources/ |
Disallow | /Components/ |
Disallow | /Config/ |
Disallow | /Documentation/ |
Disallow | /Install/ |
Disallow | /Providers/ |
Other Records
Field | Value |
---|---|
sitemap | /DesktopModules/SiteData/SiteMap.ashx |