spacecom.mil
robots.txt
Robots Exclusion Standard data for spacecom.mil
Resource Scan
Scan Details
Site Domain | spacecom.mil |
Base Domain | spacecom.mil |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2024-08-21T09:10:59+00:00 |
Next Scan | 2024-11-19T09:10:59+00:00 |
Last Successful Scan
Scanned | 2024-04-01T04:50:00+00:00 |
URL | https://www.spacecom.mil/robots.txt |
Domain IPs | 2600:1413:b000:6::17d5:2bca, 2600:1413:b000:6::17d5:2bd4, 96.17.96.27, 96.17.96.8 |
Response IP | 104.88.70.121 |
Found | Yes |
Hash | 435e6773683d828b7bfc77f1e7a61565c2c9bfa309126a148090abf1662faa55 |
SimHash | 7b0551018f06 |
Groups
*
Rule | Path |
---|---|
Disallow | *captcha* |
Disallow | /*Print.aspx |
Disallow | /*.axd$ |
Disallow | /*.exe$ |
Disallow | /bin/ |
Disallow | /Bin/ |
Disallow | /*.bin$ |
Disallow | /*.dll$ |
Disallow | /*.ssi$ |
Disallow | /Error/ |
Disallow | /Controls/ |
Disallow | /controls/ |
Disallow | /Utility/ |
Disallow | /install/ |
Disallow | /Admin/ |
Disallow | /App_Browser/ |
Disallow | /App_Code/ |
Disallow | /App_Data/ |
Disallow | /App_GlobalResources/ |
Disallow | /Components/ |
Disallow | /Config/ |
Disallow | /Documentation/ |
Disallow | /Install/ |
Disallow | /Providers/ |
Other Records
Field | Value |
---|---|
sitemap | /DesktopModules/SiteData/SiteMap.ashx |