spacecom.mil
robots.txt

Robots Exclusion Standard data for spacecom.mil

Resource Scan

Scan Details

Site Domain spacecom.mil
Base Domain spacecom.mil
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-08-21T09:10:59+00:00
Next Scan 2024-11-19T09:10:59+00:00

Last Successful Scan

Scanned2024-04-01T04:50:00+00:00
URL https://www.spacecom.mil/robots.txt
Domain IPs 2600:1413:b000:6::17d5:2bca, 2600:1413:b000:6::17d5:2bd4, 96.17.96.27, 96.17.96.8
Response IP 104.88.70.121
Found Yes
Hash 435e6773683d828b7bfc77f1e7a61565c2c9bfa309126a148090abf1662faa55
SimHash 7b0551018f06

Groups

*

Rule Path
Disallow *captcha*
Disallow /*Print.aspx
Disallow /*.axd$
Disallow /*.exe$
Disallow /bin/
Disallow /Bin/
Disallow /*.bin$
Disallow /*.dll$
Disallow /*.ssi$
Disallow /Error/
Disallow /Controls/
Disallow /controls/
Disallow /Utility/
Disallow /install/
Disallow /Admin/
Disallow /App_Browser/
Disallow /App_Code/
Disallow /App_Data/
Disallow /App_GlobalResources/
Disallow /Components/
Disallow /Config/
Disallow /Documentation/
Disallow /Install/
Disallow /Providers/

Other Records

Field Value
sitemap /DesktopModules/SiteData/SiteMap.ashx