heasarc.gsfc.nasa.gov
robots.txt

Robots Exclusion Standard data for heasarc.gsfc.nasa.gov

Resource Scan

Scan Details

Site Domain heasarc.gsfc.nasa.gov
Base Domain nasa.gov
Scan Status Ok
Last Scan2025-09-30T18:40:36+00:00
Next Scan 2025-10-30T18:40:36+00:00

Last Scan

Scanned2025-09-30T18:40:36+00:00
URL https://heasarc.gsfc.nasa.gov/robots.txt
Domain IPs 129.164.179.23, 2001:4d0:2310:150::23
Response IP 129.164.179.23
Found Yes
Hash 08d75ac582d597bcf8879a9bc9037f281f7505f0d13b57cda5906f01889f38dc
SimHash 4420ededc574

Groups

*

Rule Path
Disallow /FTP/
Disallow /cgi-bin/
Disallow /db-perl/
Disallow /cgi-perl/
Disallow /xte_weather/data/
Disallow /Tools/xraybg/images/
Disallow /Tools/xraybg/spectrum/
Disallow /vo/
Disallow /uadb/
Disallow /srvlogin/
Disallow /xamin
Disallow /vo/validation/bin/

Comments

  • robots.txt for http://heasarc.gsfc.nasa.gov/
  • Keep them robots out of the FTP area.