nssdc.gsfc.nasa.gov
robots.txt

Robots Exclusion Standard data for nssdc.gsfc.nasa.gov

Resource Scan

Scan Details

Site Domain nssdc.gsfc.nasa.gov
Base Domain nasa.gov
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-09-30T14:05:37+00:00
Next Scan 2025-12-29T14:05:37+00:00

Last Successful Scan

Scanned2025-05-11T12:41:56+00:00
URL https://nssdc.gsfc.nasa.gov/robots.txt
Domain IPs 169.154.128.124, 2001:4d0:2418:128::124
Response IP 169.154.128.124
Found Yes
Hash b944e60ac34b04389a750f101eb452fa063c5165c9a95993595012d9942a61c5
SimHash b0586b6dcc32

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /bu
Disallow /ccsds
Disallow /database
Disallow /in
Disallow /lo
Disallow /nasa
Disallow /nmc/t
Disallow /nmc/masterCatalog
Disallow /sswg
Disallow /st
Disallow /util
Disallow /nmc/personDisplay
Disallow /nmc/publicationDisplay
Disallow /nmc/SpacecraftQ
Disallow /nmc/ExperimentQ
Disallow /nmc/DatasetQ
Disallow /nmc/PersonQ
Disallow /nmc/PublicationQ
Disallow /nmc/MapQ
Disallow /nmc/EventQ
Disallow /nmc/NewDataQ
Disallow /man

gsfcbot

Rule Path
Disallow /nmc/

Comments

  • /robots.txt for nssdc.gsfc.nasa.gov
  • See http://info.webcrawler.com/mak/projects/robots/norobots.html
  • by default