Topic Modeling of NASA Space System Problem Reports

被引:13
|
作者
Layman, Lucas [1 ]
Nikora, Allen P. [2 ]
Meek, Joshua [1 ]
Menzies, Tim [3 ]
机构
[1] Fraunhofer CESE, College Pk, MD 20740 USA
[2] CALTECH, Jet Prop Lab, Pasadena, CA USA
[3] North Carolina State Univ, Raleigh, NC 27695 USA
基金
美国国家科学基金会;
关键词
topic modeling; data mining; defects; natural language processing; LDA; INFORMATION-RETRIEVAL; DEFECT REPORTS;
D O I
10.1145/2901739.2901760
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Problem reports at NASA are similar to bug reports: they capture defects found during test, post-launch operational anomalies, and document the investigation and corrective action of the issue. These artifacts are a rich source of lessons learned for NASA, but are expensive to analyze since problem reports are comprised primarily of natural language text. We apply topic modeling to a corpus of NASA problem reports to extract trends in testing and operational failures. We collected 16,669 problem reports from six NASA space flight missions and applied Latent Dirichlet Allocation topic modeling to the document corpus. We analyze the most popular topics within and across missions, and how popular topics changed over the lifetime of a mission. We find that hardware material and flight software issues are common during the integration and testing phase, while ground station software and equipment issues are more common during the operations phase. We identify a number of challenges in topic modeling for trend analysis: 1) that the process of selecting the topic modeling parameters lacks de finitive guidance, 2) de fining semantically-meaningful topic labels requires non-trivial effort and domain expertise, 3) topic models derived from the combined corpus of the six missions were biased toward the larger missions, and 4) topics must be semantically distinct as well as cohesive to be useful. Nonetheless, topic modeling can identify problem themes within missions and across mission lifetimes, providing useful feedback to engineers and project managers.
引用
收藏
页码:303 / 314
页数:12
相关论文
共 50 条
  • [1] Determine Component Probability from NASA Ground System Problem Reports
    Monaghan, Mark W.
    Gillespie, Amanda M.
    [J]. 59TH ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM (RAMS), 2013,
  • [2] Space debris modeling at NASA
    Johnson, NL
    [J]. PROCEEDINGS OF THE THIRD EUROPEAN CONFERENCE ON SPACE DEBRIS, VOLS 1 AND 2, 2001, 473 : 259 - 264
  • [3] Retrieving NASA problem reports with natural language
    van Delden, S
    Gomez, F
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2002, 2553 : 150 - 159
  • [4] Assessment of the NASA Space Shuttle Program's problem reporting and corrective action system
    Korsmeyer, DJ
    Schreiner, JA
    [J]. COMPONENT AND SYSTEMS DIAGNOSTICS, PROGNOSIS AND HEALTH MANAGEMENT, 2001, 4389 : 174 - 185
  • [5] HOW NASA BUILDS A SYSTEM IN SPACE
    不详
    [J]. SYSTEMS INTEGRATION BUSINESS, 1990, 23 (09): : 40 - &
  • [6] Methodology of problem space modeling in industrial enterprise management system
    Glushchevsky, V. V.
    [J]. MARKETING AND MANAGEMENT OF INNOVATIONS, 2015, (01): : 124 - 134
  • [7] Space science - Reports will urge overhaul and delays to NASA's Mars Missions
    Lawler, A
    [J]. SCIENCE, 2000, 287 (5459) : 1722 - 1723
  • [8] Retrieving NASA problem reports: a case study in natural language information retrieval
    van Delden, S
    Gomez, F
    [J]. DATA & KNOWLEDGE ENGINEERING, 2004, 48 (02) : 231 - 246
  • [9] NASA AND SPACE
    NEWELL, HE
    [J]. BULLETIN OF THE ATOMIC SCIENTISTS, 1961, 17 (5-6) : 222 - 229
  • [10] Medical Sterilization System for NASA Space Exploration Missions
    Duda, Zachary
    Gaffney, John, II
    Graves, Christopher
    Moore, Quinlan
    Watkins, James
    Nagel, Jacquelyn
    [J]. 2017 SYSTEMS AND INFORMATION ENGINEERING DESIGN SYMPOSIUM (SIEDS), 2017, : 277 - 282