Ensuring System Safety and Resiliency for Mission-Critical Systems during the Operations and Maintenance Phase

Cited by: 0
Author
Liao, Holmes [1]
Affiliation
[1] MITRE Corp, McLean, VA 22102 USA
Keywords
resiliency; system safety; safety management system; mission-critical systems; machine learning; operations and maintenance; service thread; FMEA; NASA; DoD; FAA
DOI
10.1109/ICNS60906.2024.10550733
Chinese Library Classification number
V [Aviation, Aerospace]
Discipline classification code
08; 0825
Abstract
Ensuring system safety and resiliency of mission-critical systems throughout their life cycle is a continuous process. This paper delves into enhancing system safety during the operations and maintenance phase, a period susceptible to latent faults, design and implementation errors, worn-out equipment, and human errors. The author proposes a three-pronged approach encompassing service thread-based Failure Mode and Effects Analysis, machine learning techniques trained on historical and real-time operational data, and periodic analysis for operations and maintenance procedures. By integrating these strategies, the paper aims to bolster system resilience, mitigate potential system hazards and risks, and minimize system outages of mission-critical systems, ultimately ensuring uninterrupted operations.
Pages: 9