Non-intrusive system level fault-tolerance

Authors
Lundqvist, K [1]
Srinivasan, J [1]
Gorelov, S [1]
Affiliations
[1] MIT, Dept Aeronaut & Astronaut, Embedded Syst Lab, Cambridge, MA 02139 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
High-integrity embedded systems operate in multiple modes, in order to ensure system availability in the face of faults. Unanticipated state-dependent faults that remain in software after system design and development behave like hardware transient faults: they appear, do the damage and disappear. The conventional approach used for handling task overruns caused by transient faults is to use a single recovery task that implements minimal functionality. This approach provides limited availability and should be used as a last resort in order to keep the system online. Traditional fault detection approaches are often intrusive in that they consume processor resources in order to monitor system behavior. This paper presents a novel approach for fault-monitoring by leveraging the Ravenscar profile, model-checking and a system-on-chip implementation of both the kernel and an execution time monitor. System fault-tolerance is provided through a hierarchical set of operational modes that are based on timing behavior violations of individual tasks within the application. The approach is illustrated through a simple case study of a generic navigation system.
Pages: 156-166
Number of pages: 11