Restart-Based Fault-Tolerance: System Design and Schedulability Analysis

被引:0
|
作者
Abdi, Fardin [1 ]
Mancuso, Renato [1 ]
Tabish, Rohan [1 ]
Caccamo, Marco [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, 1304 W Springfield Ave, Urbana, IL 61801 USA
基金
美国国家科学基金会;
关键词
REAL-TIME SYSTEMS; PERIODIC TASKS;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Embedded systems in safety-critical environments are continuously required to deliver more performance and functionality, while expected to provide verified safety guarantees. Nonetheless, platform-wide software verification (required for safety) is often expensive. Therefore, design methods that enable utilization of components such as real-time operating systems (RTOS), without requiring their correctness to guarantee safety, is necessary. In this paper, we propose a design approach to deploy safe-by-design embedded systems. To attain this goal, we rely on a small core of verified software to handle faults in applications and RTOS and recover from them while ensuring that timing constraints of safety-critical tasks are always satisfied. Faults are detected by monitoring the application timing and fault-recovery is achieved via full platform restart and software reload, enabled by the short restart time of embedded systems. Schedulability analysis is used to ensure that the timing constraints of critical plant control tasks are always satisfied in spite of faults and consequent restarts. We derive schedulability results for four restart-tolerant task models. We use a simulator to evaluate and compare the performance of the considered scheduling models.
引用
下载
收藏
页数:10
相关论文
共 50 条
  • [21] A Robot Fault-tolerance Approach Based on Fault Type
    Shim, Bingu
    Baek, Beomho
    Kim, Suntae
    Park, Sooyong
    2009 NINTH INTERNATIONAL CONFERENCE ON QUALITY SOFTWARE (QSIC 2009), 2009, : 296 - 304
  • [22] Guaranteed Physical Security with Restart-Based Design for Cyber-Physical Systems
    Abdi, Fardin
    Chen, Chien-Ying
    Hasan, Monowar
    Liu, Songran
    Mohan, Sibin
    Caccamo, Marco
    2018 9TH ACM/IEEE INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS (ICCPS 2018), 2018, : 10 - 21
  • [23] Hardware fault-tolerance within the POEtic system
    Barker, W
    Tyrrell, AM
    EVOLVABLE SYSTEMS: FROM BIOLOGY TO HARDWARE, 2005, 3637 : 25 - 36
  • [24] Adding fault-tolerance to a hierarchical DRE system
    Rubel, Paul
    Loyall, Joseph
    Schantz, Richard
    Gillen, Matthew
    DISTRIBUTED APPLICATIONS AND INTEROPERABLE SYSTEMS, PROCEEDINGS, 2006, 4025 : 303 - 308
  • [25] REPLICATION AND FAULT-TOLERANCE IN THE ISIS SYSTEM.
    Birman, Kenneth P.
    Operating Systems Review (ACM), 1985, 19 (05): : 79 - 86
  • [27] Performance, Fault-tolerance and Scalability Analysis of Virtual Infrastructure Management System
    Kong, Xiangzhen
    Huang, Jiwei
    Lin, Chuang
    Ungsunan, Peter D.
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS, PROCEEDINGS, 2009, : 282 - 289
  • [28] Design and Realization of a Fault-Tolerance Model to Distributed Simulation System of Hydropower Plant
    Zhang, Binqiao
    Wu, Chengming
    Li, Xianshan
    Wang, Pengyu
    Liu, Rongzhang
    2012 WORLD AUTOMATION CONGRESS (WAC), 2012,
  • [29] ANALYSIS OF A FAULT-TOLERANCE SCHEME FOR PROCESSOR ENSEMBLES
    UPADHYAYA, SJ
    CHAKRAVARTY, S
    IEEE TRANSACTIONS ON RELIABILITY, 1992, 41 (02) : 294 - 303
  • [30] Automated analysis of fault-tolerance in distributed systems
    Stoller, SD
    Schneider, FB
    FORMAL METHODS IN SYSTEM DESIGN, 2005, 26 (02) : 183 - 196