FAIL*: An Open and Versatile Fault-Injection Framework for the Assessment of Software-Implemented Hardware Fault Tolerance

被引:39
|
作者
Schirmeier, Horst [1 ]
Hoffmann, Martin [2 ]
Dietrich, Christian [2 ]
Lenz, Michael [1 ]
Lohmann, Daniel [2 ]
Spinczyk, Olaf [1 ]
机构
[1] Tech Univ Dortmund, Dept Comp Sci 12, Dortmund, Germany
[2] Univ Erlangen Nurnberg, Chair Distributed Syst & Operating Syst, Erlangen, Germany
关键词
DEPENDABILITY; ERRORS;
D O I
10.1109/EDCC.2015.28
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Due to voltage and structure shrinking, the influence of radiation on a circuit's operation increases, resulting in future hardware designs exhibiting much higher rates of soft errors. Software developers have to cope with these effects to ensure functional safety. However, software-based hardware fault tolerance is a holistic property that is tricky to achieve in practice, potentially impaired by every single design decision. We present FAIL*, an open and versatile architecture-level fault-injection (FI) framework for the continuous assessment and quantification of fault tolerance in an iterative software development process. FAIL* supplies the developer with reusable and composable FI campaigns, advanced pre-and post-processing analyses to easily identify sensitive spots in the software, well-abstracted back-end implementations for several hardware and simulator platforms, and scalability of FI campaigns by providing massive parallelization. We describe FAIL*, its application to the development process of safety-critical software, and the lessons learned from a real-world example.
引用
收藏
页码:245 / 255
页数:11
相关论文
共 50 条
  • [1] A software-implemented fault injection methodology for design and validation of system fault tolerance
    Some, RR
    Kim, WS
    Khanoyan, G
    Callum, L
    Agrawal, A
    Beahan, JJ
    [J]. INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2001, : 501 - 506
  • [2] A new approach to software-implemented fault tolerance
    Rebaudengo, M
    Reorda, MS
    Violante, M
    [J]. JOURNAL OF ELECTRONIC TESTING-THEORY AND APPLICATIONS, 2004, 20 (04): : 433 - 437
  • [3] A New Approach to Software-Implemented Fault Tolerance
    M. Rebaudengo
    M. Sonza Reorda
    M. Violante
    [J]. Journal of Electronic Testing, 2004, 20 : 433 - 437
  • [4] THE SOFTWARE-IMPLEMENTED FAULT TOLERANCE (SIFT) APPROACH TO FAULT TOLERANT COMPUTING
    GOLDBERG, J
    [J]. PROCEEDINGS OF THE SOCIETY OF PHOTO-OPTICAL INSTRUMENTATION ENGINEERS, 1981, 298 : 289 - 293
  • [5] The recovery language approach for software-implemented fault tolerance
    De Florio, V
    Deconinck, C
    Lauwereins, R
    [J]. NINTH EUROMICRO WORKSHOP ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 2001, : 418 - 425
  • [6] Comparison of physical and software-implemented fault injection techniques
    Arlat, J
    Crouzet, Y
    Karlsson, J
    Folkesson, P
    Fuchs, E
    Leber, GH
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2003, 52 (09) : 1115 - 1133
  • [7] COMPAS: Compiler-assisted Software-implemented Hardware Fault Tolerance for RISC-V
    Sharif, Uzair
    Mueller-Gritschneder, Daniel
    Schlichtmann, Ulf
    [J]. 2022 11TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO), 2022, : 80 - 83
  • [8] A PERFORMANCE EVALUATION OF THE SOFTWARE-IMPLEMENTED FAULT-TOLERANCE COMPUTER
    PALUMBO, DL
    BUTLER, RW
    [J]. JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 1986, 9 (02) : 175 - 180
  • [9] A Controller Safety Concept Based on Software-Implemented Fault Tolerance for Fail-Operational Automotive Applications
    Ghadhab, Majdi
    Kuntz, Matthias
    Kuvaiskii, Dmitrii
    Fetzer, Christof
    [J]. FORMAL TECHNIQUES FOR SAFETY-CRITICAL SYSTEMS, (FTSCS 2015), 2016, 596 : 189 - 205
  • [10] Non-intrusive software-implemented fault injection in embedded systems
    Yuste, P
    Ruiz, JC
    Lemus, L
    Gil, P
    [J]. DEPENDABLE COMPUTING, 2003, 2847 : 23 - 38