EVALUATION AND COMPARISON OF FAULT-TOLERANT SOFTWARE TECHNIQUES

Cited by: 16
Authors
HUDAK, J [1]
SUH, BH [1]
SIEWIOREK, D [1]
SEGALL, Z [1]
Affiliations
[1] CARNEGIE MELLON UNIV, DEPT ELECT & COMP ENGN, PITTSBURGH, PA 15213
Keywords
FAULT-TOLERANT SOFTWARE; N-VERSION PROGRAMMING; RECOVERY BLOCK; CONCURRENT ERROR-DETECTION; ALGORITHM-BASED FAULT TOLERANCE; MARKOV MODEL; FAULT-INJECTION TESTING;
DOI
10.1109/24.229487
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Various fault-tolerant software techniques have been proposed in order to meet the reliability requirements of critical systems. This paper evaluates 4 implementations of fault-tolerant software techniques with respect to hardware and design faults. Project participants were divided into 4 groups, each of which developed fault-tolerant software based on a common specification. Each group applied one of the following techniques: n-version programming, recovery block, concurrent error-detection, and algorithm-based fault tolerance. Independent testing and modeling groups within the project then thoroughly analyzed the fault-tolerant software. Using fault-injection tools, the testing group subjected the fault-tolerant software to simulated design and hardware faults. Simulated design-faults included control flow, array boundary, computational, and post/pre increment/decrement software mutations. Simulated hardware-faults included code and data corruption. Data collected from the fault-injection experiment were then mapped into a discrete-time Markov model developed by the modeling group. Based on this model, the effectiveness of each implementation of the fault-tolerant software technique with respect to availability, correctness, and time to failure given an error, is contrasted with measured data. Finally, the model is analyzed with respect to additional figures of merit identified during the modeling process, and the techniques are ranked using an application taxonomy.
Pages: 190-204
Page count: 15