EVALUATION AND COMPARISON OF FAULT-TOLERANT SOFTWARE TECHNIQUES

被引:16
|
作者
HUDAK, J [1 ]
SUH, BH [1 ]
SIEWIOREK, D [1 ]
SEGALL, Z [1 ]
机构
[1] CARNEGIE MELLON UNIV,DEPT ELECT & COMP ENGN,PITTSBURGH,PA 15213
关键词
FAULT-TOLERANT SOFTWARE; N-VERSION PROGRAMMING; RECOVERY BLOCK; CONCURRENT ERROR-DETECTION; ALGORITHM-BASED FAULT TOLERANCE; MARKOV MODEL; FAULT-INJECTION TESTING;
D O I
10.1109/24.229487
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Various fault-tolerant software techniques have been proposed in order to meet the reliability requirements of critical systems. This paper evaluates 4 implementations of fault-tolerant software techniques with respect to hardware and design faults. Project participants were divided into 4 groups, each of which developed fault-tolerant software based on a common specification. Each group applied one of the following techniques: n-version programming, recovery block, concurrent error-detection, and algorithm-based fault tolerance. Independent testing and modeling groups within the project then thoroughly analyzed the fault-tolerant software. Using fault-injection tools, the testing group subjected the fault-tolerant software to simulated design and hardware faults. Simulated design-faults included control flow, array boundary, computational, and post/pre increment/decrement software mutations. Simulated hardware-faults included code and data corruption. Data collected from the fault-injection experiment were then mapped into a discrete-time Markov model developed by the modeling group. Based on this model, the effectiveness of each implementation of the fault-tolerant software technique with respect to availability, correctness, and time to failure given an error, is contrasted with measured data. Finally, the model is analyzed with respect to additional figures of merit identified during the modeling process, and the techniques are ranked using an application taxonomy.
引用
收藏
页码:190 / 204
页数:15
相关论文
共 50 条
  • [21] HARDWARE AND SOFTWARE FOR FAULT-TOLERANT COMPUTING SYSTEMS
    SOGOMONYAN, ES
    SHAGAEV, IV
    AUTOMATION AND REMOTE CONTROL, 1988, 49 (02) : 129 - 151
  • [22] Aspects for improvement of performance in fault-tolerant software
    Szentiványi, D
    Nadjm-Tehrani, S
    10TH IEEE PACIFIC RIM INTERNATIONAL SYMPOSIUM ON DEPENDABLE COMPUTING, PROCEEDINGS, 2004, : 283 - 291
  • [23] Achieving fault-tolerant software with rejuvenation and reconfiguration
    Yurcik, W
    Doss, D
    IEEE SOFTWARE, 2001, 18 (04) : 48 - +
  • [24] RELIABILITY-GROWTH OF FAULT-TOLERANT SOFTWARE
    KANOUN, K
    KAANICHE, M
    BEOUNES, C
    LAPRIE, JC
    ARLAT, J
    IEEE TRANSACTIONS ON RELIABILITY, 1993, 42 (02) : 205 - 219
  • [25] Reliability simulation of fault-tolerant software and systems
    Gokhale, SS
    Lyu, MR
    Trivedi, KS
    PACIFIC RIM INTERNATIONAL SYMPOSIUM ON FAULT-TOLERANT SYSTEMS, PROCEEDINGS, 1997, : 167 - 173
  • [26] The effect of testing on reliability of fault-tolerant software
    Popov, P
    Littlewood, B
    2004 INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2004, : 265 - 274
  • [27] Optimal structure of fault-tolerant software systems
    Levitin, G
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2005, 89 (03) : 286 - 295
  • [28] Fault-tolerant Hybrid Quantum Software Systems
    Scheerer, Max
    Klamroth, Jonas
    Denninger, Oliver
    2022 IEEE INTERNATIONAL CONFERENCE ON QUANTUM SOFTWARE (IEEE QSW 2022), 2022, : 52 - 57
  • [29] Towards reliability optimization of fault-tolerant software
    Belli, F.
    Bremerhaven, Hochschule
    Jedrzejowicz, P.
    IT - Information Technology, 1987, 29 (02): : 61 - 68
  • [30] FAULT-TOLERANT SOFTWARE-RELIABILITY MODELING
    SCOTT, RK
    GAULT, JW
    MCALLISTER, DF
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1987, 13 (05) : 582 - 592