RESPONDING TO CATASTROPHIC ERRORS - A DESIGN TECHNIQUE FOR FAULT-TOLERANT SOFTWARE

被引:0
|
作者
DAVIS, FGF [1 ]
GANTENBEIN, RE [1 ]
机构
[1] UNIV WYOMING,DEPT COMP SCI,OPERATING SYST LAB,LARAMIE,WY 82071
关键词
D O I
10.1016/0164-1212(92)90113-X
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The usual classification of software-caused system errors as internal, external, or pervasive assumes a rippling propagation of errors through a hierarchy of structures. As a result, most fault-tolerant software handles errors through nested detection and recovery mechanisms. In many cases, particularly in distributed systems, this assumption may not hold; catastrophic errors may occur that can evade the boundaries of the usual mechanisms and cause large-scale system failure. System designers must consider the possibility of failure from the first stages of system development, define the circumstances under which these failures might occur, and analyze the costs of dealing with such failures. Fault-tolerance techniques can be applied to reduce the effect of catastrophic errors. One such technique, dynamic reconfiguration, is described here as an example of a practical way for a system to respond to a detected error. Dynamic reconfiguration can be used not only to recover from software errors but also to remove the faults that caused the errors. An example of the design of a life-critical software system using dynamic configuration to handle potentially catastrophic errors is presented.
引用
下载
收藏
页码:243 / 251
页数:9
相关论文
共 50 条
  • [41] ON THE DESIGN OF FAULT-TOLERANT SIGNAL DETECTORS
    MEYER, GGL
    WEINERT, HL
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1986, 34 (04): : 973 - 978
  • [42] On the design of a fault-tolerant flight controller
    Yu, XH
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XII, PROCEEDINGS: INDUSTRIAL SYSTEMS AND ENGINEERING II, 2002, : 199 - 202
  • [43] Design of fault-tolerant control for MTTF
    Li, Hongbin
    Zhao, Qing
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2008, 18 (16) : 1551 - 1574
  • [44] Fault-tolerant PACS server design
    Huang, HK
    Cao, F
    Liu, BJ
    Zhang, J
    Zhou, Z
    Tsai, A
    Mogel, G
    MEDICAL IMAGING 2001: PACS AND INTEGRATED MEDICAL INFORMATION SYSTEMS: DESIGN AND EVALUATION, 2001, 4323 : 83 - 92
  • [45] Fault-tolerant teleoperation systems design
    Dede, Mehmet
    Tosunoglu, Sabri
    INDUSTRIAL ROBOT-AN INTERNATIONAL JOURNAL, 2006, 33 (05) : 365 - 372
  • [46] ON ISSUES IN FAULT-TOLERANT COMPUTER DESIGN
    SOI, IM
    AGGARWAL, KK
    COMPUTERS & ELECTRICAL ENGINEERING, 1981, 8 (03) : 229 - 234
  • [47] Fault-tolerant thresholds for encoded ancillae with homogeneous errors
    Eastin, Bryan
    PHYSICAL REVIEW A, 2007, 75 (02):
  • [48] Design of a robust fault-tolerant multiplier
    Kasuga, Takeshi
    Kameyama, Michitaka
    Higuchi, Tatsuo
    Systems and Computers in Japan, 1991, 22 (02) : 10 - 18
  • [49] A Fault-Tolerant Combinational Circuit Design
    Ostanin, S.
    Kirienko, I.
    Lavrov, V.
    PROCEEDINGS OF 2015 IEEE EAST-WEST DESIGN & TEST SYMPOSIUM (EWDTS), 2015,
  • [50] DESIGN OF A FAULT-TOLERANT UNIVERSAL CELL
    LALA, PK
    BUSABA, F
    XIE, A
    YARLAGADDA, KC
    INTERNATIONAL JOURNAL OF ELECTRONICS, 1992, 72 (03) : 467 - 470