RESPONDING TO CATASTROPHIC ERRORS - A DESIGN TECHNIQUE FOR FAULT-TOLERANT SOFTWARE

被引:0
|
作者
DAVIS, FGF [1 ]
GANTENBEIN, RE [1 ]
机构
[1] UNIV WYOMING,DEPT COMP SCI,OPERATING SYST LAB,LARAMIE,WY 82071
关键词
D O I
10.1016/0164-1212(92)90113-X
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The usual classification of software-caused system errors as internal, external, or pervasive assumes a rippling propagation of errors through a hierarchy of structures. As a result, most fault-tolerant software handles errors through nested detection and recovery mechanisms. In many cases, particularly in distributed systems, this assumption may not hold; catastrophic errors may occur that can evade the boundaries of the usual mechanisms and cause large-scale system failure. System designers must consider the possibility of failure from the first stages of system development, define the circumstances under which these failures might occur, and analyze the costs of dealing with such failures. Fault-tolerance techniques can be applied to reduce the effect of catastrophic errors. One such technique, dynamic reconfiguration, is described here as an example of a practical way for a system to respond to a detected error. Dynamic reconfiguration can be used not only to recover from software errors but also to remove the faults that caused the errors. An example of the design of a life-critical software system using dynamic configuration to handle potentially catastrophic errors is presented.
引用
下载
收藏
页码:243 / 251
页数:9
相关论文
共 50 条
  • [1] A system without fault-tolerant software design
    Cawlfield, D
    INTECH, 1999, 46 (11) : 44 - 46
  • [2] FAULT-TOLERANT SOFTWARE
    HECHT, H
    IEEE TRANSACTIONS ON RELIABILITY, 1979, 28 (03) : 227 - 232
  • [3] Fault-tolerant Strategies and Their Design Methods for Application Software
    高建华
    邵世煌
    Journal of Donghua University(English Edition), 1998, (01) : 21 - 24
  • [4] The Fault-tolerant Software Design in Warship Control System
    Wen, Hong
    Liu, Yong
    2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2013, : 512 - 515
  • [5] FAULT-TOLERANT SOFTWARE - PROLOG
    MEYER, JF
    PHAM, H
    IEEE TRANSACTIONS ON RELIABILITY, 1993, 42 (02) : 177 - 178
  • [6] A design tool for large scale fault-tolerant software systems
    Blackmon, CL
    Yin, ML
    ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM, 2004 PROCEEDINGS, 2004, : 256 - 260
  • [7] EMBEDDING FAULT-TOLERANT PROPERTIES IN THE DESIGN OF COMPLEX SOFTWARE SYSTEMS
    PERKUSICH, A
    DEFIGUEIREDO, JCA
    CHANG, SK
    JOURNAL OF SYSTEMS AND SOFTWARE, 1994, 25 (01) : 23 - 37
  • [8] Formal verification of fault-tolerant software design: the CSP approach
    Yeung, WL
    Schneider, SA
    MICROPROCESSORS AND MICROSYSTEMS, 2005, 29 (05) : 197 - 209
  • [9] Design and modeling of hybrid fault-tolerant software with cost constraints
    Wu, J
    Fernandez, EB
    Zhang, MX
    JOURNAL OF SYSTEMS AND SOFTWARE, 1996, 35 (02) : 141 - 149
  • [10] Catastrophic Failure and Fault-Tolerant Design of IGBT Power Electronic Converters - An Overview
    Wu, Rui
    Blaabjerg, Frede
    Wang, Huai
    Liserre, Marco
    Iannuzzo, Francesco
    39TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY (IECON 2013), 2013, : 507 - 513