RESPONDING TO CATASTROPHIC ERRORS - A DESIGN TECHNIQUE FOR FAULT-TOLERANT SOFTWARE

被引:0
|
作者
DAVIS, FGF [1 ]
GANTENBEIN, RE [1 ]
机构
[1] UNIV WYOMING,DEPT COMP SCI,OPERATING SYST LAB,LARAMIE,WY 82071
关键词
D O I
10.1016/0164-1212(92)90113-X
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The usual classification of software-caused system errors as internal, external, or pervasive assumes a rippling propagation of errors through a hierarchy of structures. As a result, most fault-tolerant software handles errors through nested detection and recovery mechanisms. In many cases, particularly in distributed systems, this assumption may not hold; catastrophic errors may occur that can evade the boundaries of the usual mechanisms and cause large-scale system failure. System designers must consider the possibility of failure from the first stages of system development, define the circumstances under which these failures might occur, and analyze the costs of dealing with such failures. Fault-tolerance techniques can be applied to reduce the effect of catastrophic errors. One such technique, dynamic reconfiguration, is described here as an example of a practical way for a system to respond to a detected error. Dynamic reconfiguration can be used not only to recover from software errors but also to remove the faults that caused the errors. An example of the design of a life-critical software system using dynamic configuration to handle potentially catastrophic errors is presented.
引用
下载
收藏
页码:243 / 251
页数:9
相关论文
共 50 条
  • [31] Optimal structure of fault-tolerant software systems
    Levitin, G
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2005, 89 (03) : 286 - 295
  • [32] Towards reliability optimization of fault-tolerant software
    Belli, F.
    Bremerhaven, Hochschule
    Jedrzejowicz, P.
    IT - Information Technology, 1987, 29 (02): : 61 - 68
  • [33] Fault-tolerant Hybrid Quantum Software Systems
    Scheerer, Max
    Klamroth, Jonas
    Denninger, Oliver
    2022 IEEE INTERNATIONAL CONFERENCE ON QUANTUM SOFTWARE (IEEE QSW 2022), 2022, : 52 - 57
  • [34] FAULT-TOLERANT SOFTWARE-RELIABILITY MODELING
    SCOTT, RK
    GAULT, JW
    MCALLISTER, DF
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1987, 13 (05) : 582 - 592
  • [35] An exception handling software architecture for developing fault-tolerant software
    Garcia, AF
    Beder, DM
    Rubira, CMF
    FIFTH IEEE INTERNATIONAL SYMPOSIUM ON HIGH ASSURANCE SYSTEMS ENGINEERING, PROCEEDINGS, 2000, : 311 - 320
  • [36] Software package for the implementation of bioinspired algorithms for the design of fault-tolerant control systems
    Kovalev, I., V
    Saramud, M., V
    Testoyedov, N. A.
    Kovalev, D., I
    Kuznetsov, A. S.
    Koltashev, A. A.
    II INTERNATIONAL SCIENTIFIC CONFERENCE ON APPLIED PHYSICS, INFORMATION TECHNOLOGIES AND ENGINEERING 25, PTS 1-5, 2020, 1679
  • [37] Classification and design of fault-tolerant parallel
    Du, Yunfei
    Tang, Yuhua
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2011, 39 (04): : 49 - 52
  • [38] KINEMATIC DESIGN OF FAULT-TOLERANT MANIPULATORS
    PAREDIS, CJJ
    AU, WKF
    KHOSLA, PK
    COMPUTERS & ELECTRICAL ENGINEERING, 1994, 20 (03) : 211 - 220
  • [40] Design and Verification of Fault-Tolerant Components
    Zhang, Miaomiao
    Liu, Zhiming
    Morisset, Charles
    Ravn, Anders P.
    METHODS, MODELS AND TOOLS FOR FAULT TOLERANCE, 2009, 5454 : 57 - +