Software fault detection and recovery in critical real-time systems: An approach based on loose coupling

被引:6
|
作者
Alho, Pekka [1 ]
Mattila, Jouni [1 ]
机构
[1] Tampere Univ Technol, Dept Intelligent Hydraul & Automat, FIN-33101 Tampere, Finland
关键词
ITER; Remote handling; Software; Fault tolerance; Dependability; Real-time; TOLERANCE;
D O I
10.1016/j.fusengdes.2014.04.050
中图分类号
TL [原子能技术]; O571 [原子核物理学];
学科分类号
0827 ; 082701 ;
摘要
Remote handling (RH) systems are used to inspect, make changes to, and maintain components in the ITER machine and as such are an example of mission-critical system. Failure in a critical system may cause damage, significant financial losses and loss of experiment runtime, making dependability one of their most important properties. However, even if the software for RH control systems has been developed using best practices, the system might still fail due to undetected faults (bugs), hardware failures, etc. Critical systems therefore need capability to tolerate faults and resume operation after their occurrence. However, design of effective fault detection and recovery mechanisms poses a challenge due to timeliness requirements, growth in scale, and complex interactions. In this paper we evaluate effectiveness of service-oriented architectural approach to fault tolerance in mission-critical real-time systems. We use a prototype implementation for service management with an experimental RH control system and industrial manipulator. The fault tolerance is based on using the high level of decoupling between services to recover from transient faults by service restarts. In case the recovery process is not successful, the system can still be used if the fault was not in a critical software module. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:2272 / 2277
页数:6
相关论文
共 50 条
  • [31] Software Fault Tolerance in Real-Time Systems: Identifying the Future Research Questions
    Reghenzani, Federico
    Guo, Zhishan
    Fornaciari, William
    ACM COMPUTING SURVEYS, 2023, 55 (14S)
  • [32] Combining software and hardware monitoring for fault diagnosis of complex real-time systems
    Liu, YB
    Zhu, XD
    Gan, MZ
    ICEMI 2005: CONFERENCE PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS, VOL 7, 2005, : 732 - 738
  • [33] A MODULAR APPROACH TO REAL-TIME SOFTWARE
    CHAO, A
    COMPUTER DESIGN, 1986, 25 (18): : 85 - 88
  • [34] AN ARCHITECTURE FOR REAL-TIME SOFTWARE SYSTEMS
    BAKER, TP
    SCALLON, GM
    IEEE SOFTWARE, 1986, 3 (03) : 50 - 58
  • [35] SOFTWARE ASPECTS IN REAL-TIME SYSTEMS
    RZEHAK, H
    MICROPROCESSING AND MICROPROGRAMMING, 1989, 27 (1-5): : 511 - 511
  • [36] An evolving approach to unsupervised and Real-Time fault detection in industrial processes
    Bezerra, Clauber Gomes
    Jales Costa, Bruno Sielly
    Guedes, Luiz Affonso
    Angelov, Plamen Parvanov
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 63 : 134 - 144
  • [37] An Automated Machine Learning Approach for Real-Time Fault Detection and Diagnosis
    Leite, Denis
    Martins, Aldonso, Jr.
    Rativa, Diego
    De Oliveira, Joao F. L.
    Maciel, Alexandre M. A.
    SENSORS, 2022, 22 (16)
  • [38] Fault detection in real-time software systems using neuro-fuzzy tools for signal validation and diagnosis
    Thunem, HPJ
    Thunem, APJ
    PROBABILISTIC SAFETY ASSESSMENT AND MANAGEMENT, VOL 1- 6, 2004, : 1858 - 1863
  • [39] A real-time simulation tool for fault detection & diagnosis of HVAC systems
    Ma, Yue
    Zaheeruddin, M.
    2006 XI'AN INTERNATIONAL CONFERENCE OF ARCHITECTURE AND TECHNOLOGY, PROCEEDINGS: ARCHITECTURE IN HARMONY, 2006, : 676 - 683
  • [40] Real-time fault detection for advanced maintenance of sustainable technical systems
    Ait-Alla, Abderrahim
    Luetjen, Michael
    Lewandowski, Marco
    Freitag, Michael
    Thoben, Klaus-Dieter
    RESEARCH AND INNOVATION IN MANUFACTURING: KEY ENABLING TECHNOLOGIES FOR THE FACTORIES OF THE FUTURE - PROCEEDINGS OF THE 48TH CIRP CONFERENCE ON MANUFACTURING SYSTEMS, 2016, 41 : 295 - 300