High-coverage fault tolerance in real-time systems based on point-to-point communication

Cited by: 0
Authors
Kim, KH [1]
Subbaraman, C [1]
Shokri, E [1]
Affiliations
[1] Univ Calif Irvine, Dept Elect & Comp Engn, Irvine, CA 92697 USA
Keywords
distributed recovery block; network surveillance; point-to-point networks; real-time systems; fault-tolerance; fault coverage; recovery time bound;
DOI
10.1109/HASE.1997.648053
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
The distributed recovery block (DRB) scheme is a widely applicable approach for realizing both hardware and software fault tolerance in real-time distributed and parallel computer systems. One of the most important extensions of the DRB scheme outlined in recent years, but not yet fully developed, is its integration with a network surveillance (NS) scheme. We recently developed an NS scheme that is effective in a variety of point-to-point networks, called the supervisor-based NS (SNS) scheme. In this paper, we present an integration of the DRB scheme with the SNS scheme, called the DRB/SNS scheme. This scheme is a significant improvement over previous versions of the DRB scheme with respect to the fault coverage and recovery time bound achieved in systems based on point-to-point networks. The execution support for the integrated scheme has been implemented as part of the DREAM kernel prototype, a timeliness-guaranteed operating system kernel developed at the University of California, Irvine. The recovery time bound of the DRB/SNS scheme is analyzed on the basis of the prototype implementation.
Pages: 141-148
Number of pages: 8