Adding fault-tolerance to a hierarchical DRE system

被引:0
|
作者
Rubel, Paul [1 ]
Loyall, Joseph [1 ]
Schantz, Richard [1 ]
Gillen, Matthew [1 ]
机构
[1] BBN Technol, Cambridge, MA USA
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Dynamic resource management is a crucial part of the infrastructure for emerging mission-critical distributed real-time embedded system. Because of this, the resource manager must be fault-tolerant, with nearly continuous operation. This paper describes an ongoing effort to develop a fault-tolerant multilayer dynamic resource management capability and the challenges we have encountered, including multi-tiered structure, rapid recovery, the characteristics of component middleware, and the co-existence of replicated and non-replicated elements. While some of these have been investigated before, this work exhibits all of these characteristics simultaneously, presenting a significant fault-tolerance research challenge.
引用
收藏
页码:303 / 308
页数:6
相关论文
共 50 条
  • [1] The complexity of adding failsafe fault-tolerance
    Kulkarni, SS
    Ebnenasir, A
    [J]. 22ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 2002, : 337 - 344
  • [2] Fault-tolerance schemes for hierarchical mesh networks
    Zurawski, J
    Wang, DJ
    [J]. PDCAT 2005: Sixth International Conference on Parallel and Distributed Computing, Applications and Technologies, Proceedings, 2005, : 498 - 502
  • [3] MODELING OF HIERARCHICAL DISTRIBUTED SYSTEMS WITH FAULT-TOLERANCE
    SHIEH, YB
    GHOSAL, D
    CHINTAMANENI, PR
    TRIPATHI, SK
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1990, 16 (04) : 444 - 457
  • [4] Adding fault-tolerance using pre-synthesized components
    Kulkarni, SS
    Ebnenasir, A
    [J]. DEPENDABLE COMPUTING - EDCC-5, PROCEEDINGS, 2005, 3463 : 72 - 90
  • [5] Fault-Tolerance of Hierarchical Power Management in Data Center
    Li, Jianxiang
    Lv, Yinan
    Kong, Xiangzhen
    [J]. INDUSTRIAL INSTRUMENTATION AND CONTROL SYSTEMS II, PTS 1-3, 2013, 336-338 : 2555 - 2558
  • [6] Designing a resourceful fault-tolerance system
    Giguette, R
    Hassell, J
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2002, 62 (01) : 47 - 57
  • [7] FAULT-TOLERANCE
    GROSSPIETSCH, KE
    [J]. MICROPROCESSING AND MICROPROGRAMMING, 1993, 38 (1-5): : 783 - 783
  • [8] Designing masking fault-tolerance via nonmasking fault-tolerance
    Arora, A
    Kulkarni, SS
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1998, 24 (06) : 435 - 450
  • [9] Hierarchical Byzantine fault-tolerance protocol for permissioned blockchain systems
    Quang Tung Thai
    Yim, Jong-Chul
    Yoo, Tae-Whan
    Yoo, Hyun-Kyung
    Kwak, Ji-Young
    Kim, Sun-Me
    [J]. JOURNAL OF SUPERCOMPUTING, 2019, 75 (11): : 7337 - 7365
  • [10] Hierarchical Byzantine fault-tolerance protocol for permissioned blockchain systems
    Quang Tung Thai
    Jong-Chul Yim
    Tae-Whan Yoo
    Hyun-Kyung Yoo
    Ji-Young Kwak
    Sun-Me Kim
    [J]. The Journal of Supercomputing, 2019, 75 : 7337 - 7365