Local decisions and triggering mechanisms for adaptive fault-tolerance

被引:0
|
作者
Stanley-Marbell, P [1 ]
Marculescu, D [1 ]
机构
[1] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15213 USA
关键词
D O I
10.1109/DATE.2004.1269018
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Dynamic fault-tolerance management (DFTM) was previously introduced as a means of providing environment and workload-driven adaptation for failure-prone battery powered systems. This paper introduces and analyzes the role of local decision policies in a DFTM environment, and presents a precise formulation for when it is beneficial to activate a given DFTM algorithm with respect to metrics that combine performance, reliability, power consumption and battery life. In particular, local decision algorithms are described in the context of an imaging array application running on a network of resource-constrained processing elements. It is demonstrated that DFTM algorithms, in conjunction with appropriately chosen activation times, increase the mean computation before battery failure for a single battery, by a factor between 1.1 to 5.8, for the application investigated.
引用
收藏
页码:968 / 973
页数:6
相关论文
共 50 条
  • [1] Engineering Adaptive Fault-Tolerance Mechanisms for Resilient Computing on ROS
    Lauer, Michael
    Amy, Matthieu
    Fabre, Jean-Charles
    Roy, Matthieu
    Excoffon, William
    Stoicescu, Miruna
    [J]. 2016 IEEE 17TH INTERNATIONAL SYMPOSIUM ON HIGH ASSURANCE SYSTEMS ENGINEERING (HASE), 2016, : 94 - 101
  • [2] FAULT-TOLERANCE IN MULTICHANNEL LOCAL AREA NETWORKS
    CAMARDA, P
    GERLA, M
    [J]. EIGHTH ANNUAL INTERNATIONAL PHOENIX CONFERENCE ON COMPUTERS AND COMMUNICATIONS: 1989 CONFERENCE PROCEEDINGS, 1989, : 133 - 137
  • [3] FAULT-TOLERANCE
    GROSSPIETSCH, KE
    [J]. MICROPROCESSING AND MICROPROGRAMMING, 1993, 38 (1-5): : 783 - 783
  • [4] Designing masking fault-tolerance via nonmasking fault-tolerance
    Arora, A
    Kulkarni, SS
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1998, 24 (06) : 435 - 450
  • [5] ON FAULT-TOLERANCE MECHANISMS IN DISTRIBUTED COMPUTER-SYSTEMS
    EBERBACH, E
    JUST, JR
    [J]. MICROPROCESSING AND MICROPROGRAMMING, 1985, 16 (4-5): : 239 - 244
  • [6] Adaptive Fault-Tolerance for Cyber-Physical Systems
    Krishna, C. M.
    Koren, I.
    [J]. 2013 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), 2013,
  • [7] Assessing the reliability impacts of software fault-tolerance mechanisms
    Mendiratta, VB
    [J]. SEVENTH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING, PROCEEDINGS, 1996, : 99 - 103
  • [8] Formal validation of fault-tolerance mechanisms inside GUARDS
    Bernardeschi, C
    Fantechi, A
    Gnesi, S
    [J]. RELIABILITY ENGINEERING & SYSTEM SAFETY, 2001, 71 (03) : 261 - 270
  • [9] ON FAULT-TOLERANCE OF SYNTAX
    SLISSENKO, AO
    [J]. THEORETICAL COMPUTER SCIENCE, 1993, 119 (01) : 215 - 222
  • [10] ABSTRACTIONS FOR FAULT-TOLERANCE
    CRISTIAN, F
    [J]. INFORMATION PROCESSING '94, VOL III: LINKAGE AND DEVELOPING COUNTRIES, 1994, 53 : 278 - 286