Application fault tolerance with armore middleware

被引:22
|
作者
Kalbarczyk, Z [1 ]
Iyer, RK
Wang, L
机构
[1] Univ Illinois, Coordinated Sci Lab, Urbana, IL 61801 USA
[2] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/MIC.2005.31
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Many current approaches to software-implemented fault tolerance (SIFT) rely on process replication, which is often prohibitively expensive for practical use due to its high performance overhead and cost. The Adaptive Reconfigurable Mobile Objects of Reliability (Armor) middleware architecture offers a scalable low-overhead way to provide high-dependability services to applications. It uses coordinated multithreaded processes to manage redundant resources across interconnected nodes, detect errors in user applications and infrastructural components, and provide failure recovery. The authors describe their experiences and lessons learned in deploying Armor in several diverse fields.
引用
收藏
页码:28 / 37
页数:10
相关论文
共 50 条
  • [1] Fault tolerance configuration for middleware services
    Li, Jun-Guo
    Huang, Gang
    Zou, Jian
    Mei, Hong
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2007, 30 (10): : 1696 - 1704
  • [2] CUMULVS: Extending a generic steering and visualization middleware for application fault-tolerance
    Papadopoulos, PM
    Kohl, JA
    Semeraro, BD
    [J]. PROCEEDINGS OF THE THIRTY-FIRST HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, VOL VII: SOFTWARE TECHNOLOGY TRACK, 1998, : 127 - 136
  • [3] Fault-Tolerance in XJAF Agent Middleware
    Ivanovic, Mirjana
    Ivkovic, Jovana
    Vidakovic, Milan
    Luburic, Nikola
    Badica, Costin
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2016, PT II, 2016, 9876 : 25 - 34
  • [4] Fault-tolerance in Universal Middleware Bridge
    Moon, Kyung-Deok
    Park, Jun Hee
    Kim, K. H.
    Zheng, Liangchen
    Zhou, Qian
    [J]. ISORC 2008: 11TH IEEE SYMPOSIUM ON OBJECT/COMPONENT/SERVICE-ORIENTED REAL-TIME DISTRIBUTED COMPUTING - PROCEEDINGS, 2008, : 471 - +
  • [5] Fault Tolerance Management for a Hierarchical GridRPC Middleware
    Bouteiller, Aurelien
    Desprez, Frederic
    [J]. CCGRID 2008: EIGHTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, VOLS 1 AND 2, PROCEEDINGS, 2008, : 484 - 491
  • [6] Fault tolerance using standard reflexive middleware mechanisms
    Bennani, Mohamed Taha
    [J]. PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING AND NETWORKS, 2007, : 359 - 366
  • [7] Flexible fault tolerance in configurable middleware for embedded systems
    Dorow, K
    [J]. 27TH ANNUAL INTERNATIONAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE, PROCEEDINGS, 2003, : 563 - 569
  • [8] Application of fault injection to globus grid middleware
    Looker, Nik
    Xu, Jie
    Wo, Tianyu
    Huai, Jinpeng
    [J]. PROCEEDINGS OF THE UK E-SCIENCE ALL HANDS MEETING 2006, 2006, : 265 - 272
  • [9] Lightweight Fault-Tolerance for Peer-to-Peer Middleware
    Martins, Rolando
    Narasimhan, Priya
    Lopes, Luis
    Silva, Fernando
    [J]. 2010 29TH IEEE INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS SRDS 2010, 2010, : 313 - 317
  • [10] Middleware fault tolerance support for the BOSS embedded operating system
    Afonso, F.
    Silva, C.
    Montenegro, S.
    Tavares, A.
    [J]. PROCEEDINGS OF THE FOURTH INTERNATIONAL WORKSHOP ON INTELLIGENT SOLUTIONS IN EMBEDDED SYSEMS, 2006, : 35 - +