A case for two-level recovery schemes

被引:31
|
作者
Vaidya, NH [1 ]
机构
[1] Texas A&M Univ, Dept Comp Sci, College Stn, TX 77843 USA
基金
美国国家科学基金会;
关键词
failure recovery; performance analysis; checkpointing and rollback; recovery overhead; Markov chains;
D O I
10.1109/12.689645
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Long-running applications are often subject to failures. Failures can result in significant loss of computation. Therefore, it is necessary to use a failure recovery scheme to minimize performance overhead in the presence of failures. In this paper, we argue that it is often advantageous to use "two-level" recovery schemes. A two-level recovery scheme tolerates the more probable failures with low performance overhead, while the less probable failures may possibly incur a higher overhead. By minimizing overhead for the more frequently occurring failure scenarios, the two-level approach can achieve lower performance overhead (on average) as compared to existing recovery schemes. The paper describes two two-level recovery schemes. Performance analysis using a Markov chain shows that, in practice, a two-level scheme can perform better than its "one-level" counterpart. While the conclusions of this paper are intuitive, the work on design of appropriate recovery schemes is lacking. The objective of this paper is to motivate research into recovery schemes that can provide multiple levels of fault tolerance and achieve better performance than existing recovery schemes. The paper presents an analytical approach for evaluating performance of two-level schemes and shows that such schemes are hard to optimize analytically.
引用
收藏
页码:656 / 666
页数:11
相关论文
共 50 条
  • [31] A two-level cosimulation environment
    Fornaciari, W
    Sciuto, D
    Salice, F
    COMPUTER, 1997, 30 (06) : 109 - 111
  • [32] Two-Level Convergence Speedup Schemes for p-CMFD Acceleration in Neutron Transport Calculation
    Yuk, Seungsu
    Cho, Nam Zin
    NUCLEAR SCIENCE AND ENGINEERING, 2017, 188 (01) : 1 - 14
  • [33] Two-level volume rendering
    Hauser, H
    Mroz, L
    Bischi, GI
    Gröller, ME
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2001, 7 (03) : 242 - 252
  • [34] Two-level modeling of quarantine
    Khain, Evgeniy
    PHYSICAL REVIEW E, 2020, 102 (02)
  • [35] Two-level factorial experiments
    Byran Smucker
    Martin Krzywinski
    Naomi Altman
    Nature Methods, 2019, 16 : 211 - 212
  • [36] A critique of two-level approximation
    Savran, K
    Hakioglu, T
    Realizing Controllable Quantum States: MESOSCOPIC SUPERCONDUCTIVITY AND SPINTRONICS, 2005, : 233 - 240
  • [37] Two-level systems with relaxation
    Rourke, DE
    Khodarinova, L
    Karabanov, AA
    PHYSICAL REVIEW LETTERS, 2004, 92 (16) : 163003 - 1
  • [38] A Two-Level Perspective on Preference
    Fenrong Liu
    Journal of Philosophical Logic, 2011, 40 : 421 - 439
  • [39] A class of two-level explicit difference schemes for solving three dimensional heat conduction equation
    Zeng, WP
    APPLIED MATHEMATICS AND MECHANICS-ENGLISH EDITION, 2000, 21 (09) : 1071 - 1078
  • [40] A Two-Level Perspective on Preference
    Liu, Fenrong
    JOURNAL OF PHILOSOPHICAL LOGIC, 2011, 40 (03) : 421 - 439