Improving the performance of checkpointing scheme with task duplication

被引:0
|
作者
Li, Kaiyuan [1 ]
Yang, Xiaozong [1 ]
机构
[1] Harbin Inst of Technology, Harbin, China
来源
关键词
Computer system recovery;
D O I
暂无
中图分类号
学科分类号
摘要
Checkpointing is a common technique for reducing the execution time of programs under the fault assumption. With the combination of checkpointing and task duplication, not only effective fault recovery but also perfect fault detection can be achieved. The overhead of such systems comes from two aspects:comparing and saving operation at each checkpoint, and the rollbacks caused by faults. This paper improves the method presented by Zlv and Bruck by employing incremental checkpointing. The improved method can reduce the overhead of comparing and saving operation, and moreover the rollbacks caused by latent faults can be avoided. Analysis show that thatour method exhibits better performance through comparison with that of Ziv and Bruck.
引用
收藏
页码:33 / 35
相关论文
共 50 条
  • [1] Performance optimization of checkpointing schemes with task duplication
    Li, Zhongwen
    Xiang, Yang
    Chen, Hong
    FIRST INTERNATIONAL MULTI-SYMPOSIUMS ON COMPUTER AND COMPUTATIONAL SCIENCES (IMSCCS 2006), PROCEEDINGS, VOL 2, 2006, : 671 - +
  • [2] Performance optimization of checkpointing schemes with task duplication
    Ziv, A
    Bruck, J
    IEEE TRANSACTIONS ON COMPUTERS, 1997, 46 (12) : 1381 - 1386
  • [3] Analysis of checkpointing schemes with task duplication
    Ziv, A
    Bruck, J
    IEEE TRANSACTIONS ON COMPUTERS, 1998, 47 (02) : 222 - 227
  • [4] Optimal checkpointing interval for task duplication with spare processing
    Nakagawa, S
    Okuda, Y
    Yamada, S
    NINTH ISSAT INTERNATIONAL CONFERENCE ON RELIABILITY AND QUALITY IN DESIGN, 2003 PROCEEDINGS, 2003, : 215 - 219
  • [5] Improving network performance through task duplication for parallel applications on clusters
    Qin, X
    CONFERENCE PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL PERFORMANCE, COMPUTING AND COMMUNICATIONS CONFERENCE, 2005, : 35 - 42
  • [6] A Smart Checkpointing Scheme for Improving the Reliability of Clustering Routing Protocols
    Min, Hong
    Jung, Jinman
    Kim, Bongjae
    Cho, Yookun
    Heo, Junyoung
    Yi, Sangho
    Hong, Jiman
    SENSORS, 2010, 10 (10) : 8938 - 8952
  • [7] A task duplication scheme for resolving deadlocks in clustered DAGs
    Arafeh, BR
    PARALLEL COMPUTING, 2003, 29 (06) : 795 - 820
  • [8] The Design and Performance of a Checkpointing Scheme for Mobile Ad Hoc Networks
    Tuli, Ruchi
    Kumar, Parveen
    ADVANCES IN PARALLEL, DISTRIBUTED COMPUTING, 2011, 203 : 204 - +
  • [9] ReCT: Improving MapReduce Performance under Failures with Resilient Checkpointing Tactics
    Wang, Hao
    Chen, Haopeng
    Hu, Fei
    2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014,
  • [10] A Method for Improving the Reliability of the Gateway System by Using OSEK and Duplication Scheme
    Kim, J. H.
    Seo, S. H.
    Moon, T. Y.
    Kwon, K. H.
    Jeon, J. W.
    Hwang, S. H.
    2008 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY, VOLS 1-5, 2008, : 489 - +