Performance optimization of checkpointing schemes with task duplication

被引:0
|
作者
Li, Zhongwen [1 ,3 ]
Xiang, Yang [2 ]
Chen, Hong [1 ]
机构
[1] Xiamen Univ, Informat Sci & Technol Coll, Xiamen 361005, Peoples R China
[2] Deakin Univ, Sch Engn& Informat Technol, Geelong, Vic, Australia
[3] Zhongshan Inst UESTC, Zhongshan 528402, Peoples R China
关键词
fault-tolerant computing; checkpointing intervals; task duplication; performance optimization;
D O I
10.1109/IMSCCS.2006.250
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Using store-checkpoints (SCPs) and compare-checkpoints (CCPs), we present an adaptive checkpointing scheme that dynamically adjusts the checkpointing interval on line in this paper With additional SCPs and CCPs, we can use both the comparison and storage operations in an efficient way and improve the performance of checkpointing schemes. Further we obtain methods to calculate the optimal numbers of checkpoints by which minimize the mean execution times. Simulation results show that compared to previous methods, the proposed approach significantly increases the likelihood of timely task completion in the present of faults.
引用
收藏
页码:671 / +
页数:2
相关论文
共 50 条
  • [21] The performance of coordinated and independent checkpointing
    Silva, LM
    Silva, JG
    IPPS/SPDP 1999: 13TH INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM & 10TH SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 1999, : 280 - 284
  • [22] PERFORMANCE ANALYSIS OF CHECKPOINTING STRATEGIES
    TANTAWI, AN
    RUSCHITZKA, M
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 1984, 2 (02): : 123 - 144
  • [23] Performance of coordinated and independent checkpointing
    Universidade de Coimbra, Coimbra, Portugal
    Proc Int Parall Process Symp IPPS, (280-284):
  • [24] Optimizing Checkpointing Performance in Spark
    Zhang, Ya-Meng
    Luo, Yu
    Li, Yan-Chen
    3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND MECHANICAL AUTOMATION (CSMA 2017), 2017, : 9 - 13
  • [25] Optimal checkpointing for adjoint multistage time-stepping schemes
    Zhang, Hong
    Constantinescu, Emil M.
    JOURNAL OF COMPUTATIONAL SCIENCE, 2023, 66
  • [26] Checkpointing schemes for fast restart in main memory database systems
    Lee, D
    Cho, H
    1997 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING, VOLS 1 AND 2: PACRIM 10 YEARS - 1987-1997, 1997, : 663 - 668
  • [27] Optimal checkpointing interval for two-level recovery schemes
    Naruse, K
    Umemura, S
    Nakagawa, S
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2006, 51 (02) : 371 - 376
  • [28] Scalable Incremental Checkpointing using GPU-Accelerated De-Duplication
    Tan, Nigel
    Luettgau, Jakob
    Marquez, Jack
    Terianishi, Keita
    Morales, Nicolas
    Bhowmick, Sanjukta
    Cappello, Franck
    Taufer, Michela
    Nicolae, Bogdan
    PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2023, 2023, : 665 - 674
  • [29] Performance optimization for energy-aware adaptive checkpointing in embedded real-time systems
    Li, Zhongwen
    Chen, Hong
    Yu, Shui
    2006 DESIGN AUTOMATION AND TEST IN EUROPE, VOLS 1-3, PROCEEDINGS, 2006, : 676 - +
  • [30] Consistent checkpointing for high performance clusters
    Nishioka, T
    Hori, A
    Ishikawa, Y
    CLUSTER 2000: IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, PROCEEDINGS, 2000, : 367 - 368