An aperiodic checkpointing strategy in desktop grids

Authors
Wang, Dongping [1]
Gong, Bin [1]
Affiliation
[1] Shandong Univ, Inst Comp Sci & Technol, Dept Comp Sci & Technol, Middle Shunhua Rd,New & Hi-Tech Dev Zone, Jinan 250101, Peoples R China
Keywords
aperiodic checkpointing strategy; checkpointing strategy; desktop grids; volunteer computing; fault tolerance; dynamic checkpointing strategy; sample-based checkpointing strategy; checkpoint placement; checkpoint intervals; resource failures; failure distributions
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
In desktop grids, resource failures occur frequently, and checkpointing helps accelerate the completion of long-running jobs. Because the failure distributions of individual hosts in desktop grids are diverse and hard to model, checkpointing strategies based on a closed-form failure distribution are poor choices in this setting. We propose an aperiodic checkpointing strategy that dynamically sets checkpoints according to the history of inter-failure lengths. The strategy works for each individual host, based on a sample of its historical availability interval lengths. Rather than deriving a closed form for the probability distribution of availability interval lengths, the strategy approximates the distribution directly with the sample. We conduct trace-driven simulations to compare this strategy with a periodic strategy, and the results confirm its effectiveness. The paper also reports an unsuccessful attempt to improve the strategy and details a requirement for using it.
Pages: 244-252
Page count: 9
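Illustrative sketch
The abstract describes approximating a host's availability-interval distribution directly from a sample and using it to place checkpoints aperiodically. This record does not give the paper's actual placement rule, so the Python sketch below is only one way such a scheme might look. The function names, the candidate-interval list, and the scoring rule (expected useful work per unit time, conditioned on the host's current uptime) are assumptions made for illustration, not the authors' method.

```python
from bisect import bisect_right


def conditional_survival(sample_sorted, age, extra):
    """Empirical P(interval > age + extra | interval > age).

    sample_sorted: past availability interval lengths, sorted ascending.
    age: how long the host has already been available in the current interval.
    extra: additional time the host must stay available.
    """
    n = len(sample_sorted)
    alive = n - bisect_right(sample_sorted, age)
    if alive == 0:
        # No past interval lasted longer than the current uptime.
        return 0.0
    still_alive = n - bisect_right(sample_sorted, age + extra)
    return still_alive / alive


def next_checkpoint_interval(sample, age, cost, candidates):
    """Pick the candidate work length w before the next checkpoint.

    Assumed scoring rule (hypothetical): maximize w * p / (w + cost), where p
    is the empirical probability that the host survives the work plus the
    checkpoint write, given that it has already survived `age`.
    """
    sample_sorted = sorted(sample)
    best_w, best_score = None, -1.0
    for w in candidates:
        p = conditional_survival(sample_sorted, age, w + cost)
        score = w * p / (w + cost)
        if score > best_score:
            best_w, best_score = w, score
    return best_w


if __name__ == "__main__":
    history = [10, 20, 30, 40]  # made-up interval lengths, arbitrary time units
    print(next_checkpoint_interval(history, age=0, cost=1, candidates=[5, 15, 25]))
```

Because the decision is re-evaluated with the host's current uptime each time, successive checkpoint intervals need not be equal, which is what makes a scheme of this kind aperiodic.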