Soft-Checkpointing Based Hybrid Synchronous Checkpointing Protocol for Mobile Distributed Systems

被引:15
|
作者
Kumar, Parveen [1 ]
Garg, Rachit [2 ]
机构
[1] Meerut Inst Engn & Technol, Grp Inst Prof CSE, Meerut, Uttar Pradesh, India
[2] Singhania Univ, Jhunjhunu, India
关键词
Checkpoint; Consistent Global State; Coordinated Checkpointing and Mobile Systems; Fault Tolerance; Probabilistic Approach;
D O I
10.4018/jdst.2011010101
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Minimum-process coordinated checkpointing is a suitable approach to introduce fault tolerance in mobile distributed systems transparently. In order to balance the checkpointing overhead and the loss of computation on recovery, the authors propose a hybrid checkpointing algorithm, wherein an all-process coordinated checkpoint is taken after the execution of minimum-process coordinated checkpointing algorithm for a fixed number of times. In coordinated checkpointing, if a single process fails to take its checkpoint; all the checkpointing effort goes waste, because, each process has to abort its tentative checkpoint. In order to take the tentative checkpoint, an MH (Mobile Host) needs to transfer large checkpoint data to its local MSS over wireless channels. In this regard, the authors propose that in the first phase, all concerned MHs will take soft checkpoint only. Soft checkpoint is similar to mutable checkpoint. In this case, if some process fails to take checkpoint in the first phase, then MHs need to abort their soft checkpoints only. The effort of taking a soft checkpoint is negligibly small as compared to the tentative one. In the minimum-process coordinated checkpointing algorithm, an effort has been made to minimize the number of useless checkpoints and blocking of processes using probabilistic approach.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 50 条
  • [31] PQPCkpt: An efficient three level synchronous checkpointing scheme in mobile computing systems
    Lin, CM
    Dow, CR
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2001, E84D (11): : 1556 - 1567
  • [32] Distributed checkpointing based on influential messages
    Tanaka, K
    Takizawa, M
    1996 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 1996, : 440 - 447
  • [33] FNB: Fast Non-Blocking Coordinated Checkpointing Protocol for Distributed Systems
    Abdelhafidi, Zohra
    Djoudi, Mohamed
    Lagraa, Nasreddine
    Yagoubi, Mohamed Bachir
    THEORY OF COMPUTING SYSTEMS, 2015, 57 (02) : 397 - 425
  • [34] An index-based checkpointing algorithm for autonomous distributed systems
    Baldoni, R
    Quaglia, F
    Fornara, P
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1999, 10 (02) : 181 - 192
  • [35] An index-based checkpointing/recovery approach for distributed systems
    Gupta, B
    Banerjee, SK
    Wang, Z
    COMPUTERS AND THEIR APPLICATIONS, 2001, : 166 - 170
  • [36] An Optimum Checkpointing-Based Fault Tolerant Algorithm Using Mobile Agent in Distributed Systems
    Zeinalabedin, Farid Haji
    Eftekhari, Nassrin
    Haghighat, Abolfazl Torghi
    COMPUTATIONAL INTELLIGENCE AND INFORMATION TECHNOLOGY, 2011, 250 : 93 - +
  • [37] An index-based checkpointing algorithm for autonomous distributed systems
    Baldoni, R
    Quaglia, F
    Fornara, P
    SIXTEENTH SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 1997, : 27 - 34
  • [38] CHECKPOINTING AND ROLLBACK-RECOVERY FOR DISTRIBUTED SYSTEMS
    KOO, R
    TOUEG, S
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1987, 13 (01) : 23 - 31
  • [39] FNB: Fast Non-Blocking Coordinated Checkpointing Protocol for Distributed Systems
    Zohra Abdelhafidi
    Mohamed Djoudi
    Nasreddine Lagraa
    Mohamed Bachir Yagoubi
    Theory of Computing Systems, 2015, 57 : 397 - 425
  • [40] A low-overhead checkpointing protocol for mobile networks
    Ahmed, RE
    Khaliq, A
    CCECE 2003: CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-3, PROCEEDINGS: TOWARD A CARING AND HUMANE TECHNOLOGY, 2003, : 1779 - 1782