Direct-dependency-based checkpointing and recovery technique for distributed systems

Authors
Shen, L [1]
Liu, H [1]
Gu, M [1]
Gupta, B [1]
Affiliation
[1] So Illinois Univ, Dept Comp Sci, Carbondale, IL 62901 USA
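Keywords
recovery; communication-induced checkpointing; consistency
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract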
This paper presents an efficient checkpointing and recovery algorithm for distributed systems. It uses direct-dependency-based communication-induced checkpointing, with the exception that processes do not take any basic checkpoints. Each application message is required to piggyback only a single integer. All participating processes run the algorithm for finding a set of globally consistent checkpoints (GCC) simultaneously, which makes it execute faster.
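The abstract gives no algorithmic detail, so the following is only a minimal sketch of generic direct-dependency tracking, not the authors' algorithm. It assumes that the piggybacked integer is the sender's current checkpoint interval index, that each process saves its direct-dependency vector with every checkpoint, and that a consistent set is found by repeatedly rolling back any process whose checkpoint records an orphan message. The names `Process` and `recovery_line` are hypothetical, and the search runs in one place here, whereas the paper has all processes run it simultaneously.

```python
class Process:
    """Hypothetical process that tracks direct dependencies only."""

    def __init__(self, pid, n):
        self.pid = pid
        self.interval = 0                    # index of the latest checkpoint
        self.dep = [-1] * n                  # highest sender interval seen, per process
        self.checkpoints = [list(self.dep)]  # checkpoint 0 is the initial state

    def send(self):
        # Assumption: the single piggybacked integer is the sender's interval index.
        return (self.pid, self.interval)

    def receive(self, msg):
        sender, sender_interval = msg
        self.dep[sender] = max(self.dep[sender], sender_interval)

    def checkpoint(self):
        # Save the cumulative direct-dependency vector with the checkpoint.
        self.checkpoints.append(list(self.dep))
        self.interval += 1


def recovery_line(procs):
    """Return one checkpoint index per process such that no message is an orphan."""
    line = [len(p.checkpoints) - 1 for p in procs]
    changed = True
    while changed:
        changed = False
        for i, p in enumerate(procs):
            for j in range(len(procs)):
                # Checkpoint line[i] recorded a message that P_j sent at or after
                # its checkpoint line[j]; restarting P_j there undoes that send.
                if i != j and p.checkpoints[line[i]][j] >= line[j]:
                    line[i] -= 1
                    changed = True
                    break
    return line


# Usage: P0 checkpoints and then sends to P1; P1 checkpoints and then sends to P2.
p0, p1, p2 = (Process(k, 3) for k in range(3))
p0.checkpoint()
p1.receive(p0.send())
p1.checkpoint()
p2.receive(p1.send())
p2.checkpoint()
print(recovery_line([p0, p1, p2]))  # [1, 0, 0]
```

In this run P1's checkpoint records a message that P0 sent after its own latest checkpoint, so P1 has to fall back to its initial state, and that in turn forces P2 back as well. Only direct dependencies are stored; the transitive effect comes from iterating until nothing changes.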
Pages: 2123-2129
Number of pages: 7