A quasi-synchronous approach for roll-forward recovery in distributed systems

Cited by: 0
Authors
Liu, H [1 ]
Shen, L [1 ]
Gu, M [1 ]
Gupta, B [1 ]
Affiliation
[1] So Illinois Univ, Dept Comp Sci, Carbondale, IL 62901 USA
Keywords
communication-induced checkpointing; synchronous checkpointing; forced checkpoints; consistency;
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
In this paper, a quasi-synchronous approach for checkpointing/recovery is proposed. It uses a new concept of forced checkpoint, which helps ensure a small re-execution time after recovery from a failure. The approach offers a very simple recovery scheme, comparable to that of the synchronous approach, even though, unlike the synchronous approach, it does not require synchronization among the processes.
Pages: 2117-2122
Page count: 6
Related papers
50 in total
  • [1] Design of new roll-forward recovery approach for distributed systems
    Gupta, B
    Banerjee, SK
    Liu, B
    [J]. IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 2002, 149 (03): : 105 - 112
  • [2] A hybrid roll-forward recovery scheme for distributed systems
    Gupta, B
    Mogharreban, N
    Banerjee, SK
    [J]. PDPTA'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, 2001, : 48 - 54
  • [3] Quasi-synchronous approach for distributed control in synchronous systems
    Yeddes, M
    Mullins, J
    [J]. PROCEEDINGS OF THE 2001 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL (ISIC'01), 2001, : 231 - 235
  • [4] Novel low-overhead roll-forward recovery scheme for distributed systems
    Gupta, B.
    Rahimi, S.
    Liu, Z.
    [J]. IET COMPUTERS AND DIGITAL TECHNIQUES, 2007, 1 (04): : 397 - 404
  • [5] Roll-forward error recovery in embedded real-time systems
    Xu, J
    Randell, B
    [J]. 1996 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 1996, : 414 - 421
  • [6] A Validation Approach for Quasi-Synchronous Checkpointing Algorithms in HPC Systems
    Khlif, Houda
    Kacem, Hatem Hadj
    Hernandez, Saul E. Pomares
    Kacem, Ahmed Hadj
    [J]. 2017 IEEE/ACS 14TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2017, : 622 - 629
  • [7] A New Roll-Forward Checkpointing / Recovery Mechanism for Cluster Federation
    Gupta, B.
    Rahimi, S.
    Ahmad, R.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2006, 6 (11): : 292 - 298
  • [8] Adaptive control in roll-forward recovery for extreme scale multigrid
    Huber, Markus
    Ruede, Ulrich
    Wohlmuth, Barbara
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2019, 33 (05): : 817 - 837
  • [9] An efficient validation approach for quasi-synchronous checkpointing oriented to distributed diagnosability
    Khlif, Houda
    Kacem, Hatem Hadj
    Pomares Hernandez, Saul E.
    Kacem, Ahmed Hadj
    Eichler, Cedric
    Calixto Simon, Alberto
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2016, 122 : 364 - 377
  • [10] VERIFICATION OF QUASI-SYNCHRONOUS SYSTEMS WITH UPPAAL
    Bhattacharyya, S.
    Miller, S.
    Yang, J.
    Smolka, S.
    Meng, B.
    Sticksel, C.
    Tinelli, C.
    [J]. 2014 IEEE/AIAA 33RD DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2014,