A new log-based approach to independent recovery in distributed shared memory systems

被引:0
|
作者
Lin, JW [1 ]
Kuo, SY [1 ]
机构
[1] Natl Taiwan Univ, Dept Elect Engn, Taipei 106, Taiwan
关键词
rollback recovery; distributed shared memory; independent checkpointing; logging; trace-driven simulation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates the problem of rollback recovery in distributed shared memory (DSM) systems. We propose a new log-based recovery approach, which can tolerate multiple node failures. The recovery approach employs an independent checkpointing technique and a new logging scheme. The independent checkpointing technique periodically interrupts the execution of a node to save the node's state. The new logging scheme takes advantage of the DSM's unique properties to reduce the logging overhead. Based on the proposed recovery approach, the pre-failure state of a faulty node can be deterministically created without involving any fault-free node. In addition, some consistency information may be lost after a node becomes faulty. To reconstruct the lost consistency information, we also present an efficient consistency reconstruction method in this paper. Finally, extensive trace-driven simulations are performed to show the effectiveness of the new logging scheme.
引用
收藏
页码:271 / 290
页数:20
相关论文
共 50 条
  • [1] New log-based approach to independent recovery in distributed shared memory systems
    Lin, Jenn-Wei
    Kuo, Sy-Yen
    2000, IIS, Taipei, Taiwan (16)
  • [2] Log-based rollback recovery without checkpoints of shared memory in software DSM
    Park, S
    Maeng, SR
    JOURNAL OF SUPERCOMPUTING, 2006, 35 (02): : 141 - 154
  • [3] Log-Based Rollback Recovery without Checkpoints of Shared Memory in Software DSM
    Soyeon Park
    Seung Ryoul Maeng
    The Journal of Supercomputing, 2006, 35 : 141 - 154
  • [4] Distributed Log-based Reconciliation
    Chong, Yek Loong
    Hamadi, Youssef
    ECAI 2006, PROCEEDINGS, 2006, 141 : 108 - +
  • [5] Falcon: A Practical Log-based Analysis Tool for Distributed Systems
    Neves, Francisco
    Machado, Nuno
    Pereira, Jose
    2018 48TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN), 2018, : 534 - 541
  • [6] The design of efficient initialization and crash recovery for log-based file systems over flash memory
    National Taiwan University
    不详
    不详
    不详
    不详
    ACM Trans. Storage, 2006, 4 (449-467):
  • [7] FASTM: A Log-based Hardware Transactional Memory with Fast Abort Recovery
    Lupon, Marc
    Magklis, Grigorios
    Gonzalez, Antonio
    18TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 2009, : 293 - +
  • [8] Fault recovery for distributed shared memory systems
    Dieter, WR
    Lumpp, JE
    1997 IEEE AEROSPACE CONFERENCE PROCEEDINGS, VOL 2, 1997, : 525 - 540
  • [9] LogTM: Log-based transactional memory
    Moore, Kevin E.
    Bobba, Jayararn
    Moravan, Michelle J.
    Hill, Mark D.
    Wood, David A.
    TWELFTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 2006, : 258 - +
  • [10] A Low-Memory Management for Log-based File Systems on Flash Memory
    Yang, Shun-Fa
    Wu, Chin-Hsien
    2009 15TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2009, : 219 - 227