A new log-based approach to independent recovery in distributed shared memory systems

Cited by: 0
Authors
Lin, JW [1]
Kuo, SY [1]
Affiliation
[1] Natl Taiwan Univ, Dept Elect Engn, Taipei 106, Taiwan
Keywords
rollback recovery; distributed shared memory; independent checkpointing; logging; trace-driven simulation
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
This paper investigates the problem of rollback recovery in distributed shared memory (DSM) systems. We propose a new log-based recovery approach, which can tolerate multiple node failures. The recovery approach employs an independent checkpointing technique and a new logging scheme. The independent checkpointing technique periodically interrupts the execution of a node to save the node's state. The new logging scheme takes advantage of the DSM's unique properties to reduce the logging overhead. Based on the proposed recovery approach, the pre-failure state of a faulty node can be deterministically created without involving any fault-free node. In addition, some consistency information may be lost after a node becomes faulty. To reconstruct the lost consistency information, we also present an efficient consistency reconstruction method in this paper. Finally, extensive trace-driven simulations are performed to show the effectiveness of the new logging scheme.
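The general idea of combining an independent checkpoint with a log, so that a failed node can rebuild its pre-failure state without help from fault-free nodes, can be sketched roughly as follows. This is a generic illustration only, not the logging scheme proposed in the paper: the `Node` class, its method names, and the page/value model are all hypothetical, and in-memory attributes stand in for stable storage.

```python
import copy


class Node:
    """Hypothetical DSM node; its state maps page names to values."""

    def __init__(self):
        self.state = {}       # volatile memory, lost on a crash
        self.checkpoint = {}  # stands in for stable storage (assumption)
        self.log = []         # stands in for stable storage (assumption)

    def take_checkpoint(self):
        # Independent checkpoint: save local state without coordinating
        # with other nodes, then discard log entries it already covers.
        self.checkpoint = copy.deepcopy(self.state)
        self.log.clear()

    def receive_page(self, page, value):
        # Data arriving from another node is the nondeterministic input,
        # so it is logged before being applied to local state.
        self.log.append((page, value))
        self.state[page] = value

    def crash(self):
        # Simulate a failure: only the volatile state is lost.
        self.state = {}

    def recover(self):
        # Restore the last checkpoint, then replay the logged inputs in
        # their original order; no other node is contacted.
        self.state = copy.deepcopy(self.checkpoint)
        for page, value in self.log:
            self.state[page] = value


node = Node()
node.receive_page("p0", 1)
node.take_checkpoint()
node.receive_page("p1", 2)
node.receive_page("p0", 3)
pre_failure = dict(node.state)

node.crash()
node.recover()
recovered_ok = node.state == pre_failure
```

In this sketch, replaying the log in its original order is what makes the recovered state deterministic. How the paper reduces the amount of logged data and reconstructs lost consistency information is not described in the abstract and is not modeled here.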
Pages: 271 - 290
Page count: 20