Checkpointing speculative distributed shared memory

Authors
Danilecki, Arkadiusz [1]
Kobusinska, Anna [1]
Szychowiak, Michal [1]
Affiliations
[1] Poznan Univ Tech, Inst Comp Sci, PL-60965 Poznan, Poland
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper describes a checkpointing mechanism designed for Distributed Shared Memory (DSM) systems with speculative prefetching. Speculation is a general technique that predicts the future of a computation, in this case accesses to shared objects that are unavailable on the accessing node (read faults). With such predictions, objects can be fetched before the actual access operation is performed, potentially yielding a considerable performance improvement. The proposed mechanism is based on independent checkpointing integrated with a coherence protocol for a given consistency model, and it introduces little overhead. It ensures the consistency of checkpoints, allowing fast recovery from failures.
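The two ideas in the abstract, prefetching an object before a read fault occurs and saving a node's state locally so it can be restored after a failure, can be illustrated with a minimal Python sketch. This is not the paper's mechanism: the `Node` class, the successor-based predictor, the `home` dictionary standing in for remote object owners, and all method names are assumptions made for illustration only, and the sketch omits the coherence protocol entirely.

```python
import copy


class Node:
    """Hypothetical DSM node: a local object cache plus a local checkpoint."""

    def __init__(self, home):
        self.home = home        # assumption: dict standing in for remote owners
        self.cache = {}         # object id -> locally available value
        self.checkpoint = {}    # last independently saved copy of the cache
        self.read_faults = 0    # reads that had to wait for a remote fetch
        self.last_read = None   # previously read object id
        self.successor = {}     # learned pattern: object id -> id read next

    def _fetch(self, oid):
        # Stand-in for a remote fetch from the object's owner.
        self.cache[oid] = self.home[oid]

    def read(self, oid):
        if oid not in self.cache:
            # Read fault: the object is unavailable on the accessing node.
            self.read_faults += 1
            self._fetch(oid)
        # Learn which object tends to follow the previous one.
        if self.last_read is not None:
            self.successor[self.last_read] = oid
        self.last_read = oid
        # Speculate: prefetch the predicted next object before it is accessed.
        predicted = self.successor.get(oid)
        if predicted is not None and predicted not in self.cache:
            self._fetch(predicted)
        return self.cache[oid]

    def take_checkpoint(self):
        # Independent checkpoint: the node saves its own state without
        # coordinating with other nodes.
        self.checkpoint = copy.deepcopy(self.cache)

    def recover(self):
        # Roll the local state back to the last checkpoint.
        self.cache = copy.deepcopy(self.checkpoint)
```

In this sketch, reading objects `a`, `b`, `c` in order causes three read faults the first time. If the cache is then emptied and the same sequence is read again, only the access to `a` faults, because `b` and `c` are prefetched from the learned pattern. A real system would also need the consistency guarantees the abstract mentions, which this sketch does not attempt.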
Pages: 9-16
Page count: 8