Consistent checkpointing for high performance clusters

Authors
Nishioka, T [1]
Hori, A [1]
Ishikawa, Y [1]
Affiliations
[1] MRI Syst Inc, Chuo Ku, Tokyo 1040053, Japan
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
This paper describes a consistent checkpointing (CCP) technique for high performance clusters. The proposed CCP method uses the local disks of the cluster nodes to obtain high scalability. It is evaluated using the NAS parallel benchmark programs. The evaluation results show that scalability is slightly degraded because the I/O performance of the local disks varies widely, but it is far better than in the case where a centralized NFS server is used as stable storage.
Pages: 367-368
Page count: 2