Consistent checkpointing for high performance clusters

被引:0
|
作者
Nishioka, T [1 ]
Hori, A [1 ]
Ishikawa, Y [1 ]
机构
[1] MRI Syst Inc, Chuo Ku, Tokyo 1040053, Japan
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper describes a consistent checkpointing (CCP) technique for high performance clusters. The proposed CCP method uses local disks of cluster nodes to obtain high scalability. It is evaluated using NAS parallel benchmark programs. The evaluation results show that the scalability is slightly degraded because the I/O performance of the local disks varies widely, but it is far better than Me case using a centralized NFS server as a stable storage.
引用
收藏
页码:367 / 368
页数:2
相关论文
共 50 条
  • [11] Transparent parallel checkpointing and migration in clusters and ClusterGrids
    Kovacs, Jozsef
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2009, 4 (03) : 171 - 181
  • [12] A New High Performance Checkpointing Approach for Mobile Computing Systems
    Gupta, Bidyut
    Rahimi, Shahram
    Liu, Ziping
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2006, 6 (5B): : 95 - 104
  • [13] CHPOX: Transparent checkpointing system for Linux clusters
    Sudakov, Oleksandr O.
    Meshcheriakov, Ievgenii S.
    Boyko, Yuriy V.
    IDAACS 2007: PROCEEDINGS OF THE 4TH IEEE WORKSHOP ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS: TECHNOLOGY AND APPLICATIONS, 2007, : 159 - +
  • [14] Checkpointing alternatives for high performance, power-aware processors
    Moshovos, A
    ISLPED'03: PROCEEDINGS OF THE 2003 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, 2003, : 318 - 321
  • [15] SFT: A Consistent Checkpointing Algorithm with Short Freezing Time
    魏晓辉
    鞠九滨
    Journal of Computer Science & Technology, 2000, (02) : 169 - 175
  • [16] Towards Fast Crash-Consistent Cluster Checkpointing
    Wood, Andrew
    Hershcovitch, Moshik
    Ennmouri, Ilias
    Zong, Weiyu
    Chennuri, Saurav
    Cohen, Sarel
    Sundararaman, Swaminathan
    Waddington, Daniel
    Chin, Peter
    2022 IEEE HIGH PERFORMANCE EXTREME COMPUTING VIRTUAL CONFERENCE (HPEC), 2022,
  • [17] Optimal Cooperative Checkpointing for Shared High-Performance Computing Platforms
    Herault, Thomas
    Robert, Yves
    Bouteiller, Aurelien
    Arnold, Dorian
    Ferreira, Kurt B.
    Bosilca, George
    Dongarra, Jack
    2018 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2018), 2018, : 803 - 812
  • [18] High performance linpack benchmark: A fault tolerant implementation without checkpointing
    Colorado School of Mines, Golden, CO, United States
    Proc Int Conf Supercomputing, (162-171):
  • [19] The performance of coordinated and independent checkpointing
    Silva, LM
    Silva, JG
    IPPS/SPDP 1999: 13TH INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM & 10TH SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 1999, : 280 - 284
  • [20] PERFORMANCE ANALYSIS OF CHECKPOINTING STRATEGIES
    TANTAWI, AN
    RUSCHITZKA, M
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 1984, 2 (02): : 123 - 144