Design and implementation of dynamic process management for grid-enabled MPICH

Authors
Kim, S [1]
Woo, N
Yeom, HY
Park, T
Park, HW
Affiliations
[1] Seoul Natl Univ, Sch Engn & Comp Sci, Seoul 151742, South Korea
[2] Sejong Univ, Dept Comp Engn, Seoul 143747, South Korea
[3] KISTI, Supercomp Ctr, Taejon, South Korea
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory and methods];
Discipline classification code
081202;
Abstract
This paper presents the design and implementation of MPI-Rejoin() for MPICH-GF, a grid-enabled, fault-tolerant MPICH implementation. To provide fault tolerance to MPI applications, a failed process must be able to recover and continue execution. However, current MPI implementations do not support dynamic process management, and it is not possible to restore the information regarding communication channels. The 'rejoin' operation allows the restored process to rejoin the existing group by updating the corresponding entries of the channel table with the new physical address. We have verified that our implementation can correctly reconstruct the MPI communication structure by running NPB applications. We also report on the cost of the 'rejoin' operation.
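The channel-table update described in the abstract can be illustrated with a small sketch. This is not the MPICH-GF implementation: the names `ChannelEntry`, `build_tables`, and `rejoin`, the `epoch` counter, and the host names and ports are all hypothetical, and the code models only the bookkeeping (each surviving process replacing its stored address for the restored rank), not any real network reconnection.

```python
from dataclasses import dataclass


@dataclass
class ChannelEntry:
    """One row of a hypothetical channel table: where a given rank can be reached."""
    host: str
    port: int
    epoch: int = 0  # illustrative counter, incremented each time the rank rejoins


def build_tables(addresses):
    """Give each rank its own private copy of the rank -> address mapping."""
    return {
        owner: {r: ChannelEntry(h, p) for r, (h, p) in addresses.items()}
        for owner in addresses
    }


def rejoin(channel_tables, rank, new_host, new_port):
    """Record the new physical address of `rank` in every other process's table.

    `channel_tables` maps each process rank to that process's own table.
    In this sketch the restored process's own table is left untouched.
    """
    for owner, table in channel_tables.items():
        if owner == rank:
            continue
        entry = table[rank]
        entry.host, entry.port = new_host, new_port
        entry.epoch += 1


# Three processes; rank 1 fails and is restored at a different (made-up) address.
tables = build_tables({0: ("node-a", 5000), 1: ("node-b", 5001), 2: ("node-c", 5002)})
rejoin(tables, rank=1, new_host="node-d", new_port=6001)
print(tables[0][1])
```

After the call, ranks 0 and 2 address rank 1 at its new location, while their entries for each other are unchanged, which is the property the 'rejoin' operation is described as providing.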
Pages: 653-656
Number of pages: 4