Performance-based parallel loop self-scheduling using hybrid OpenMP and MPI programming on multicore SMP clusters

Cited by: 7
Authors
Yang, Chao-Tung [1]
Wu, Chao-Chin [2]
Chang, Jen-Hsiang [1, 3]
Affiliations
[1] Tunghai Univ, Dept Comp Sci, Taichung 40704, Taiwan
[2] Natl Changhua Univ Educ, Dept Comp Sci & Informat Engn, Changhua 500, Taiwan
[3] Tungs Taichung MeteroHarbor Hosp, Taichung 435, Taichung County, Taiwan
Keywords
parallel loop; self-scheduling; multicore; SMP cluster; hybrid; OpenMP; MPI; SCHEME; INFRASTRUCTURE;
DOI
10.1002/cpe.1627
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Parallel loop self-scheduling on parallel and distributed systems has been a critical problem, and it is becoming more difficult to deal with in emerging heterogeneous cluster computing environments. Several self-scheduling schemes have previously been proposed for heterogeneous cluster computing environments. In recent years, multicore computers have been widely included in cluster systems. However, previous research into parallel loop self-scheduling did not consider certain aspects of multicore computers; for example, it is more appropriate for shared-memory multiprocessors to adopt Open Multi-Processing (OpenMP) for parallel programming. In this paper, we propose a performance-based approach using hybrid OpenMP and MPI parallel programming, which partitions loop iterations according to the performance weighting of multicore nodes in a cluster. Because iterations assigned to one MPI process are processed in parallel by OpenMP threads run by the processor cores in the same computational node, the number of loop iterations allocated to one computational node at each scheduling step depends on the number of processor cores in that node. Experimental results show that the proposed approach performs better than previous schemes. Copyright (C) 2010 John Wiley & Sons, Ltd.
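The partitioning idea in the abstract can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the paper's algorithm: the abstract does not give the weighting formula, so the function name `weighted_chunks`, the weight definition (core count times an optional relative speed), and the `fraction` parameter (share of the remaining iterations handed out per scheduling step) are all hypothetical. Real self-scheduling is demand-driven, with idle MPI processes requesting work; here the nodes are simply served in order at each step.

```python
def weighted_chunks(total_iters, cores, speeds=None, fraction=0.5, min_chunk=1):
    """Split a loop of `total_iters` iterations into per-node chunks.

    Assumptions (not taken from the paper):
      - a node's performance weight is cores[i] * speeds[i];
      - each scheduling step hands out `fraction` of the remaining iterations;
      - within a step, a node's chunk is proportional to its weight.
    Returns a list of (node, start, end) half-open iteration ranges.
    """
    if speeds is None:
        speeds = [1.0] * len(cores)
    weights = [c * s for c, s in zip(cores, speeds)]
    total_weight = sum(weights)

    schedule = []
    start = 0
    remaining = total_iters
    while remaining > 0:
        # Iterations to distribute in this scheduling step.
        batch = min(remaining, max(min_chunk, int(remaining * fraction)))
        step_left = batch
        for node, w in enumerate(weights):
            if step_left == 0:
                break
            # Chunk size scales with the node's share of the total weight.
            size = max(min_chunk, round(batch * w / total_weight))
            size = min(size, step_left)
            schedule.append((node, start, start + size))
            start += size
            step_left -= size
        # Iterations lost to rounding stay in the pool for the next step.
        remaining -= batch - step_left
    return schedule


if __name__ == "__main__":
    # Hypothetical cluster: one 4-core node and two 2-core nodes.
    for node, lo, hi in weighted_chunks(1000, [4, 2, 2])[:3]:
        print(f"node {node}: iterations [{lo}, {hi})")
```

With the hypothetical cluster above, the first scheduling step distributes 500 iterations as 250, 125, and 125, so the 4-core node receives twice the work of each 2-core node. In a hybrid implementation, each chunk would go to one MPI process, and the iterations inside it would be shared among that node's OpenMP threads.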
Pages: 721 - 744
Page count: 24
Related papers
42 in total
  • [1] Using hybrid MPI and OpenMP programming to optimize communications in parallel loop self-scheduling schemes for multicore PC clusters
    Chao-Chin Wu
    Lien-Fu Lai
    Chao-Tung Yang
    Po-Hsun Chiu
    [J]. The Journal of Supercomputing, 2012, 60 : 31 - 61
  • [2] Using hybrid MPI and OpenMP programming to optimize communications in parallel loop self-scheduling schemes for multicore PC clusters
    Wu, Chao-Chin
    Lai, Lien-Fu
    Yang, Chao-Tung
    Chiu, Po-Hsun
    [J]. JOURNAL OF SUPERCOMPUTING, 2012, 60 (01) : 31 - 61
  • [3] Performance-Based Parallel Loop Self-scheduling on Heterogeneous Multicore PC Clusters
    Yang, Chao-Tung
    Chang, Jen-Hsiang
    Wu, Chao-Chin
    [J]. HIGH PERFORMANCE COMPUTING AND APPLICATIONS, 2010, 5938 : 509 - +
  • [4] Hybrid CUDA, OpenMP, and MPI parallel programming on multicore GPU clusters
    Yang, Chao-Tung
    Huang, Chih-Lin
    Lin, Cheng-Fang
    [J]. COMPUTER PHYSICS COMMUNICATIONS, 2011, 182 (01) : 266 - 269
  • [5] Designing parallel loop self-scheduling schemes using the hybrid MPI and OpenMP programming model for multi-core grid systems
    Chao-Chin Wu
    Chao-Tung Yang
    Kuan-Chou Lai
    Po-Hsun Chiu
    [J]. The Journal of Supercomputing, 2012, 59 : 42 - 60
  • [6] Designing parallel loop self-scheduling schemes using the hybrid MPI and OpenMP programming model for multi-core grid systems
    Wu, Chao-Chin
    Yang, Chao-Tung
    Lai, Kuan-Chou
    Chiu, Po-Hsun
    [J]. JOURNAL OF SUPERCOMPUTING, 2012, 59 (01) : 42 - 60
  • [7] Hybrid Parallel Programming on SMP Clusters Using XPFortran and OpenMP
    Zhang, Yuanyuan
    Iwashita, Hidetoshi
    Ishii, Kuninori
    Kaneko, Masanori
    Nakamura, Tomotake
    Hotta, Kohichiro
    [J]. BEYOND LOOP LEVEL PARALLELISM IN OPENMP: ACCELERATORS, TASKING AND MORE, PROCEEDINGS, 2010, 6132 : 133 - 148
  • [8] Hybrid MPI/OpenMP Parallel Programming on Clusters of Multi-Core SMP Nodes
    Rabenseifner, Rolf
    Hager, Georg
    Jost, Gabriele
    [J]. PROCEEDINGS OF THE PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, 2009, : 427 - +
  • [9] A performance-based parallel loop self-scheduling on grid computing environments
    Shih, WC
    Yang, CT
    Tseng, SS
    [J]. NETWORK AND PARALLEL COMPUTING, PROCEEDINGS, 2005, 3779 : 48 - 55
  • [10] A hybrid MPI/OpenMP implementation of a parallel 3-D FFT on SMP clusters
    Takahashi, Daisuke
    [J]. PARALLEL PROCESSING AND APPLIED MATHEMATICS, 2006, 3911 : 970 - 977