Performance-based parallel loop self-scheduling using hybrid OpenMP and MPI programming on multicore SMP clusters

Cited by: 7

Authors:

Yang, Chao-Tung [1]

Wu, Chao-Chin [2]

Chang, Jen-Hsiang [1,3]

Affiliations:

[1] Tunghai Univ, Dept Comp Sci, Taichung 40704, Taiwan

[2] Natl Changhua Univ Educ, Dept Comp Sci & Informat Engn, Changhua 500, Taiwan

[3] Tungs Taichung MetroHarbor Hosp, Taichung 435, Taichung County, Taiwan

Source:

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE | 2011 / Vol. 23 / Issue 08

Keywords:

parallel loop; self-scheduling; multicore; SMP cluster; hybrid; OpenMP; MPI; SCHEME; INFRASTRUCTURE;

DOI:

10.1002/cpe.1627

Chinese Library Classification (CLC) number:

TP31 [Computer software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

Parallel loop self-scheduling on parallel and distributed systems has been a critical problem and it is becoming more difficult to deal with in the emerging heterogeneous cluster computing environments. In the past, some self-scheduling schemes have been proposed as applicable to heterogeneous cluster computing environments. In recent years, multicore computers have been widely included in cluster systems. However, previous research into parallel loop self-scheduling did not consider certain aspects of multicore computers; for example, it is more appropriate for shared-memory multiprocessors to adopt Open Multi-Processing (OpenMP) for parallel programming. In this paper, we propose a performance-based approach using hybrid OpenMP and MPI parallel programming, which partitions loop iterations according to the performance weighting of multicore nodes in a cluster. Because iterations assigned to one MPI process are processed in parallel by OpenMP threads run by the processor cores in the same computational node, the number of loop iterations allocated to one computational node at each scheduling step depends on the number of processor cores in that node. Experimental results show that the proposed approach performs better than previous schemes. Copyright (C) 2010 John Wiley & Sons, Ltd.
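
The partitioning idea in the abstract can be illustrated with a small simulation. This is a minimal sketch, not the scheme from the paper: the weighting formula (per-core score times core count), the halving factor, and the function names `node_weights`, `next_chunk`, and `schedule` are all assumptions made for illustration. It only shows how a chunk handed to one node (one MPI process) at each scheduling step can depend on both the node's performance weight and its core count, so that the chunk splits evenly across that node's OpenMP threads.

```python
from math import ceil


def node_weights(perf_per_core, cores):
    """Normalized weight per node.

    Assumption: a node's raw weight is its per-core performance score
    multiplied by its core count. The paper's actual weighting may differ.
    """
    raw = [p * c for p, c in zip(perf_per_core, cores)]
    total = sum(raw)
    return [r / total for r in raw]


def next_chunk(remaining, weight, cores):
    """Iterations handed to one node at one scheduling step.

    Assumption: the node receives its weighted share of half the remaining
    iterations, rounded up to a multiple of its core count so every
    OpenMP thread on the node gets whole iterations.
    """
    if remaining <= 0:
        return 0
    share = ceil(remaining * weight / 2)
    chunk = ceil(share / cores) * cores
    return min(chunk, remaining)


def schedule(total_iterations, perf_per_core, cores):
    """Simulate nodes requesting work in round-robin order.

    Returns a list of (node, start_iteration, chunk_size) tuples that
    together cover the whole iteration space exactly once.
    """
    weights = node_weights(perf_per_core, cores)
    remaining, start, plan = total_iterations, 0, []
    while remaining > 0:
        for node, (w, c) in enumerate(zip(weights, cores)):
            size = next_chunk(remaining, w, c)
            if size == 0:
                break
            plan.append((node, start, size))
            start += size
            remaining -= size
    return plan


if __name__ == "__main__":
    # Two hypothetical nodes with equal per-core speed: 4 cores and 2 cores.
    for node, start, size in schedule(1000, [1.0, 1.0], [4, 2]):
        print(f"node {node}: iterations [{start}, {start + size})")
```

In a real hybrid program, a master MPI process would send each `(start, size)` pair to the requesting node, and that node would run the chunk inside an OpenMP parallel loop; the sketch leaves out all communication and only models the chunk-size calculation.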


Pages: 721-744

Page count: 24
