Performance Characteristics of Hybrid MPI/OpenMP Implementations of NAS Parallel Benchmarks SP and BT on Large-Scale Multicore Clusters

Cited by: 9
Authors
Wu, Xingfu [1]
Taylor, Valerie [1]
Affiliations
[1] Texas A&M Univ, Dept Comp Sci & Engn, College Stn, TX 77843 USA
Source
COMPUTER JOURNAL | 2012 / Vol. 55 / Issue 02
Funding
National Science Foundation (USA);
Keywords
performance characteristics; hybrid MPI; OpenMP; NAS Parallel Benchmarks; multicore clusters;
DOI
10.1093/comjnl/bxr063
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract
The NAS Parallel Benchmarks (NPB) are well-known applications with fixed algorithms for evaluating parallel systems and tools. Multicore clusters provide a natural programming paradigm for hybrid programs, whereby OpenMP can be used with the data sharing with the multicores that comprise a node, and MPI can be used with the communication between nodes. In this paper, we use Scalar Pentadiagonal (SP) and Block Tridiagonal (BT) benchmarks of MPI NPB 3.3 as a basis for a comparative approach to implement hybrid MPI/OpenMP versions of SP and BT. In particular, we can compare the performance of the hybrid SP and BT with the MPI counterparts on large-scale multicore clusters, Intrepid (BlueGene/P) at Argonne National Laboratory and Jaguar (Cray XT4/5) at Oak Ridge National Laboratory. Our performance results indicate that the hybrid SP outperforms the MPI SP by up to 20.76%, and the hybrid BT outperforms the MPI BT by up to 8.58% on up to 10 000 cores on Intrepid and Jaguar. We also use performance tools and MPI trace libraries available on these clusters to further investigate the performance characteristics of the hybrid SP and BT.
Pages: 154 - 167
Page count: 14
Related papers
29 in total
  • [1] Performance modeling of hybrid MPI/OpenMP scientific applications on large-scale multicore supercomputers
    Wu, Xingfu
    Taylor, Valerie
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2013, 79 (08) : 1256 - 1268
  • [2] Hybrid CUDA, OpenMP, and MPI parallel programming on multicore GPU clusters
    Yang, Chao-Tung
    Huang, Chih-Lin
    Lin, Cheng-Fang
    [J]. COMPUTER PHYSICS COMMUNICATIONS, 2011, 182 (01) : 266 - 269
  • [3] Performance analysis of large-scale OpenMP and hybrid MPI/OpenMP applications with Vampir NG
    Brunst, Holger
    Mohr, Bernd
    [J]. OPENMP SHARED MEMORY PARALLEL PROGRAMMING, PROCEEDINGS, 2008, 4315 : 5 - +
  • [4] Practical simulation of large-scale parallel programs and its performance analysis of the NAS parallel benchmarks
    Kubota, K
    Itakura, K
    Sato, M
    Boku, T
    [J]. EURO-PAR '98 PARALLEL PROCESSING, 1998, 1470 : 244 - 254
  • [5] Performance Characteristics of Hybrid MPI/OpenMP Scientific Applications on a Large-scale Multithreaded BlueGene/Q Supercomputer
    Wu, Xingfu
    Taylor, Valerie
    [J]. 2013 14TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD 2013), 2013, : 303 - 309
  • [6] Performance characteristics of hybrid MPI/OpenMP scientific applications on a large-scale multithreaded BlueGene/Q supercomputer
    Wu X.
    Taylor V.
    [J]. International Journal of Networked and Distributed Computing, 2013, 1 (4) : 213 - 225
  • [7] Performance Characteristics of Hybrid MPI/OpenMP Scientific Applications on a Large-scale Multithreaded BlueGene/Q Supercomputer
    Wu, Xingfu
    Taylor, Valerie
    [J]. INTERNATIONAL JOURNAL OF NETWORKED AND DISTRIBUTED COMPUTING, 2013, 1 (04) : 213 - 225
  • [8] High-performance parallel implementations of the NAS kernel benchmarks on the IBM SP2
    Thomas J. Watson Research Cent, Yorktown Heights, United States
    [J]. IBM Syst J, 2 (263-272):
  • [9] HIGH-PERFORMANCE PARALLEL IMPLEMENTATIONS OF THE NAS KERNEL BENCHMARKS ON THE IBM SP2
    AGARWAL, RC
    ALPERN, B
    CARTER, L
    GUSTAVSON, FG
    KLEPACKI, DJ
    LAWRENCE, R
    ZUBAIR, M
    [J]. IBM SYSTEMS JOURNAL, 1995, 34 (02) : 263 - 272
  • [10] Performance-based parallel loop self-scheduling using hybrid OpenMP and MPI programming on multicore SMP clusters
    Yang, Chao-Tung
    Wu, Chao-Chin
    Chang, Jen-Hsiang
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2011, 23 (08) : 721 - 744