Hybrid MPI/OpenMP Parallel Programming on Clusters of Multi-Core SMP Nodes

Authors
Rabenseifner, Rolf [1]
Hager, Georg [2]
Jost, Gabriele [3]
Affiliations
[1] High Performance Comp Ctr Stuttgart HLRS, Stuttgart, Germany
[2] Erlangen Reg Comp Ctr RRZE, Erlangen, Germany
[3] Texas Adv Comp Ctr, Austin, TX 78758 USA
DOI
10.1109/.42
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Today, most systems in high-performance computing (HPC) feature a hierarchical hardware design: shared-memory nodes with several multi-core CPUs are connected via a network infrastructure. Parallel programming must combine distributed-memory parallelization on the node interconnect with shared-memory parallelization inside each node. We describe the potential and challenges of the dominant programming models on hierarchically structured hardware: pure MPI (Message Passing Interface), pure OpenMP (with distributed shared memory extensions), and hybrid MPI+OpenMP in several flavors. We pinpoint cases where a hybrid programming model can indeed be the superior solution because of reduced communication needs and memory consumption, or improved load balance. Furthermore, we show that machine topology has a significant impact on performance for all parallelization strategies and that topology awareness should be built into all applications in the future. Finally, we give an outlook on possible standardization goals and extensions that could make hybrid programming easier to do with performance in mind.
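The claim about reduced communication needs and memory consumption can be illustrated with a rough back-of-the-envelope model. The sketch below is not taken from the paper: it assumes a cubic grid with periodic boundaries, cubic subdomains, and a halo (ghost-cell) layer of fixed width, and it uses the total number of halo cells as a proxy for both halo memory and exchange volume. The function name `halo_cells` and the example sizes (64 nodes with 8 cores each) are hypothetical.

```python
def halo_cells(n, parts_per_dim, width=1):
    """Total halo cells over all subdomains of an n^3 grid.

    Assumptions (illustrative only): the grid is split into
    parts_per_dim^3 equal cubic blocks, boundaries are periodic so every
    block carries a full halo shell, and the halo is `width` cells thick.
    """
    if n % parts_per_dim != 0:
        raise ValueError("n must be divisible by parts_per_dim")
    m = n // parts_per_dim                      # edge length of one block
    per_block = (m + 2 * width) ** 3 - m ** 3   # shell around one block
    return parts_per_dim ** 3 * per_block


if __name__ == "__main__":
    n = 240
    # Pure MPI: one rank per core, 64 nodes x 8 cores = 512 = 8^3 blocks.
    pure_mpi = halo_cells(n, 8)
    # Hybrid: one MPI rank per node with 8 OpenMP threads, 64 = 4^3 blocks.
    hybrid = halo_cells(n, 4)
    print("pure MPI halo cells:", pure_mpi)
    print("hybrid halo cells:  ", hybrid)
    print("ratio:", pure_mpi / hybrid)
```

In this simplified model, fewer and larger subdomains carry less total halo data. Whether that translates into better performance on a real machine depends on factors the model ignores, such as intra-node communication cost, thread overhead, and machine topology.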
Pages: 427 / +
Number of pages: 2