Hybrid MPI/OpenMP Parallel Programming on Clusters of Multi-Core SMP Nodes

被引:0
|
作者
Rabenseifner, Rolf [1 ]
Hager, Georg [2 ]
Jost, Gabriele [3 ]
机构
[1] High Performance Comp Ctr Stuttgart HLRS, Stuttgart, Germany
[2] Erlangen Reg Comp Ctr RRZE, Erlangen, Germany
[3] Texas Adv Comp Ctr, Austin, TX 78758 USA
关键词
D O I
10.1109/.42
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Today most systems in high-performance computing (HPC) feature a hierarchical hardware design: Shared memory nodes with several multi-core CPUs are connected via a network infrastructure. Parallel programming must combine distributed memory parallelization on the node interconnect with shared memory parallelization inside each node. We describe potentials and challenges of the dominant programming models on hierarchically structured hardware: Pure MPI (Message Passing Interface), pure OpenMP (with distributed shared memory extensions) and hybrid MPI+OpenMP in several flavors. We pinpoint cases where a hybrid programming model can indeed be the superior solution because of reduced communication needs and memory consumption, or improved load balance. Furthermore we show that machine topology has a significant impact on performance for all parallelization strategies and that topology awareness should be built into all applications in the future. Finally we give an outlook on possible standardization goals and extensions that could make hybrid programming easier to do with performance in mind.
引用
下载
收藏
页码:427 / +
页数:2
相关论文
共 50 条
  • [31] Parallel Programming Tools for Multi-core Architectures
    Mohr, Bernd
    Krammer, Bettina
    Mix, Hartmut
    PARALLEL COMPUTING: FROM MULTICORES AND GPU'S TO PETASCALE, 2010, 19 : 643 - 652
  • [32] Parallel Dijkstra's Algorithm Based on Multi-core and MPI
    Han, Xiaogang
    Sun, Qinlei
    Fan, Jiangwei
    MACHINERY ELECTRONICS AND CONTROL ENGINEERING III, 2014, 441 : 750 - 753
  • [33] Automatic Tuning of Parallel Multigrid Solvers Using OpenMP/MPI Hybrid Parallel Programming Models
    Nakajima, Kengo
    HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2012, 2013, 7851 : 435 - 450
  • [34] Hybrid swarm intelligent parallel algorithm research based on multi-core clusters
    Li, Wenjing
    Bi, Yingzhou
    Zhu, Xiaofeng
    Yuan, Chang-an
    Zhang, Xiang-bo
    MICROPROCESSORS AND MICROSYSTEMS, 2016, 47 : 151 - 160
  • [35] Parallel Performance of Numerical Algorithms on Multi-core System Using OpenMP
    Sharma, Sanjay Kumar
    Gupta, Kusum
    ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY, VOL 2, 2013, 177 : 279 - 288
  • [36] Improved Parallel Lexical Analysis Using OpenMP on Multi-Core Machines
    Barve, Amit
    Joshi, Brijendra Kumar
    PROCEEDINGS OF 4TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND CONTROL(ICAC3'15), 2015, 49 : 211 - 219
  • [37] Hybrid MPI: Efficient Message Passing for Multi-core Systems
    Friedley, Andrew
    Bronevetsky, Greg
    Hoefler, Torsten
    Lumsdaine, Andrew
    2013 INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC), 2013,
  • [38] Gyrokinetic semi-lagrangian parallel simulation using a hybrid OpenMP/MPI programming
    Latu, G.
    Crouseilles, N.
    Grandgirard, V.
    Sonnendrucker, E.
    RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, 2007, 4757 : 356 - +
  • [39] An Efficient Programming Skeleton for Clusters of Multi-Core Processors
    Rad, Mina Hosseini
    Patooghy, Ahmad
    Fazeli, Mahdi
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2018, 46 (06) : 1094 - 1109
  • [40] An Efficient Programming Skeleton for Clusters of Multi-Core Processors
    Mina Hosseini Rad
    Ahmad Patooghy
    Mahdi Fazeli
    International Journal of Parallel Programming, 2018, 46 : 1094 - 1109