Enhancing MPI plus OpenMP Task Based Applications for Heterogeneous Architectures with GPU Support

被引:1
|
作者
Ferat, Manuel [1 ]
Pereira, Romain [2 ,4 ,5 ]
Roussel, Adrien [3 ,4 ]
Carribault, Patrick [3 ,4 ]
Steffenel, Luiz-Angelo [1 ]
Gautier, Thierry [5 ]
机构
[1] Univ Reims, LRC DIGIT, LICIIS, F-51097 Reims, France
[2] CEA, DAM, DIF, F-91297 Arpajon, France
[3] CEA, DAM, DIF, LRC DIGIT, F-91297 Arpajon, France
[4] Univ Paris Saclay, CEA, Lab Informat Haute Performance Calcul & Simulat, F-91680 Bruyeres Le Chatel, France
[5] ENS Lyon, LIP, Project Team AVALON INRIA, Lyon, France
关键词
OpenMP; GPU Computing; Distributed Application; Task programming;
D O I
10.1007/978-3-031-15922-0_1
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Heterogeneous supercomputers are widespread over HPC systems and programming efficient applications on these architectures is a challenge. Task-based programming models are a promising way to tackle this challenge. Since OpenMP 4.0 and 4.5, the target directives enable to offload pieces of code to GPUs and to express it as tasks with dependencies. Therefore, heterogeneous machines can be programmed using MPI+OpenMP(task+target) to exhibit a very high level of concurrent asynchronous operations for which data transfers, kernel executions, communications and CPU computations can be overlapped. Hence, it is possible to suspend tasks performing these asynchronous operations on the CPUs and to overlap their completion with another task execution. Suspended tasks can resume once the associated asynchronous event is completed in an opportunistic way at every scheduling point. We have integrated this feature into the MPC framework and validated it on a AXPY microbenchmark and evaluated on a MPI+OpenMP(tasks) implementation of the LULESH proxy applications. The results show that we are able to improve asynchronism and the overall HPC performance, allowing applications to benefit from asynchronous execution on heterogeneous machines.
引用
收藏
页码:3 / 16
页数:14
相关论文
共 50 条
  • [1] Automatic Partitioning of MPI Operations in MPI plus OpenMP Applications
    Jammer, Tim
    Bischof, Christian
    HIGH PERFORMANCE COMPUTING - ISC HIGH PERFORMANCE DIGITAL 2021 INTERNATIONAL WORKSHOPS, 2021, 12761 : 191 - 198
  • [2] Investigating Dependency Graph Discovery Impact on Task-based MPI plus OpenMP Applications Performances
    Pereira, Romain
    Roussel, Adrien
    Carribault, Patrick
    Gautier, Thierry
    PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2023, 2023, : 163 - 172
  • [3] Communication-Aware Task Scheduling Strategy in Hybrid MPI plus OpenMP Applications
    Pereira, Romain
    Roussel, Adrien
    Carribault, Patrick
    Gautier, Thierry
    OPENMP: ENABLING MASSIVE NODE-LEVEL PARALLELISM, IWOMP 2021, 2021, 12870 : 197 - 210
  • [4] MPI Thread-Level Checking for MPI plus OpenMP Applications
    Saillard, Emmanuelle
    Carribault, Patrick
    Barthou, Denis
    EURO-PAR 2015: PARALLEL PROCESSING, 2015, 9233 : 31 - 42
  • [5] Enhancing Heterogeneous Computing Through OpenMP and GPU Graph
    Yu, Chenle
    Royuela, Sara
    Quinones, Eduardo
    53RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2024, 2024, : 534 - 543
  • [6] Toward Heterogeneous MPI plus MPI Programming: Comparison of OpenMP and MPI Shared Memory Models
    Szustak, Lukasz
    Wyrzykowski, Roman
    Halbiniak, Kamil
    Bratek, Pawel
    EURO-PAR 2019: PARALLEL PROCESSING WORKSHOPS, 2020, 11997 : 270 - 281
  • [7] Dynamic load balancing of MPI plus OpenMP applications
    Corbalán, J
    Duran, A
    Labarta, J
    2004 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, PROCEEDINGS, 2004, : 195 - 202
  • [8] CHAMELEON: Reactive Load Balancing for Hybrid MPI plus OpenMP Task-Parallel Applications
    Klinkenberg, Jannis
    Samfass, Philipp
    Bader, Michael
    Terboven, Christian
    Mueller, Matthias S.
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 138 (55-64) : 55 - 64
  • [9] User Co-scheduling for MPI plus OpenMP Applications Using OpenMP Semantics
    Capra, Antoine
    Carribault, Patrick
    Besnard, Jean-Baptiste
    Malony, Allen D.
    Perache, Marc
    Jaeger, Julien
    SCALING OPENMP FOR EXASCALE PERFORMANCE AND PORTABILITY (IWOMP 2017), 2017, 10468 : 203 - 216
  • [10] Enhancing OCM to support MPI applications
    Bubak, M
    Funika, W
    Gembarowski, R
    Hodurek, P
    Wismüller, R
    HIGH-PERFORMANCE COMPUTING AND NETWORKING, PROCEEDINGS, 1999, 1593 : 1274 - 1277