Enhancing MPI plus OpenMP Task Based Applications for Heterogeneous Architectures with GPU Support

被引：1

作者：

Ferat, Manuel ^{[1
]}

Pereira, Romain ^{[2
,4
,5
]}

Roussel, Adrien ^{[3
,4
]}

Carribault, Patrick ^{[3
,4
]}

Steffenel, Luiz-Angelo ^{[1
]}

Gautier, Thierry ^{[5
]}

机构：

[1] Univ Reims, LRC DIGIT, LICIIS, F-51097 Reims, France

[2] CEA, DAM, DIF, F-91297 Arpajon, France

[3] CEA, DAM, DIF, LRC DIGIT, F-91297 Arpajon, France

[4] Univ Paris Saclay, CEA, Lab Informat Haute Performance Calcul & Simulat, F-91680 Bruyeres Le Chatel, France

[5] ENS Lyon, LIP, Project Team AVALON INRIA, Lyon, France

来源：

OPENMP IN A MODERN WORLD: FROM MULTI-DEVICE SUPPORT TO META PROGRAMMING | 2022年 / 13527卷

关键词：

OpenMP; GPU Computing; Distributed Application; Task programming;

D O I：

10.1007/978-3-031-15922-0_1

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Heterogeneous supercomputers are widespread over HPC systems and programming efficient applications on these architectures is a challenge. Task-based programming models are a promising way to tackle this challenge. Since OpenMP 4.0 and 4.5, the target directives enable to offload pieces of code to GPUs and to express it as tasks with dependencies. Therefore, heterogeneous machines can be programmed using MPI+OpenMP(task+target) to exhibit a very high level of concurrent asynchronous operations for which data transfers, kernel executions, communications and CPU computations can be overlapped. Hence, it is possible to suspend tasks performing these asynchronous operations on the CPUs and to overlap their completion with another task execution. Suspended tasks can resume once the associated asynchronous event is completed in an opportunistic way at every scheduling point. We have integrated this feature into the MPC framework and validated it on a AXPY microbenchmark and evaluated on a MPI+OpenMP(tasks) implementation of the LULESH proxy applications. The results show that we are able to improve asynchronism and the overall HPC performance, allowing applications to benefit from asynchronous execution on heterogeneous machines.

引用

页码：3 / 16

页数：14

共 50 条

[1] Automatic Partitioning of MPI Operations in MPI plus OpenMP Applications
Jammer, Tim
Bischof, Christian
HIGH PERFORMANCE COMPUTING - ISC HIGH PERFORMANCE DIGITAL 2021 INTERNATIONAL WORKSHOPS, 2021, 12761 : 191 - 198
[2] Investigating Dependency Graph Discovery Impact on Task-based MPI plus OpenMP Applications Performances
Pereira, Romain
Roussel, Adrien
Carribault, Patrick
Gautier, Thierry
PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2023, 2023, : 163 - 172
[3] Communication-Aware Task Scheduling Strategy in Hybrid MPI plus OpenMP Applications
Pereira, Romain
Roussel, Adrien
Carribault, Patrick
Gautier, Thierry
OPENMP: ENABLING MASSIVE NODE-LEVEL PARALLELISM, IWOMP 2021, 2021, 12870 : 197 - 210
[4] MPI Thread-Level Checking for MPI plus OpenMP Applications
Saillard, Emmanuelle
Carribault, Patrick
Barthou, Denis
EURO-PAR 2015: PARALLEL PROCESSING, 2015, 9233 : 31 - 42
[5] Enhancing Heterogeneous Computing Through OpenMP and GPU Graph
Yu, Chenle
Royuela, Sara
Quinones, Eduardo
53RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2024, 2024, : 534 - 543
[6] Toward Heterogeneous MPI plus MPI Programming: Comparison of OpenMP and MPI Shared Memory Models
Szustak, Lukasz
Wyrzykowski, Roman
Halbiniak, Kamil
Bratek, Pawel
EURO-PAR 2019: PARALLEL PROCESSING WORKSHOPS, 2020, 11997 : 270 - 281
[7] Dynamic load balancing of MPI plus OpenMP applications
Corbalán, J
Duran, A
Labarta, J
2004 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, PROCEEDINGS, 2004, : 195 - 202
[8] CHAMELEON: Reactive Load Balancing for Hybrid MPI plus OpenMP Task-Parallel Applications
Klinkenberg, Jannis
Samfass, Philipp
Bader, Michael
Terboven, Christian
Mueller, Matthias S.
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 138 (55-64) : 55 - 64
[9] User Co-scheduling for MPI plus OpenMP Applications Using OpenMP Semantics
Capra, Antoine
Carribault, Patrick
Besnard, Jean-Baptiste
Malony, Allen D.
Perache, Marc
Jaeger, Julien
SCALING OPENMP FOR EXASCALE PERFORMANCE AND PORTABILITY (IWOMP 2017), 2017, 10468 : 203 - 216
[10] Enhancing OCM to support MPI applications
Bubak, M
Funika, W
Gembarowski, R
Hodurek, P
Wismüller, R
HIGH-PERFORMANCE COMPUTING AND NETWORKING, PROCEEDINGS, 1999, 1593 : 1274 - 1277

← 1 2 3 4 5 →