Fine-Grained MPI plus OpenMP Plasma Simulations: Communication Overlap with Dependent Tasks

被引:3
|
作者
Richard, Jerome [1 ,2 ]
Latu, Guillaume [1 ]
Bigot, Julien [3 ]
Gautier, Thierry [4 ]
机构
[1] CEA, IRFM, F-13108 St Paul Les Durance, France
[2] Zebrys, Toulouse, France
[3] Univ Paris Saclay, UVSQ, Univ Paris Sud, Maison Simulat,CEA,CNRS, Gif Sur Yvette, France
[4] Univ Lyon, INRIA, CNRS, ENS Lyon,Univ Claude Bernard Lyon 1,LIP, Lyon, France
来源
关键词
Dependent tasks; OpenMP; 4.5; MPI; Many-core;
D O I
10.1007/978-3-030-29400-7_30
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper demonstrates how OpenMP 4.5 tasks can be used to efficiently overlap computations and MPI communications based on a case-study conducted on multi-core and many-core architectures. It focuses on task granularity, dependencies and priorities, and also identifies some limitations of OpenMP. Results on 64 Skylake nodes show that while 64% of the wall-clock time is spent in MPI communications, 60% of the cores are busy in computations, which is a good result. Indeed, the chosen dataset is small enough to be a challenging case in terms of overlap and thus useful to assess worst-case scenarios in future simulations. Two key features were identified: by using task priority we improved the performance by 5.7% (mainly due to an improved overlap), and with recursive tasks we shortened the execution time by 9.7%. We also illustrate the need to have access to tools for task tracing and task visualization. These tools allowed a fine understanding and a performance increase for this task-based OpenMP+MPI code.
引用
收藏
页码:419 / 433
页数:15
相关论文
共 50 条
  • [21] Profiling techniques for communication in fine-grained parallel languages
    Scheiman, Chris J.
    Haake, Bjoern
    Ibel, Maximilian
    Schauser, Klaus E.
    Software - Practice and Experience, 1999, 29 (06): : 519 - 550
  • [22] Facilitating the Communication of Politeness through Fine-Grained Paraphrasing
    Fu, Liye
    Fussell, Susan R.
    Danescu-Niculescu-Mizil, Cristian
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 5127 - 5140
  • [23] A Fine-grained Modulation Technology for ZigBee Backscatter Communication
    Xu, Zhaoyuan
    Ding, Yuan
    Gong, Wei
    PROCEEDINGS OF THE 2022 21ST MEDITERRANEAN MICROWAVE SYMPOSIUM (MMS 2022), 2022, : 85 - 88
  • [24] Profiling techniques for communication in fine-grained parallel languages
    Scheiman, CJ
    Haake, B
    Ibel, M
    Schauser, KE
    SOFTWARE-PRACTICE & EXPERIENCE, 1999, 29 (06): : 519 - 550
  • [25] Enabling Fine-Grained OpenMP Tasking on Tightly-Coupled Shared Memory Clusters
    Burgio, Paolo
    Tagliavini, Giuseppe
    Marongiu, Andrea
    Benini, Luca
    DESIGN, AUTOMATION & TEST IN EUROPE, 2013, : 1504 - 1509
  • [26] Communication-Aware Task Scheduling Strategy in Hybrid MPI plus OpenMP Applications
    Pereira, Romain
    Roussel, Adrien
    Carribault, Patrick
    Gautier, Thierry
    OPENMP: ENABLING MASSIVE NODE-LEVEL PARALLELISM, IWOMP 2021, 2021, 12870 : 197 - 210
  • [27] Fine-Grained Distribution-Dependent Learning Curves
    Bousquet, Olivier
    Hanneke, Steve
    Moran, Shay
    Shafer, Jonathan
    Tolstikhin, Ilya
    THIRTY SIXTH ANNUAL CONFERENCE ON LEARNING THEORY, VOL 195, 2023, 195
  • [28] A Framework for Fine-Grained Synchronization of Dependent GPU Kernels
    Jangda, Abhinav
    Maleki, Saeed
    Dehnavi, Maryam Mehri
    Musuvathi, Madan
    Saarikivi, Olli
    2024 IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, CGO, 2024, : 93 - 105
  • [29] Fault-tolerant scheduling of fine-grained tasks in grid environments
    Wrzesinska, G
    van Nieuwpoort, RV
    Maassen, J
    Kielmann, T
    Bal, HE
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2006, 20 (01): : 103 - 114
  • [30] Performance Evaluation of Priority Queues for Fine-Grained Parallel Tasks on GPUs
    Baudis, Nikolai
    Jacob, Florian
    Andelfinger, Philipp
    2017 IEEE 25TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS), 2017, : 1 - 11