Fine-Grained MPI plus OpenMP Plasma Simulations: Communication Overlap with Dependent Tasks

被引:3
|
作者
Richard, Jerome [1 ,2 ]
Latu, Guillaume [1 ]
Bigot, Julien [3 ]
Gautier, Thierry [4 ]
机构
[1] CEA, IRFM, F-13108 St Paul Les Durance, France
[2] Zebrys, Toulouse, France
[3] Univ Paris Saclay, UVSQ, Univ Paris Sud, Maison Simulat,CEA,CNRS, Gif Sur Yvette, France
[4] Univ Lyon, INRIA, CNRS, ENS Lyon,Univ Claude Bernard Lyon 1,LIP, Lyon, France
来源
关键词
Dependent tasks; OpenMP; 4.5; MPI; Many-core;
D O I
10.1007/978-3-030-29400-7_30
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper demonstrates how OpenMP 4.5 tasks can be used to efficiently overlap computations and MPI communications based on a case-study conducted on multi-core and many-core architectures. It focuses on task granularity, dependencies and priorities, and also identifies some limitations of OpenMP. Results on 64 Skylake nodes show that while 64% of the wall-clock time is spent in MPI communications, 60% of the cores are busy in computations, which is a good result. Indeed, the chosen dataset is small enough to be a challenging case in terms of overlap and thus useful to assess worst-case scenarios in future simulations. Two key features were identified: by using task priority we improved the performance by 5.7% (mainly due to an improved overlap), and with recursive tasks we shortened the execution time by 9.7%. We also illustrate the need to have access to tools for task tracing and task visualization. These tools allowed a fine understanding and a performance increase for this task-based OpenMP+MPI code.
引用
收藏
页码:419 / 433
页数:15
相关论文
共 50 条
  • [41] Discretization parameters in fine-grained lattice simulations of linear and branched polymers
    Natarajan, S
    Maranas, JK
    JOURNAL OF CHEMICAL PHYSICS, 2003, 118 (19): : 9053 - 9057
  • [42] Spatial Focus Attention for Fine-Grained Skeleton-Based Action Tasks
    Liu, Kaiyuan
    Li, Yunheng
    Xu, Yuanfeng
    Liu, Shuai
    Liu, Shenglan
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1883 - 1887
  • [43] Processing of fine-grained aluminum foam by spark plasma sintering
    Wen, CE
    Mabuchi, M
    Yamada, Y
    Shimojima, K
    Chino, Y
    Hosokawa, H
    Asahina, T
    JOURNAL OF MATERIALS SCIENCE LETTERS, 2003, 22 (20) : 1407 - 1409
  • [44] AfterOMPT: An OMPT-Based Tool for Fine-Grained Tracing of Tasks and Loops
    Wodiany, Igor
    Drebes, Andi
    Neill, Richard
    Pop, Antoniu
    OPENMP: PORTABLE MULTI-LEVEL PARALLELISM ON MODERN SYSTEMS, 2020, 12295 : 165 - 180
  • [45] Worksharing Tasks: an Efficient Way to Exploit Irregular and Fine-Grained Loop Parallelism
    Maronas, Marcos
    Sala, Kevin
    Mateo, Sergi
    Ayguade, Eduard
    Beltran, Vicenc
    2019 IEEE 26TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC), 2019, : 383 - 394
  • [46] Mixing coarse-grained and fine-grained water in molecular dynamics simulations of a single system
    Riniker, Sereina
    van Gunsteren, Wilfred F.
    JOURNAL OF CHEMICAL PHYSICS, 2012, 137 (04):
  • [47] Fine-grained Parallel Solution for Solving Sparse Triangular Systems on Multicore Platform using OpenMP Interface
    Marrakchi, Sirine
    Jemni, Mohamed
    2017 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2017, : 659 - 666
  • [48] A Fine-grained Analysis of Wireless Powered Communication with Poisson Cluster Process
    Zhou, Siyuan
    Zhao, Jinhang
    Tan, Guoping
    Li, Xujie
    2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
  • [49] TRAM: Optimizing Fine-grained Communication with Topological Routing and Aggregation of Messages
    Wesolowski, Lukasz
    Venkataraman, Ramprasad
    Gupta, Abhishek
    Yeom, Jae-Seung
    Bisset, Keith
    Sun, Yanhua
    Jetley, Pritish
    Quinn, Thomas R.
    Kale, Laxmikant V.
    2014 43RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2014, : 211 - 220
  • [50] Fine-Grained Analysis of Communication Similarity between Real and Proxy Applications
    Aaziz, Omar
    Vaughan, Courtenay
    Cook, Jonathan
    Cook, Jeanine
    Kuehn, Jeffery
    Richards, David
    PROCEEDINGS OF 2019 IEEE/ACM PERFORMANCE MODELING, BENCHMARKING AND SIMULATION OF HIGH PERFORMANCE COMPUTER SYSTEMS (PMBS 2019), 2019, : 93 - 102