Using Dynamic Broadcasts to Improve Task-Based Runtime Performances

被引:6
|
作者
Denis, Alexandre [1 ,2 ]
Jeannot, Emmanuel [1 ,2 ]
Swartvagher, Philippe [1 ,2 ]
Thibault, Samuel [1 ,2 ]
机构
[1] Inria Bordeaux Sud Ouest, F-33405 Talence, France
[2] Univ Bordeaux, LaBRI, F-33405 Talence, France
来源
EURO-PAR 2020: PARALLEL PROCESSING | 2020年 / 12247卷
关键词
Task-based runtime systems; Communications; Collective; Broadcast;
D O I
10.1007/978-3-030-57675-2_28
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Task-based runtimes have emerged in the HPC world to take benefit from the computation power of heterogeneous supercomputers and to achieve scalability. One of the main bottlenecks for scalability is the communication layer. Some task-based algorithms need to send the same data to multiple nodes. To optimize this communication pattern, libraries propose dedicated routines, such as MPI_Bcast. However, MPI_Bcast requirements do not fit well with the constraints of task-based runtime systems: it must be performed simultaneously by all involved nodes, and these must know each other, which is not possible when each node runs a task scheduler not synchronized with others. In this paper, we propose a new approach, called dynamic broadcasts to overcome these constraints. The broadcast communication pattern required by the task-based algorithm is detected automatically, then the broadcasting algorithm relies on active messages and source routing, so that participating nodes do not need to know each other and do not need to synchronize. Receiver receives data the same way as it receives point-to-point communication, without having to know it arrives through a broadcast. We have implemented the algorithm in the STARPU runtime system using the NEWMADELEINE communication library. We performed benchmarks using the CHOLESKY factorization that is known to use broadcasts and observed up to 30% improvement of its total execution time.
引用
收藏
页码:443 / 457
页数:15
相关论文
共 50 条
  • [21] Asynchronous Execution of Python']Python Code on Task-Based Runtime Systems
    Tohid, R.
    Wagle, Bibek
    Shirzad, Shahrzad
    Diehl, Patrick
    Serio, Adrian
    Kheirkhahan, Alireza
    Amini, Parsa
    Williams, Katy
    Isaacs, Kate
    Huck, Kevin
    Brandt, Steven
    Kaiser, Hartmut
    PROCEEDINGS OF 2018 IEEE/ACM 4TH INTERNATIONAL WORKSHOP ON EXTREME SCALE PROGRAMMING MODELS AND MIDDLEWARE (ESPM2 2018), 2018, : 37 - 45
  • [22] Automatic Parallelization: Executing Sequential Programs on a Task-Based Parallel Runtime
    Fonseca, Alcides
    Cabral, Bruno
    Rafael, Joao
    Correia, Ivo
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2016, 44 (06) : 1337 - 1358
  • [23] An On-Line Performance Introspection Framework for Task-Based Runtime Systems
    Aguilar, Xavier
    Jordan, Herbert
    Heller, Thomas
    Hirsch, Alexander
    Fahringer, Thomas
    Laure, Erwin
    COMPUTATIONAL SCIENCE - ICCS 2019, PT I, 2019, 11536 : 238 - 252
  • [24] Automatic Code Generation and Data Management for an Asynchronous Task-based Runtime
    Baskaran, Muthu
    Pradelle, Benoit
    Meister, Benoit
    Konstantinidis, Athanasios
    Lethin, Richard
    PROCEEDINGS OF ESPT 2016: 5TH WORKSHOP ON EXTREME-SCALE PROGRAMMING TOOLS, 2016, : 34 - 41
  • [25] Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System
    Sergent, Marc
    Goudin, David
    Thibault, Samuel
    Aumage, Olivier
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, : 318 - 327
  • [26] Task-Based Design and Runtime Support for Multimodal User Interface Distribution
    Clerckx, Tim
    Vandervelpen, Chris
    Coninx, Karin
    ENGINEERING INTERACTIVE SYSTEMS, 2008, 4940 : 89 - +
  • [27] TaskTorrent: a Lightweight Distributed Task-Based Runtime System in C plus
    Cambier, Leopold
    Qian, Yizhou
    Darve, Eric
    PROCEEDINGS OF 2020 IEEE/ACM 3RD ANNUAL PARALLEL APPLICATIONS WORKSHOP, ALTERNATIVES TO MPI+X (PAW-ATM 2020), 2020, : 16 - 26
  • [28] Dynamic Tracing: Memoization of Task Graphs for Dynamic Task-Based Runtimes
    Lee, Wonchan
    Slaughter, Elliott
    Bauer, Michael
    Treichler, Sean
    Warszawski, Todd
    Garland, Michael
    Aiken, Alex
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE, AND ANALYSIS (SC'18), 2018,
  • [29] Using Data Dependencies to Improve Task-Based Scheduling Strategies on NUMA Architectures
    Virouleau, Philippe
    Broquedis, Francois
    Gautier, Thierry
    Rastello, Fabrice
    EURO-PAR 2016: PARALLEL PROCESSING, 2016, 9833 : 531 - 544
  • [30] Dynamic Guidance for Task-Based Exploratory Learning
    Thomas, James M.
    Young, R. Michael
    ARTIFICIAL INTELLIGENCE IN EDUCATION, 2011, 6738 : 369 - 376