Flexible Data Redistribution in a Task-Based Runtime System

被引:3
|
作者
Cao, Qinglei [1 ]
Bosilca, George [1 ]
Wu, Wei [2 ]
Zhong, Dong [1 ]
Bouteiller, Aurelien [1 ]
Dongarra, Jack [1 ,3 ]
机构
[1] Univ Tennessee, Knoxville, TN 37996 USA
[2] Los Alamos Natl Lab, Washington, DC USA
[3] Univ Manchester, Manchester, Lancs, England
关键词
Data redistribution; Data size; Task-based runtime system; High-performance computing; EFFICIENT DATA REDISTRIBUTION; ALGORITHMS;
D O I
10.1109/CLUSTER49012.2020.00032
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data redistribution aims to reshuffle data to optimize some objective for an algorithm. The objective can be multi-dimensional, such as improving computational load balance or decreasing communication volume or cost, with the ultimate goal to increase the efficiency and therefore decrease the time-to-solution for the algorithm. The classical redistribution problem focuses on optimally scheduling communications when reshuffling data between two regular, usually block-cyclic, data distributions. Recently, task-based runtime systems have gained popularity as a potential candidate to address the programming complexity on the way to exascale. In addition to an increase in portability against complex hardware and software systems, task-based runtime systems have the potential to be able to more easily cope with less-regular data distribution, providing a more balanced computational load during the lifetime of the execution. In this scenario, it becomes paramount to develop a general redistribution algorithm for task-based runtime systems, which could support all types of regular and irregular data distributions. In this paper, we detail a flexible redistribution algorithm, capable of dealing with redistribution problems without constraints of data distribution and data size and implement it in a task-based runtime system, PaRSEC. Performance results show great capability compared to ScaLAPACK, and applications highlight an increased efficiency with little overhead in terms of data distribution and data size.
引用
收藏
页码:221 / 225
页数:5
相关论文
共 50 条
  • [1] Deploying a Task-based Runtime System on Raspberry Pi Clusters
    Gupta, Nikunj
    Brandt, Steve R.
    Wagle, Bibek
    Wu, Nanmiao
    Kheirkhahan, Alireza
    Diehl, Patrick
    Baumann, Felix W.
    Kaiser, Hartmut
    [J]. PROCEEDINGS OF 2020 IEEE/ACM FIFTH INTERNATIONAL WORKSHOP ON EXTREME SCALE PROGRAMMING MODELS AND MIDDLEWARE (ESPM2 2020), 2020, : 11 - 20
  • [2] Evaluating Dynamic Task Scheduling in a Task-Based Runtime System for Heterogeneous Architectures
    Becker, Thomas
    Karl, Wolfgang
    Schuele, Tobias
    [J]. ARCHITECTURE OF COMPUTING SYSTEMS - ARCS 2019, 2019, 11479 : 142 - 155
  • [3] Automatic Code Generation and Data Management for an Asynchronous Task-based Runtime
    Baskaran, Muthu
    Pradelle, Benoit
    Meister, Benoit
    Konstantinidis, Athanasios
    Lethin, Richard
    [J]. PROCEEDINGS OF ESPT 2016: 5TH WORKSHOP ON EXTREME-SCALE PROGRAMMING TOOLS, 2016, : 34 - 41
  • [4] IRIS Reimagined: Advancements in Intelligent Runtime System for Task-Based Programming
    Miniskar, Narasinga Rao
    Lee, Seyong
    Beau, Johnston
    Young, Aaron
    Monil, Mohammad Alaul Haque
    Valero-Lara, Pedro
    Vetter, Jeffrey S.
    [J]. ASYNCHRONOUS MANY-TASK SYSTEMS AND APPLICATIONS, WAMTA 2024, 2024, 14626 : 46 - 58
  • [5] A Hardware Runtime for Task-Based Programming Models
    Tan, Xubin
    Bosch, Jaume
    Alvarez, Carlos
    Jimenez-Gonzalez, Daniel
    Ayguade, Eduard
    Valero, Mateo
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (09) : 1932 - 1946
  • [6] Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System
    Sergent, Marc
    Goudin, David
    Thibault, Samuel
    Aumage, Olivier
    [J]. 2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, : 318 - 327
  • [7] Adaptive scheduling of collocated applications using a task-based runtime system
    Dokulil, Jiri
    Benkner, Siegfried
    [J]. 2018 30TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2018), 2018, : 41 - 48
  • [8] TaskTorrent: a Lightweight Distributed Task-Based Runtime System in C plus
    Cambier, Leopold
    Qian, Yizhou
    Darve, Eric
    [J]. PROCEEDINGS OF 2020 IEEE/ACM 3RD ANNUAL PARALLEL APPLICATIONS WORKSHOP, ALTERNATIVES TO MPI+X (PAW-ATM 2020), 2020, : 16 - 26
  • [9] An Optimized Task-Based Runtime System for Resource-Constrained Parallel Accelerators
    Cesarini, Daniele
    Marongiu, Andrea
    Benini, Luca
    [J]. PROCEEDINGS OF THE 2016 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2016, : 1261 - 1266
  • [10] Mitigating the NUMA effect on task-based runtime systems
    Maronas, Marcos
    Navarro, Antoni
    Ayguade, Eduard
    Beltran, Vicenc
    [J]. JOURNAL OF SUPERCOMPUTING, 2023, 79 (13): : 14287 - 14312