Tracing task-based runtime systems: Feedbacks from the StarPU case

被引:0
|
作者
Denis, Alexandre [1 ,2 ]
Jeannot, Emmanuel [1 ,2 ]
Swartvagher, Philippe [1 ,2 ,3 ]
Thibault, Samuel [1 ,2 ]
机构
[1] Inria Bordeaux Sud Ouest, Talence, France
[2] Univ Bordeaux, LaBRI, Talence, France
[3] Bordeaux INP, Talence, France
来源
关键词
clock synchronization; distributed clocks; high-performance computing; task-based runtime systems; tracing systems; TIME; SYNCHRONIZATION;
D O I
10.1002/cpe.7920
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Given the complexity of current supercomputers and applications, being able to trace application executions to understand their behavior is not a luxury. As constraints, tracing systems have to be as little intrusive as possible in the application code and performances, and be precise enough in the collected data. In this article, we present how works the tracing system used by the task-based runtime system StarPU. We study the different sources of performance overhead coming from the tracing system and how to reduce these overheads. Then, we evaluate the accuracy of distributed traces with different clock synchronization techniques. Finally, we summarize our experiments and conclusions with the lessons we learned to efficiently trace applications, and the list of characteristics each tracing system should feature to be competitive. The reported experiments and implementation details comprise a feedback of integrating into a task-based runtime system state-of-the-art techniques to efficiently and precisely trace application executions. We highlight the points every application developer or end-user should be aware of to seamlessly integrate a tracing system or just trace application executions.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Mitigating the NUMA effect on task-based runtime systems
    Maronas, Marcos
    Navarro, Antoni
    Ayguade, Eduard
    Beltran, Vicenc
    [J]. JOURNAL OF SUPERCOMPUTING, 2023, 79 (13): : 14287 - 14312
  • [2] Mitigating the NUMA effect on task-based runtime systems
    Marcos Maroñas
    Antoni Navarro
    Eduard Ayguadé
    Vicenç Beltran
    [J]. The Journal of Supercomputing, 2023, 79 : 14287 - 14312
  • [3] Fast approximation algorithms for task-based runtime systems
    Beaumont, Olivier
    Eyraud-Dubois, Lionel
    Kumar, Suraj
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (17):
  • [4] Asynchronous Execution of Python']Python Code on Task-Based Runtime Systems
    Tohid, R.
    Wagle, Bibek
    Shirzad, Shahrzad
    Diehl, Patrick
    Serio, Adrian
    Kheirkhahan, Alireza
    Amini, Parsa
    Williams, Katy
    Isaacs, Kate
    Huck, Kevin
    Brandt, Steven
    Kaiser, Hartmut
    [J]. PROCEEDINGS OF 2018 IEEE/ACM 4TH INTERNATIONAL WORKSHOP ON EXTREME SCALE PROGRAMMING MODELS AND MIDDLEWARE (ESPM2 2018), 2018, : 37 - 45
  • [5] Assembly Operations for Multicore Architectures Using Task-Based Runtime Systems
    Genet, Damien
    Guermouche, Abdou
    Bosilca, George
    [J]. EURO-PAR 2014: PARALLEL PROCESSING WORKSHOPS, PT II, 2014, 8806 : 338 - 350
  • [6] An On-Line Performance Introspection Framework for Task-Based Runtime Systems
    Aguilar, Xavier
    Jordan, Herbert
    Heller, Thomas
    Hirsch, Alexander
    Fahringer, Thomas
    Laure, Erwin
    [J]. COMPUTATIONAL SCIENCE - ICCS 2019, PT I, 2019, 11536 : 238 - 252
  • [7] A Hardware Runtime for Task-Based Programming Models
    Tan, Xubin
    Bosch, Jaume
    Alvarez, Carlos
    Jimenez-Gonzalez, Daniel
    Ayguade, Eduard
    Valero, Mateo
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (09) : 1932 - 1946
  • [8] Increasing the degree of parallelism using speculative execution in task-based runtime systems
    Bramas, Berenger
    [J]. PEERJ COMPUTER SCIENCE, 2019, 2019 (03)
  • [9] Implementing the Broadcast Operation in a Distributed Task-based Runtime
    Ceccato, Rodrigo
    Yviquel, Herve
    Pereira, Marcio
    Souza, Alan
    Araujo, Guido
    [J]. 2022 IEEE 34TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING WORKSHOPS (SBAC-PADW 2022), 2022, : 25 - 32
  • [10] Flexible Data Redistribution in a Task-Based Runtime System
    Cao, Qinglei
    Bosilca, George
    Wu, Wei
    Zhong, Dong
    Bouteiller, Aurelien
    Dongarra, Jack
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2020), 2020, : 221 - 225