Tracing task-based runtime systems: Feedbacks from the StarPU case

被引:0
|
作者
Denis, Alexandre [1 ,2 ]
Jeannot, Emmanuel [1 ,2 ]
Swartvagher, Philippe [1 ,2 ,3 ]
Thibault, Samuel [1 ,2 ]
机构
[1] Inria Bordeaux Sud Ouest, Talence, France
[2] Univ Bordeaux, LaBRI, Talence, France
[3] Bordeaux INP, Talence, France
来源
关键词
clock synchronization; distributed clocks; high-performance computing; task-based runtime systems; tracing systems; TIME; SYNCHRONIZATION;
D O I
10.1002/cpe.7920
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Given the complexity of current supercomputers and applications, being able to trace application executions to understand their behavior is not a luxury. As constraints, tracing systems have to be as little intrusive as possible in the application code and performances, and be precise enough in the collected data. In this article, we present how works the tracing system used by the task-based runtime system StarPU. We study the different sources of performance overhead coming from the tracing system and how to reduce these overheads. Then, we evaluate the accuracy of distributed traces with different clock synchronization techniques. Finally, we summarize our experiments and conclusions with the lessons we learned to efficiently trace applications, and the list of characteristics each tracing system should feature to be competitive. The reported experiments and implementation details comprise a feedback of integrating into a task-based runtime system state-of-the-art techniques to efficiently and precisely trace application executions. We highlight the points every application developer or end-user should be aware of to seamlessly integrate a tracing system or just trace application executions.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Automatic Parallelization: Executing Sequential Programs on a Task-Based Parallel Runtime
    Alcides Fonseca
    Bruno Cabral
    João Rafael
    Ivo Correia
    International Journal of Parallel Programming, 2016, 44 : 1337 - 1358
  • [22] IRIS Reimagined: Advancements in Intelligent Runtime System for Task-Based Programming
    Miniskar, Narasinga Rao
    Lee, Seyong
    Beau, Johnston
    Young, Aaron
    Monil, Mohammad Alaul Haque
    Valero-Lara, Pedro
    Vetter, Jeffrey S.
    ASYNCHRONOUS MANY-TASK SYSTEMS AND APPLICATIONS, WAMTA 2024, 2024, 14626 : 46 - 58
  • [23] Automatic Parallelization: Executing Sequential Programs on a Task-Based Parallel Runtime
    Fonseca, Alcides
    Cabral, Bruno
    Rafael, Joao
    Correia, Ivo
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2016, 44 (06) : 1337 - 1358
  • [24] Automatic Code Generation and Data Management for an Asynchronous Task-based Runtime
    Baskaran, Muthu
    Pradelle, Benoit
    Meister, Benoit
    Konstantinidis, Athanasios
    Lethin, Richard
    PROCEEDINGS OF ESPT 2016: 5TH WORKSHOP ON EXTREME-SCALE PROGRAMMING TOOLS, 2016, : 34 - 41
  • [25] Adaptive scheduling of collocated applications using a task-based runtime system
    Dokulil, Jiri
    Benkner, Siegfried
    2018 30TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2018), 2018, : 41 - 48
  • [26] Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System
    Sergent, Marc
    Goudin, David
    Thibault, Samuel
    Aumage, Olivier
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, : 318 - 327
  • [27] Performance Measurements Within Asynchronous Task-Based Runtime Systems: A Double White Dwarf Merger as an Application
    Diehl, Patrick
    Marcello, Dominic
    Amini, Parsa
    Kaiser, Hartmut
    Shiber, Sagiv
    Clayton, Geoffrey C.
    Frank, Juhan
    Daiss, Gregor
    Pfluger, Dirk
    Eder, David
    Koniges, Alice
    Huck, Kevin
    COMPUTING IN SCIENCE & ENGINEERING, 2021, 23 (03) : 73 - 81
  • [28] Approximation Proofs of a Fast and Efficient List Scheduling Algorithm for Task-Based Runtime Systems on Multicores and GPUs
    Beaumont, Olivier
    Eyraud-Dubois, Lionel
    Kumar, Suraj
    2017 31ST IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2017, : 768 - 777
  • [29] Task-Based Design and Runtime Support for Multimodal User Interface Distribution
    Clerckx, Tim
    Vandervelpen, Chris
    Coninx, Karin
    ENGINEERING INTERACTIVE SYSTEMS, 2008, 4940 : 89 - +
  • [30] TaskTorrent: a Lightweight Distributed Task-Based Runtime System in C plus
    Cambier, Leopold
    Qian, Yizhou
    Darve, Eric
    PROCEEDINGS OF 2020 IEEE/ACM 3RD ANNUAL PARALLEL APPLICATIONS WORKSHOP, ALTERNATIVES TO MPI+X (PAW-ATM 2020), 2020, : 16 - 26