A tool for top-down performance analysis of GPU-accelerated applications

被引:6
|
作者
Zhou, Keren [1 ]
Krentel, Mark [1 ]
Mellor-Crummey, John [1 ]
机构
[1] Rice Univ, Dept Comp Sci, Houston, TX 77251 USA
关键词
GPU; Profiler; Wait-free data structure; Calling context tree; HPC; Roofline;
D O I
10.1145/3332466.3374534
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
To support performance measurement and analysis of GPU-accelerated applications, we extended the HPCToolkit performance tools with several novel features. To support efficient monitoring of accelerated applications, HPCToolkit employs a new wait-free data structure to coordinate measurement and attribution between each application thread and a GPU monitor thread. To help developers understand the performance of accelerated applications, HPCToolkit attributes metrics to heterogeneous calling contexts that span both CPUs and GPUs. To support fine-grain analysis and tuning of GPU-accelerated code, HPCToolkit collects PC samples of both CPU and GPU activity to derive and attribute metrics at all levels in a heterogeneous calling context.
引用
收藏
页码:415 / 416
页数:2
相关论文
共 50 条
  • [31] GPU-accelerated image alignment for object detection in industrial applications
    Le, Trung-Son
    Lin, Chyi-Yeu
    2017 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND INTELLIGENT SYSTEMS (ARIS), 2017, : 13 - 16
  • [32] GPU-accelerated shape simplification for mechanical-based applications
    Hjelmervik, Jon
    Leon, Jean-Claude
    IEEE INTERNATIONAL CONFERENCE ON SHAPE MODELING AND APPLICATIONS 2007, PROCEEDINGS, 2007, : 91 - +
  • [33] Low Overhead and Context Sensitive Profiling of GPU-accelerated Applications
    Zhou, Keren
    Anderson, Jonathon
    Meng, Xiaozhu
    Mellor-Crummey, John
    PROCEEDINGS OF THE 36TH ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ICS 2022, 2022,
  • [34] TOP-DOWN SEMANTIC ANALYSIS
    ADORNI, G
    BOCCALATTE, A
    DIMANZO, M
    COMPUTER JOURNAL, 1984, 27 (03): : 233 - 237
  • [35] A top-down analysis for reconstruction
    Guilliot, Nicolas
    LINGUA, 2006, 116 (11) : 1888 - 1914
  • [36] A GPU-ACCELERATED MULTIPHASE COMPUTATIONAL TOOL FOR ASTEROID FRAGMENTATION/PULVERIZATION SIMULATION
    Zimmerman, Ben J.
    Wie, Bong
    SPACEFLIGHT MECHANICS 2016, PTS I-IV, 2016, 158 : 3575 - 3591
  • [37] Cactus: Top-Down GPU-Compute Benchmarking using Real-Life Applications
    Naderan-Tahan, Mahmood
    Eeckhout, Lieven
    2021 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC 2021), 2021, : 176 - 188
  • [38] GPU-accelerated computational tool for studying the effectiveness of asteroid disruption techniques
    Zimmerman, Ben J.
    Wie, Bong
    ACTA ASTRONAUTICA, 2016, 127 : 644 - 654
  • [39] GPU-accelerated Path-based Timing Analysis
    Guo, Guannan
    Huang, Tsung-Wei
    Lin, Yibo
    Wong, Martin
    2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 721 - 726
  • [40] Highly accurate GPU-accelerated pKa prediction tool arrives in amber
    Harris, Robert
    Shen, Jana
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 257