Measurement and Analysis of GPU-Accelerated OpenCL Computations on Intel GPUs

被引:2
|
作者
Cherian, Aaron Thomas [1 ]
Zhou, Keren [1 ]
Grubisic, Dejan [1 ]
Meng, Xiaozhu [1 ]
Mellor-Crummey, John [1 ]
机构
[1] Rice Univ, Dept Comp Sci, Houston, TX 77251 USA
关键词
Supercomputers; High performance computing; Performance analysis; Parallel programming;
D O I
10.1109/ProTools54808.2021.00009
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Graphics Processing Units (GPUs) have become a key technology for accelerating node performance in supercomputers, including the US Department of Energy's forthcoming exascale systems. Since the execution model for GPUs differs from that for conventional processors, applications need to be rewritten to exploit GPU parallelism. Performance tools are needed for such GPU-accelerated systems to help developers assess how well applications offload computation onto GPUs. In this paper, we describe extensions to Rice University's HPCToolkit performance tools that support measurement and analysis of Intel's DPC++ programming model for GPU-accelerated systems atop an implementation of the industry-standard OpenCL framework for heterogeneous parallelism on Intel GPUs. HPCToolkit supports three techniques for performance analysis of programs atop OpenCL on Intel GPUs. First, HPCToolkit supports profiling and tracing of OpenCL kernels. Second, HPCToolkit supports CPU-GPU blame shifting for OpenCL kernel executions-a profiling technique that can identify code that executes on one or more CPUs while GPUs are idle. Third, HPCToolkit supports fine-grained measurement, analysis, and attribution of performance metrics to OpenCL GPU kernels, including instruction counts, execution latency, and SIMD waste. The paper describes these capabilities and then illustrates their application in case studies with two applications that offload computations onto Intel GPUs.
引用
收藏
页码:26 / 35
页数:10
相关论文
共 50 条
  • [1] GPU-accelerated molecular mechanics computations
    Anthopoulos, Athanasios
    Grimstead, Ian
    Brancale, Andrea
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2013, 34 (26) : 2249 - 2260
  • [2] Measurement and analysis of GPU-accelerated applications with HPCToolkit
    Zhou, Keren
    Adhianto, Laksono
    Anderson, Jonathon
    Cherian, Aaron
    Grubisic, Dejan
    Krentel, Mark
    Liu, Yumeng
    Meng, Xiaozhu
    Mellor-Crummey, John
    PARALLEL COMPUTING, 2021, 108
  • [3] GPU-Accelerated Computation for Texture Features using OpenCL Framework
    Saladin, Ahmad M.
    Jiao, Licheng
    Zhang, Xiangrong
    2014 11TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), 2014,
  • [4] GPU-Accelerated Dynamic Functional Connectivity Analysis for Functional MRI Data Using OpenCL
    Akgun, Devrim
    Sakoglu, Uenal
    Mete, Mutlu
    Esquivel, Johnny
    Adinoff, Bryon
    2014 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY (EIT), 2014, : 479 - 484
  • [5] Analysis of OpenCL work-group reduce for Intel GPUs
    Lupescu, Grigore
    Slusanschi, Emil-Ioan
    Tapus, Nicolae
    PROCEEDINGS OF 2016 18TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC), 2016, : 417 - 423
  • [6] GPU-Accelerated Computations for Supersonic Flow Modeling on Hybrid Grids
    Tian, Zhengyu
    Lai, Jianqi
    Yang, Fan
    Li, Hua
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1391 - 1397
  • [7] GPU-Accelerated Static Timing Analysis
    Guo, Zizheng
    Huang, Tsung-Wei
    Lin, Yibo
    2020 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED-DESIGN (ICCAD), 2020,
  • [8] GPU-Accelerated Computations of FDTD-Compatible Green's Function
    Stefanski, Tomasz P.
    Krzyzanowska, Katarzyna
    2013 7TH EUROPEAN CONFERENCE ON ANTENNAS AND PROPAGATION (EUCAP), 2013, : 2643 - 2646
  • [9] GPU-Accelerated Microdosimetry
    Decunha, J.
    Mohan, R.
    MEDICAL PHYSICS, 2022, 49 (06) : E467 - E468
  • [10] GPU-accelerated CellProfiler
    Chakroun, Imen
    Michiels, Nick
    Wuyts, Roel
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 321 - 326