Extending High-Level Synthesis with High-Performance Computing Performance Visualization

被引:0
|
作者
Huthmann, Jens [1 ]
Podobas, Artur [2 ]
Sommer, Lukas [3 ]
Koch, Andreas [3 ]
Sano, Kentaro [1 ]
机构
[1] RIKEN, Ctr Computat Sci, Kobe, Hyogo, Japan
[2] KTH, Royal Inst Technol, Stockholm, Sweden
[3] Tech Univ Darmstadt, Embedded Syst & Applicat Grp, Darmstadt, Germany
关键词
Visualization; FPGA; HLS; High-Level Synthesis; High-Performance Computing; Performance Optimization;
D O I
10.1109/CLUSTER49012.2020.00047
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The recent maturity in High-Level Synthesis (HLS) has renewed the interest of using Field-Programmable Gate-Arrays (FPGAs) to accelerate High-Performance Computing (HPC) applications. Today, several studies have shown performance- and power-benefits of using FPGAs compared to existing approaches for a number of application kernels with ample room for improvements. Unfortunately, modern HLS tools offer little support to gain clarity and insight regarding why a certain application behaves as it does on the FPGA, and most experts rely on intuition or abstract performance models. In this work, we hypothesize that existing profiling and visualization tools used in the HPC domain are also usable for understanding performance on FPGAs. We extend an existing HLS tool-chain to support Paraver - a state-of-the-art visualization and profiling tool well-known in HPC. We describe how each of the events and states are collected, and empirically quantify its hardware overhead. Finally, we practically apply our contribution to two different applications, demonstrating how the tool can be used to provide unique insights into application execution and how it can be used to guide optimizations.
引用
收藏
页码:371 / 380
页数:10
相关论文
共 50 条
  • [1] Transformations of High-Level Synthesis Codes for High-Performance Computing
    de Fine Licht, Johannes
    Besta, Maciej
    Meierhans, Simon
    Hoefler, Torsten
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (05) : 1014 - 1029
  • [2] Performance Analysis with High-Level Languages for High-Performance Reconfigurable Computing
    Curreri, John
    Koehler, Seth
    Holland, Brian
    George, Alan D.
    [J]. PROCEEDINGS OF THE SIXTEENTH IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, 2008, : 23 - 30
  • [3] Evaluating High-Level Design Strategies on FPGAs for High-Performance Computing
    Podobas, Artur
    Zohouri, Hamid Reza
    Maruyama, Naoya
    Matsuoka, Satoshi
    [J]. 2017 27TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2017,
  • [4] Evaluating High-Level Design Strategies on FPGAs for High-Performance Computing
    Podobas, Artur
    Zohouri, Hamid Reza
    Maruyama, Naoya
    Matsuoka, Satoshi
    [J]. 2017 27TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2017,
  • [5] Efficient FPGA Implementation of OpenCL High-Performance Computing Applications via High-Level Synthesis
    Bin Muslim, Fahad
    Ma, Liang
    Roozmeh, Mehdi
    Lavagno, Luciano
    [J]. IEEE ACCESS, 2017, 5 : 2747 - 2762
  • [6] Effective High-Level Synthesis for High-Performance Graph Processing
    Tang, Jiawu
    Zheng, Long
    Liao, Xiaofei
    Jin, Hai
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2021, 58 (03): : 467 - 478
  • [7] Directive-Based, High-Level Programming and Optimizations for High-Performance Computing with FPGAs
    Lambert, Jacob
    Lee, Seyong
    Kim, Jungwon
    Vetter, Jeffrey S.
    Malony, Allen D.
    [J]. INTERNATIONAL CONFERENCE ON SUPERCOMPUTING (ICS 2018), 2018, : 160 - 171
  • [8] Data Analysis and Visualization in High-Performance Computing
    Szczepariski, Amy F.
    Huang, Jian
    Baer, Troy
    Mack, Yashema C.
    Ahern, Sean
    [J]. COMPUTER, 2013, 46 (05) : 84 - 92
  • [9] From High-Level Specification to High-Performance Code
    Franchetti, Franz
    Moura, Jose M. F.
    Padua, David A.
    Dongarra, Jack
    [J]. PROCEEDINGS OF THE IEEE, 2018, 106 (11) : 1875 - 1878
  • [10] Resolve: Generation of High-Performance Sorting Architectures from High-Level Synthesis
    Matai, Janarbek
    Richmond, Dustin
    Lee, Dajung
    Blair, Zac
    Wu, Qiongzhi
    Abazari, Amin
    Kastner, Ryan
    [J]. PROCEEDINGS OF THE 2016 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA'16), 2016, : 195 - 204