A Performance Analysis Framework for Optimizing OpenCL Applications on FPGAs

被引:0
|
作者
Wang, Zeke [1 ]
He, Bingsheng [1 ]
Zhang, Wei [2 ]
Jiang, Shunning [1 ,3 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
[2] HKUST, Clear Water Bay, Hong Kong, Peoples R China
[3] Cornell Univ, Ithaca, NY 14853 USA
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Recently, FPGA vendors such as Altera and Xilinx have released OpenCL SDK for programming FPGAs. However, the architecture of FPGA is significantly different from that of CPU/GPU, for which OpenCL is originally designed. Tuning the OpenCL code for good performance on FPGAs is still an open problem, since the existing OpenCL tools and models designed for CPUs/GPUs are not directly applicable to FPGAs. In the paper, we present an FPGA-based performance analysis framework that can shed light on the performance bottleneck and thus guide the code tuning for OpenCL applications on FPGAs. Particularly, we leverage static and dynamic analysis to develop an analytical performance model, which has captured the key architectural features of FPGA abstractions under OpenCL. Then, we provide four programmer-interpretable metrics to quantify the performance potentials of the OpenCL program with input optimization combination for the next optimization step. We evaluate our framework with a number of user cases, and demonstrate that 1) our analytical performance model can accurately predict the performance of OpenCL programs with different optimization combinations on FPGAs, and 2) our tool can be used to effectively guide the code tuning on alleviating the performance bottleneck.
引用
收藏
页码:114 / 125
页数:12
相关论文
共 50 条
  • [41] Optimization of Low-Density Parity Check decoder performance for OpenCL designs synthesized to FPGAs
    Maier, Andrew J.
    Cockburn, Bruce F.
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2017, 107 : 134 - 145
  • [42] High Performance High-Precision Floating-Point Operations on FPGAs using OpenCL
    Nakasato, Naohito
    Daisaka, Hiroshi
    Ishikawa, Tadashi
    [J]. 2018 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT 2018), 2018, : 265 - 268
  • [43] Combined Spatial and Temporal Blocking for High-Performance Stencil Computation on FPGAs Using OpenCL
    Zohouri, Hamid Reza
    Podobas, Artur
    Matsuoka, Satoshi
    [J]. PROCEEDINGS OF THE 2018 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA'18), 2018, : 153 - 162
  • [44] Pulsar search acceleration using FPGAs and OpenCL templates
    Julian Oppermann
    Mitchell B. Mickaliger
    Oliver Sinnen
    [J]. Experimental Astronomy, 2023, 56 : 239 - 266
  • [45] Developing Dynamic Profiling and Debugging Support in OpenCL for FPGAs
    Verma, Anshuman
    Zhou, Huiyang
    Booth, Skip
    King, Robbie
    Coole, James
    Keep, Andy
    Marshall, John
    Feng, Wu-chun
    [J]. PROCEEDINGS OF THE 2017 54TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2017,
  • [46] A Case for Work-stealing on FPGAs with OpenCL Atomics
    Ramanathan, Nadesh
    Wickerson, John
    Winterstein, Felix
    Constantinides, George A.
    [J]. PROCEEDINGS OF THE 2016 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA'16), 2016, : 48 - 53
  • [47] Workload distribution and balancing in FPGAs and CPUs with OpenCL and TBB
    Asenjo, Rafael
    Navarro, Angeles
    Rodriguez, Andres
    Nunez-Yanez, Jose
    [J]. PARALLEL COMPUTING: ON THE ROAD TO EXASCALE, 2016, 27 : 543 - 551
  • [48] Pulsar search acceleration using FPGAs and OpenCL templates
    Oppermann, Julian
    Mickaliger, Mitchell B. B.
    Sinnen, Oliver
    [J]. EXPERIMENTAL ASTRONOMY, 2023, 56 (01) : 239 - 266
  • [49] Hardware Thread Reordering to Boost OpenCL Throughput on FPGAs
    Momeni, Amir
    Tabkhi, Hamed
    Schirner, Gunar
    Kaeli, David
    [J]. PROCEEDINGS OF THE 34TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2016, : 257 - 264
  • [50] Relational Query Processing on OpenCL-based FPGAs
    Wang, Zeke
    Paul, Johns
    Ntu, Hui Yan Cheah
    He, Bingsheng
    Zhang, Wei
    [J]. 2016 26TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2016,