A Performance Analysis Framework for Optimizing OpenCL Applications on FPGAs

被引:0
|
作者
Wang, Zeke [1 ]
He, Bingsheng [1 ]
Zhang, Wei [2 ]
Jiang, Shunning [1 ,3 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
[2] HKUST, Clear Water Bay, Hong Kong, Peoples R China
[3] Cornell Univ, Ithaca, NY 14853 USA
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Recently, FPGA vendors such as Altera and Xilinx have released OpenCL SDK for programming FPGAs. However, the architecture of FPGA is significantly different from that of CPU/GPU, for which OpenCL is originally designed. Tuning the OpenCL code for good performance on FPGAs is still an open problem, since the existing OpenCL tools and models designed for CPUs/GPUs are not directly applicable to FPGAs. In the paper, we present an FPGA-based performance analysis framework that can shed light on the performance bottleneck and thus guide the code tuning for OpenCL applications on FPGAs. Particularly, we leverage static and dynamic analysis to develop an analytical performance model, which has captured the key architectural features of FPGA abstractions under OpenCL. Then, we provide four programmer-interpretable metrics to quantify the performance potentials of the OpenCL program with input optimization combination for the next optimization step. We evaluate our framework with a number of user cases, and demonstrate that 1) our analytical performance model can accurately predict the performance of OpenCL programs with different optimization combinations on FPGAs, and 2) our tool can be used to effectively guide the code tuning on alleviating the performance bottleneck.
引用
收藏
页码:114 / 125
页数:12
相关论文
共 50 条
  • [1] Evaluating and Optimizing OpenCL Kernels for High Performance Computing with FPGAs
    Zohouri, Hamid Reza
    Maruyama, Naoya
    Smith, Aaron
    Matsuda, Motohiko
    Matsuoka, Satoshi
    [J]. SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2016, : 409 - 420
  • [2] Optimizing OpenCL Kernels and Runtime for DNN Inference on FPGAs
    Chung, Seung-Hun
    Abdelrahman, Tarek S.
    [J]. 2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020), 2020, : 151 - 154
  • [3] Boyi: A Systematic Framework for Automatically Deciding the Right Execution Model of OpenCL Applications on FPGAs
    Jiang, Jiantong
    Wang, Zeke
    Liu, Xue
    Gomez-Luna, Juan
    Guan, Nan
    Deng, Qingxu
    Zhang, Wei
    Mutlu, Onur
    [J]. 2020 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA '20), 2020, : 299 - 309
  • [4] Melia: A MapReduce Framework on OpenCL-Based FPGAs
    Wang, Zeke
    Zhang, Shuhao
    He, Bingsheng
    Zhang, Wei
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (12) : 3547 - 3560
  • [5] Exploring Pipe Implementations using an OpenCL Framework for FPGAs
    Mirian, Vincent
    Chow, Paul
    [J]. 2015 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (FPT), 2015, : 112 - 119
  • [6] An OpenCL Framework for Distributed Apps on a Multidimensional Network of FPGAs
    Lawande, Abhijeet
    George, Alan D.
    Lam, Herman
    [J]. PROCEEDINGS OF 2016 6TH WORKSHOP ON IRREGULAR APPLICATIONS: ARCHITECTURE AND ALGORITHMS (IA3), 2016, : 42 - 49
  • [7] SOFF: An OpenCL High-Level Synthesis Framework for FPGAs
    Jo, Gangwon
    Kim, Heehoon
    Lee, Jeesoo
    Lee, Jaejin
    [J]. 2020 ACM/IEEE 47TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2020), 2020, : 295 - 308
  • [8] FIexCL: A Model of Performance and Power for OpenCL Workloads on FPGAs
    Liang, Yun
    Wang, Shuo
    Zhang, Wei
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2018, 67 (12) : 1750 - 1764
  • [9] A Comprehensive Framework for Synthesizing Stencil Algorithms on FPGAs using OpenCL Model
    Wang, Shuo
    Liang, Yun
    [J]. PROCEEDINGS OF THE 2017 54TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2017,
  • [10] Evaluating Shared Virtual Memory in an OpenCL Framework for Embedded Systems on FPGAs
    Mirian, Vincent
    Chow, Paul
    [J]. 2015 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2015,