In-FPGA Instrumentation Framework for OpenCL-Based Designs

被引:1
|
作者
Bensalem, Hachem [1 ]
Blaquiere, Yves [1 ]
Savaria, Yvon [2 ]
机构
[1] Ecole Technol Super, Dept Elect Engn, Montreal, PQ H3C 1K3, Canada
[2] Polytech Montreal, Dept Elect Engn, Montreal, PQ H3T 1J4, Canada
来源
IEEE ACCESS | 2020年 / 8卷 / 08期
基金
加拿大自然科学与工程研究理事会;
关键词
Field programmable gate arrays; Instruments; Tools; Kernel; Debugging; Hardware; Benchmark testing; OpenCL; FPGA; instrumentation; high-performance reconfigurable computing; HLS; timing performance;
D O I
10.1109/ACCESS.2020.3040081
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The productivity achieved when developing applications on high-performance reconfigurable heterogeneous computing (HPRHC) systems is increased by using the Open Computing Language (OpenCL). However, the hardware produced by OpenCL compilers in field-programmable gate arrays (FPGAs) can result in severe performance bottlenecks that are challenging to solve. The problem is compounded by the fact that the generated netlist details are disorganized, making them mostly unreadable and only partially visible to designers. This paper proposes an in-FPGA instrumentation method and a new framework for extracting the FPGA-cycle-accurate timing performances of OpenCL-based designs. The results clearly show that the chosen execution model for OpenCL-based designs strongly affects the timing performance when it is not properly implemented. Our framework is implemented on an HPRHC platform that contains a CPU and two Arria10 FPGAs, and it is evaluated with a wide variety of benchmarks with different complexities. After testing on the reported benchmarks, the average logic overhead for one inserted instrument is 0.2 % of the total amount of adaptive look-up tables (ALUTs) and 0.1 % of the total registers in an FPGA. This resource utilization is between 1.5 and six times lower than those reported in the best previously published works. The scalability of the framework is also evaluated by inserting up to 50 instruments. The experimental results show that the average logic utilization per instrument is 0.19 % of the ALUTs and 0.17 % of the registers in the FPGA when 50 instruments are inserted.
引用
收藏
页码:212979 / 212994
页数:16
相关论文
共 50 条
  • [1] Toward In-System Monitoring of OpenCL-Based Designs on FPGA
    Bensalem, Hachem
    Blaquiere, Yves
    Savaria, Yvon
    [J]. 2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
  • [2] Research on Parallel Architecture of OpenCL-Based FPGA
    Zhang, Yi
    Cai, Ye
    Luo, Qiuming
    [J]. SMART COMPUTING AND COMMUNICATION, SMARTCOM 2017, 2018, 10699 : 27 - 38
  • [3] Melia: A MapReduce Framework on OpenCL-Based FPGAs
    Wang, Zeke
    Zhang, Shuhao
    He, Bingsheng
    Zhang, Wei
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (12) : 3547 - 3560
  • [4] Evaluation of an OpenCL-Based FPGA Platform for Particle Filter
    Tatsumi, Shunsuke
    Hariyama, Masanori
    Ikoma, Norikazu
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2016, 20 (05) : 743 - 754
  • [5] OpenCL-based design of an FPGA accelerator for quantum annealing simulation
    Hasitha Muthumala Waidyasooriya
    Masanori Hariyama
    Masamichi J. Miyama
    Masayuki Ohzeki
    [J]. The Journal of Supercomputing, 2019, 75 : 5019 - 5039
  • [6] OpenCL-based design of an FPGA accelerator for quantum annealing simulation
    Waidyasooriya, Hasitha Muthumala
    Hariyama, Masanori
    Miyama, Masamichi J.
    Ohzeki, Masayuki
    [J]. JOURNAL OF SUPERCOMPUTING, 2019, 75 (08): : 5019 - 5039
  • [7] An OpenCL-Based FPGA Accelerator for Compressed YOLOv2
    Yang, Anrong
    Li, Yuanhui
    Shu, Hongqiao
    Deng, Jianlin
    Ma, Chuanzhao
    Li, Zheng
    Wang, Qigang
    [J]. 2019 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2019), 2019, : 235 - 238
  • [8] A Scalable OpenCL-Based FPGA Accelerator For YOLOv2
    Xu, Ke
    Wang, Xiaoyun
    Wang, Dong
    [J]. 2019 27TH IEEE ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2019, : 317 - 317
  • [9] An OpenCL-Based FPGA Accelerator for Faster R-CNN
    An, Jianjing
    Zhang, Dezheng
    Xu, Ke
    Wang, Dong
    [J]. ENTROPY, 2022, 24 (10)
  • [10] An OpenCL-Based Hybrid CNN-RNN Inference Accelerator On FPGA
    Sun, Yunfei
    Liu, Brian
    Xu, Xianchao
    [J]. 2019 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2019), 2019, : 283 - 286