Efficient Utilization of FPGA Multipliers for Convolutional Neural Networks

被引:0
|
作者
Boulasikis, M. A. [1 ]
Birbas, M. [1 ]
Tsafas, N. [1 ]
Kanakaris, N. [1 ]
机构
[1] Univ Patras, Dept Elect & Comp Engn, Patras, Greece
关键词
Deep Learning; Hardware Optimizations; Embedded Systems; Computer Vision; Quantization; System Level Design;
D O I
10.1109/MOCAST52088.2021.9493366
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Recent advances in the field of computer vision create the demand for larger and more complex architectures for Deep Convolutional Neural Networks (CNNs). As a result, computation time and memory usage become the main bottleneck in applied deep network inference, particularly in Embedded Systems implementations. Parameter quantization is often employed in these cases to minimize the detrimental effect of the aforementioned bottlenecks. In this paper, low level hardware optimizations on fixed point convolution are considered. Emphasis is given on the utilization of Digital Signal Processing Units (DSPs) as dual multipliers and on practical considerations. Three-by-three convolution kernels are formulated based on this research and are measured as case studies. The experiments show that the proper exploitation of dual multipliers can offer significant benefits to the system.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Efficient Implementation of Convolutional Neural Networks on FPGA
    Hadnagy, A.
    Feher, B.
    Kovacshazy, T.
    [J]. 2018 19TH INTERNATIONAL CARPATHIAN CONTROL CONFERENCE (ICCC), 2018, : 359 - 364
  • [2] Fast and Efficient Implementation of Convolutional Neural Networks on FPGA
    Podili, Abhinav
    Zhang, Chi
    Prasanna, Viktor
    [J]. 2017 IEEE 28TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP), 2017, : 11 - 18
  • [3] Efficient Design of Pruned Convolutional Neural Networks on FPGA
    Vestias, Mario
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2021, 93 (05): : 531 - 544
  • [4] Efficient Design of Pruned Convolutional Neural Networks on FPGA
    Mário Véstias
    [J]. Journal of Signal Processing Systems, 2021, 93 : 531 - 544
  • [5] Efficient Mitchell's Approximate Log Multipliers for Convolutional Neural Networks
    Kim, Min Soo
    Del Barrio, Alberto A.
    Oliveira, Leonardo Tavares
    Hermida, Roman
    Bagherzadeh, Nader
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2019, 68 (05) : 660 - 675
  • [6] An Efficient FPGA-Based Architecture for Convolutional Neural Networks
    Hwang, Wen-Jyi
    Jhang, Yun-Jie
    Tai, Tsung-Ming
    [J]. 2017 40TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2017, : 582 - 588
  • [7] An Efficient Hardware Accelerator for Block Sparse Convolutional Neural Networks on FPGA
    Yin, Xiaodi
    Wu, Zhipeng
    Li, Dejian
    Shen, Chongfei
    Liu, Yu
    [J]. IEEE EMBEDDED SYSTEMS LETTERS, 2024, 16 (02) : 158 - 161
  • [8] Accelerator Design with Effective Resource Utilization for Binary Convolutional Neural Networks on an FPGA
    Kim, Sunwoong
    Rutenbar, Rob A.
    [J]. PROCEEDINGS 26TH IEEE ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2018), 2018, : 218 - 218
  • [9] Efficient Inference of Large-Scale and Lightweight Convolutional Neural Networks on FPGA
    Wu, Xiao
    Ma, Yufei
    Wang, Zhongfeng
    [J]. 2020 IEEE 33RD INTERNATIONAL SYSTEM-ON-CHIP CONFERENCE (SOCC), 2020, : 168 - 173
  • [10] An Efficient Convolutional Neural Network Accelerator on FPGA
    Si, Junye
    Jiang, Jianfei
    Wang, Qin
    Huang, Jia
    [J]. 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SOLID-STATE AND INTEGRATED CIRCUIT TECHNOLOGY (ICSICT), 2018, : 1392 - 1394