Efficient Design of Pruned Convolutional Neural Networks on FPGA

Author
Mário Véstias
Affiliation
[1] Instituto Politécnico de Lisboa, INESC
Keywords
Deep learning; Convolutional neural network; FPGA; Block pruning; Edge computing;
Abstract
Convolutional Neural Networks (CNNs) have improved several computer vision applications, such as object detection and classification, compared to other machine learning algorithms. Running these models on edge computing devices close to the data sources is attracting the attention of the community, since it avoids high-latency communication of private data for cloud processing and permits real-time decisions, turning these systems into smart embedded devices. Running these models is computationally very demanding and requires a large amount of memory, both of which are scarce in edge devices compared to a cloud center. In this paper, we propose an architecture for the inference of pruned convolutional neural networks on FPGAs of any density. A configurable block pruning method is proposed together with an architecture that supports the efficient execution of pruned networks. Pruning and batching are also studied together to determine how they influence each other. With the proposed architecture, we run the inference of a CNN with an average performance of 322 GOPs for 8-bit data on an XC7Z020 FPGA. The proposed architecture running AlexNet processes 240 images/s on a ZYNQ7020 and 775 images/s on a ZYNQ7045 with only 1.2% accuracy degradation.
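The abstract does not detail the configurable block pruning method, so the following is only a generic sketch of the idea, not the paper's algorithm. It assumes magnitude-based selection (the blocks with the smallest L1 norm are zeroed) over a flat weight list; the function name `block_prune` and the parameters `block_size` and `prune_fraction` are hypothetical.

```python
def block_prune(weights, block_size, prune_fraction):
    """Zero the lowest-magnitude blocks of a flat weight list.

    Hypothetical helper for illustration only; the selection rule
    (smallest L1 norm first) is an assumption, not the paper's method.
    Returns the pruned weights and the indices of the blocks kept.
    """
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    if not 0.0 <= prune_fraction <= 1.0:
        raise ValueError("prune_fraction must be in [0, 1]")

    # Split the flat weight list into consecutive blocks.
    blocks = [weights[i:i + block_size]
              for i in range(0, len(weights), block_size)]

    # Rank blocks by L1 norm (sum of absolute values), smallest first.
    order = sorted(range(len(blocks)),
                   key=lambda b: sum(abs(w) for w in blocks[b]))
    num_pruned = int(len(blocks) * prune_fraction)
    pruned = set(order[:num_pruned])

    # Rebuild the weight list, replacing pruned blocks with zeros.
    result = []
    for b, block in enumerate(blocks):
        result.extend([0.0] * len(block) if b in pruned else block)
    kept = [b for b in range(len(blocks)) if b not in pruned]
    return result, kept


weights = [0.9, -0.8, 0.01, 0.02, 0.5, 0.4, -0.03, 0.0]
pruned_weights, kept_blocks = block_prune(weights, block_size=2, prune_fraction=0.5)
print(pruned_weights)
print(kept_blocks)
```

Pruning whole blocks rather than individual weights keeps the sparsity regular: an accelerator only needs the list of kept block indices to skip zeroed blocks, which is the usual motivation for block-level granularity in hardware.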
Pages: 531-544
Page count: 13
Related papers
50 in total
  • [1] Efficient Design of Pruned Convolutional Neural Networks on FPGA
    Vestias, Mario
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2021, 93 (05): 531-544
  • [2] Efficient Implementation of Convolutional Neural Networks on FPGA
    Hadnagy, A.
    Feher, B.
    Kovacshazy, T.
    [J]. 2018 19TH INTERNATIONAL CARPATHIAN CONTROL CONFERENCE (ICCC), 2018: 359-364
  • [3] Efficient Utilization of FPGA Multipliers for Convolutional Neural Networks
    Boulasikis, M. A.
    Birbas, M.
    Tsafas, N.
    Kanakaris, N.
    [J]. 2021 10TH INTERNATIONAL CONFERENCE ON MODERN CIRCUITS AND SYSTEMS TECHNOLOGIES (MOCAST), 2021
  • [4] Fast and Efficient Implementation of Convolutional Neural Networks on FPGA
    Podili, Abhinav
    Zhang, Chi
    Prasanna, Viktor
    [J]. 2017 IEEE 28TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP), 2017: 11-18
  • [5] An Efficient FPGA-Based Architecture for Convolutional Neural Networks
    Hwang, Wen-Jyi
    Jhang, Yun-Jie
    Tai, Tsung-Ming
    [J]. 2017 40TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2017: 582-588
  • [6] Reliability evaluation of FPGA based pruned neural networks
    Gao, Zhen
    Yao, Yi
    Wei, Xiaohui
    Yan, Tong
    Zeng, Shulin
    Ge, Guangjun
    Wang, Yu
    Ullah, Anees
    Reviriego, Pedro
    [J]. MICROELECTRONICS RELIABILITY, 2022, 130
  • [7] Efficient Design of Low Bitwidth Convolutional Neural Networks on FPGA with Optimized Dot Product Units
    Vestias, Mario
    Duarte, Rui P.
    de Sousa, Jose T.
    Neto, Horacio
    [J]. ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2023, 16 (01)
  • [8] Design of Convolutional Neural Networks Hardware Acceleration Based on FPGA
    Qin Huabiao
    Cao Qinping
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (11): 2599-2605
  • [9] Design Space Exploration of FPGA Accelerators for Convolutional Neural Networks
    Rahman, Atul
    Oh, Sangyun
    Lee, Jongeun
    Choi, Kiyoung
    [J]. PROCEEDINGS OF THE 2017 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2017: 1147-1152
  • [10] Design of Convolutional Neural Networks Hardware Acceleration Based on FPGA
    Qin, Huabiao
    Cao, Qinping
    [J]. Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2019, 41 (11): 2599-2605