An Energy-Efficient Implementation of Group Pruned CNNs on FPGA

被引:10
|
作者
Pang, Wei [1 ]
Wu, Chenglu [1 ]
Lu, Shengli [1 ]
机构
[1] Southeast Univ, Natl ASIC Syst Engn Res Ctr, Nanjing 210096, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷
关键词
Field programmable gate arrays; Mathematical model; Optimization; Energy efficiency; Filtering algorithms; Convolutional neural networks; Feature extraction; Neural network hardware; accelerator architectures; compression algorithms; field programmable gate arrays; NEURAL-NETWORKS;
D O I
10.1109/ACCESS.2020.3041464
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, convolutional neural network (CNN)-based artificial intelligence algorithms have been widely applied to object recognition and image classification tasks. However, the high performance of convolutional neural networks comes at the cost of high-intensity computing and enormous numbers of parameters, which pose substantial challenges to terminal implementations. An end-to-end FPGA-based accelerator is proposed in this work that efficiently processes fine-grained pruned CNNs. A group pruning algorithm with group sparse regularization (GSR) is introduced to solve internal buffer misalignments and load imbalances of the accelerator after fine-grained pruning. A mathematical model of accelerator access and transmission is established to explore the optimal design scale and calculation mode. The accelerator is optimized by designing sparse processing elements and by scheduling the on- and off-chip buffers. The proposed approach reduces the computation of a state-of-the-art large-scale CNN, VGG16, by 86.9% with an accuracy loss on CIFAR-10 of only 0.48%. The accelerator achieves 188.41 GOPS at 100 MHz and consumes 8.15 W when implemented on a Xilinx VC707, making it more energy-efficient than previous approaches.
引用
收藏
页码:217033 / 217044
页数:12
相关论文
共 50 条
  • [1] Energy-Efficient Architecture for CNNs Inference on Heterogeneous FPGA
    Spagnolo, Fanny
    Perri, Stefania
    Frustaci, Fabio
    Corsonello, Pasquale
    [J]. JOURNAL OF LOW POWER ELECTRONICS AND APPLICATIONS, 2020, 10 (01)
  • [2] An Energy-Efficient Reconfigurable Autoencoder Implementation on FPGA
    Isik, Murat
    Oldland, Matthew
    Zhou, Lifeng
    [J]. INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, INTELLISYS 2023, 2024, 822 : 212 - 222
  • [3] Energy -Efficient CNNs Accelerator Implementation on FPGA with Optimized Storage and Dataflow
    Zhang, Yonghua
    Jiang, Hongxu
    Li, Xiaobin
    Miao, Rui
    Nie, Jinyan
    Du, Yu
    [J]. 19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 1209 - 1214
  • [4] OctCNN: An Energy-Efficient FPGA Accelerator for CNNs using Octave Convolution Algorithm
    Lou, Wenqi
    Wang, Chao
    Gong, Lei
    Zhou, Xuehai
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2020), 2020, : 410 - 411
  • [5] Energy-Efficient CNN Implementation on a Deeply Pipelined FPGA Cluster
    Zhang, Chen
    Wu, Di
    Sun, Jiayu
    Sun, Guangyu
    Luo, Guojie
    Cong, Jason
    [J]. ISLPED '16: PROCEEDINGS OF THE 2016 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, 2016, : 326 - 331
  • [6] Energy-Efficient Histogram on FPGA
    Sanny, Andrea
    Yang, Yi-Hua E.
    Prasanna, Viktor K.
    [J]. 2014 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2014,
  • [7] An Energy-Efficient FPGA-based Convolutional Neural Network Implementation
    Irmak, Hasan
    Alachiotis, Nikolaos
    Ziener, Daniel
    [J]. 29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [8] Energy-Efficient FPGA Implementation for Binomial Option Pricing Using OpenCL
    Morales, Valentin Mena
    Horrein, Pierre-Henri
    Baghdadi, Amer
    Hochapfel, Erik
    Vaton, Sandrine
    [J]. 2014 DESIGN, AUTOMATION AND TEST IN EUROPE CONFERENCE AND EXHIBITION (DATE), 2014,
  • [9] Implementation of energy-efficient convolutional neural networks based on kernel-pruned silicon photonics
    Huang, Duan
    Xiong, Yanting
    Xing, Zhuangzhuang
    Zhang, Qi
    [J]. OPTICS EXPRESS, 2023, 31 (16) : 25865 - 25880
  • [10] Real life implementation of an energy-efficient adaptive advance encryption design on FPGA
    Bisht, Neeraj
    Pandey, Bishwajeet
    Budhani, Sandeep Kumar
    [J]. INTERNATIONAL JOURNAL OF EMBEDDED SYSTEMS, 2023, 16 (02) : 105 - 116