An Efficient Hardware Accelerator for Block Sparse Convolutional Neural Networks on FPGA

被引:2
|
作者
Yin, Xiaodi [1 ]
Wu, Zhipeng [1 ]
Li, Dejian [2 ]
Shen, Chongfei [2 ]
Liu, Yu [1 ]
机构
[1] Tianjin Univ, Sch Microelect, Tianjin 300072, Peoples R China
[2] Beijing Smart Chip Microelect Technol Co Ltd, Beijing 100089, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolution; Field programmable gate arrays; Sparse matrices; Kernel; Encoding; Convolutional neural networks; Neural networks; Block pruning; convolutional neural network (CNN) accelerator; CNNs; field-programmable gate array (FPGA); sparse CNN;
D O I
10.1109/LES.2023.3296507
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Field-programmable gate array (FPGA) has become an excellent hardware accelerator solution for convolutional neural networks (CNNs). Meanwhile, optimizing methods, such as model compression, have been proposed. As most CNN accelerators focus on dense neural networks, to solve the problem of difficult hardware deployment due to irregular networks, we propose a method for sparse neural networks in our work. The storage and coding format of sparse data obtained by the block pruning method is designed to make it friendly to implement on FPGA. Besides, we also propose an efficient and simple data flow by the planarization of the whole convolution calculation process. The experimental result demonstrates that our implementation can achieve clock frequency of 190 MHz, power consumption of 13.32 W and inferencing speed of 16.37 ms. Compared with some typical Mobilenet implementation schemes, our method has been proven to achieve a better balance between frequency, accuracy, power consumption, and speed.
引用
收藏
页码:158 / 161
页数:4
相关论文
共 50 条
  • [31] SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks
    Parashar, Angshuman
    Rhu, Minsoo
    Mukkara, Anurag
    Puglielli, Antonio
    Venkatesan, Rangharajan
    Khailany, Brucek
    Emer, Joel
    Keckler, Stephen W.
    Dally, William J.
    44TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2017), 2017, : 27 - 40
  • [32] Search-free Accelerator for Sparse Convolutional Neural Networks
    Liu, Bosheng
    Chen, Xiaoming
    Han, Yinhe
    Wang, Ying
    Li, Jiajun
    Xu, Haobo
    Li, Xiaowei
    2020 25TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2020, 2020, : 524 - 529
  • [33] Dynamic block sparse reparameterization of convolutional neural networks
    Vooturi, Dharma Teja
    Varma, Girish
    Kothapalli, Kishore
    Proceedings - 2019 International Conference on Computer Vision Workshop, ICCVW 2019, 2019, : 3046 - 3053
  • [34] Dynamic Block Sparse Reparameterization of Convolutional Neural Networks
    Vooturi, Dharma Teja
    Varma, Girish
    Kothapalli, Kishore
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3046 - 3053
  • [35] Design of Convolutional Neural Networks Hardware Acceleration Based on FPGA
    Qin Huabiao
    Cao Qinping
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (11) : 2599 - 2605
  • [36] Design of Convolutional Neural Networks Hardware Acceleration Based on FPGA
    Qin H.
    Cao Q.
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2019, 41 (11): : 2599 - 2605
  • [37] FPGA-Based Reconfigurable Convolutional Neural Network Accelerator Using Sparse and Convolutional Optimization
    Gowda, Kavitha Malali Vishveshwarappa
    Madhavan, Sowmya
    Rinaldi, Stefano
    Divakarachari, Parameshachari Bidare
    Atmakur, Anitha
    ELECTRONICS, 2022, 11 (10)
  • [38] An Efficient Accelerator with Winograd for Novel Convolutional Neural Networks
    Lin, Zhijian
    Zhang, Meng
    Weng, Dongpeng
    Liu, Fei
    2022 5TH INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS AND SIMULATION (ICCSS 2022), 2022, : 126 - 130
  • [39] A Power-efficient Accelerator for Convolutional Neural Networks
    Sun, Fan
    Wang, Chao
    Gong, Lei
    Xu, Chongchong
    Zhang, Yiwei
    Lu, Yuntao
    Li, Xi
    Zhou, Xuehai
    2017 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2017, : 631 - 632
  • [40] An Efficient FIFO Based Accelerator for Convolutional Neural Networks
    Panchbhaiyye, Vineet
    Ogunfunmi, Tokunbo
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2021, 93 (10): : 1117 - 1129