An Efficient Hardware Accelerator for Block Sparse Convolutional Neural Networks on FPGA

被引：2

作者：

Yin, Xiaodi ^{[1
]}

Wu, Zhipeng ^{[1
]}

Li, Dejian ^{[2
]}

Shen, Chongfei ^{[2
]}

Liu, Yu ^{[1
]}

机构：

[1] Tianjin Univ, Sch Microelect, Tianjin 300072, Peoples R China

[2] Beijing Smart Chip Microelect Technol Co Ltd, Beijing 100089, Peoples R China

来源：

IEEE EMBEDDED SYSTEMS LETTERS | 2024年 / 16卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Convolution; Field programmable gate arrays; Sparse matrices; Kernel; Encoding; Convolutional neural networks; Neural networks; Block pruning; convolutional neural network (CNN) accelerator; CNNs; field-programmable gate array (FPGA); sparse CNN;

D O I：

10.1109/LES.2023.3296507

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Field-programmable gate array (FPGA) has become an excellent hardware accelerator solution for convolutional neural networks (CNNs). Meanwhile, optimizing methods, such as model compression, have been proposed. As most CNN accelerators focus on dense neural networks, to solve the problem of difficult hardware deployment due to irregular networks, we propose a method for sparse neural networks in our work. The storage and coding format of sparse data obtained by the block pruning method is designed to make it friendly to implement on FPGA. Besides, we also propose an efficient and simple data flow by the planarization of the whole convolution calculation process. The experimental result demonstrates that our implementation can achieve clock frequency of 190 MHz, power consumption of 13.32 W and inferencing speed of 16.37 ms. Compared with some typical Mobilenet implementation schemes, our method has been proven to achieve a better balance between frequency, accuracy, power consumption, and speed.

引用

页码：158 / 161

页数：4

共 50 条

[31] SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks
Parashar, Angshuman
Rhu, Minsoo
Mukkara, Anurag
Puglielli, Antonio
Venkatesan, Rangharajan
Khailany, Brucek
Emer, Joel
Keckler, Stephen W.
Dally, William J.
44TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2017), 2017, : 27 - 40
[32] Search-free Accelerator for Sparse Convolutional Neural Networks
Liu, Bosheng
Chen, Xiaoming
Han, Yinhe
Wang, Ying
Li, Jiajun
Xu, Haobo
Li, Xiaowei
2020 25TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2020, 2020, : 524 - 529
[33] Dynamic block sparse reparameterization of convolutional neural networks
Vooturi, Dharma Teja
Varma, Girish
Kothapalli, Kishore
Proceedings - 2019 International Conference on Computer Vision Workshop, ICCVW 2019, 2019, : 3046 - 3053
[34] Dynamic Block Sparse Reparameterization of Convolutional Neural Networks
Vooturi, Dharma Teja
Varma, Girish
Kothapalli, Kishore
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3046 - 3053
[35] Design of Convolutional Neural Networks Hardware Acceleration Based on FPGA
Qin Huabiao
Cao Qinping
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (11) : 2599 - 2605
[36] Design of Convolutional Neural Networks Hardware Acceleration Based on FPGA
Qin H.
Cao Q.
Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2019, 41 (11): : 2599 - 2605
[37] FPGA-Based Reconfigurable Convolutional Neural Network Accelerator Using Sparse and Convolutional Optimization
Gowda, Kavitha Malali Vishveshwarappa
Madhavan, Sowmya
Rinaldi, Stefano
Divakarachari, Parameshachari Bidare
Atmakur, Anitha
ELECTRONICS, 2022, 11 (10)
[38] An Efficient Accelerator with Winograd for Novel Convolutional Neural Networks
Lin, Zhijian
Zhang, Meng
Weng, Dongpeng
Liu, Fei
2022 5TH INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS AND SIMULATION (ICCSS 2022), 2022, : 126 - 130
[39] A Power-efficient Accelerator for Convolutional Neural Networks
Sun, Fan
Wang, Chao
Gong, Lei
Xu, Chongchong
Zhang, Yiwei
Lu, Yuntao
Li, Xi
Zhou, Xuehai
2017 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2017, : 631 - 632
[40] An Efficient FIFO Based Accelerator for Convolutional Neural Networks
Panchbhaiyye, Vineet
Ogunfunmi, Tokunbo
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2021, 93 (10): : 1117 - 1129

← 1 2 3 4 5 →