Efficient Implementation of 2D and 3D Sparse Deconvolutional Neural Networks with a Uniform Architecture on FPGAs

Cited by: 10

Authors:

Wang, Deguang [1]

Shen, Junzhong [1]

Wen, Mei [1]

Zhang, Chunyuan [1]

Affiliations:

[1] Natl Univ Def Technol, Coll Comp, Changsha 410073, Hunan, Peoples R China

Source:

ELECTRONICS | 2019 / Vol. 8 / Issue 07

Keywords:

DCNN; FPGA; pruning; sparsity; acceleration; 2D; 3D; uniform architecture;

DOI:

10.3390/electronics8070803

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Three-dimensional (3D) deconvolution is widely used in many computer vision applications. However, most previous works have focused only on accelerating two-dimensional (2D) deconvolutional neural networks (DCNNs) on Field-Programmable Gate Arrays (FPGAs), while the acceleration of 3D DCNNs, which have higher computational complexity and sparsity than 2D DCNNs, has not been studied in depth. In this paper, we focus on the acceleration of both 2D and 3D sparse DCNNs on FPGAs by proposing efficient schemes for mapping 2D and 3D sparse DCNNs onto a uniform architecture. First, a pruning method is used to remove unimportant network connections and increase the sparsity of the weights. After pruning, the number of DCNN parameters is reduced significantly without accuracy loss. Second, the remaining non-zero weights are encoded in coordinate (COO) format, reducing the memory demands of the parameters. Finally, to demonstrate the effectiveness of our work, we implement our accelerator design on the Xilinx VC709 evaluation platform for four real-life 2D and 3D DCNNs. After the first two steps, the storage required by the DCNNs is reduced by up to 3.9x. Results show that our accelerator outperforms our prior work by 2.5x to 3.6x in latency.
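The first two steps named in the abstract (pruning, then COO encoding of the surviving weights) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not state the pruning criterion, so magnitude thresholding is assumed here, and the function names, the threshold value, and the toy 3x3 kernel are all hypothetical.

```python
from typing import List, Tuple

Matrix = List[List[float]]
CooEntry = Tuple[int, int, float]  # (row index, column index, weight value)


def prune_by_magnitude(weights: Matrix, threshold: float) -> Matrix:
    """Zero every weight whose absolute value is below `threshold`.

    Assumption: magnitude-based pruning; the paper's criterion may differ.
    """
    return [[w if abs(w) >= threshold else 0.0 for w in row] for row in weights]


def to_coo(weights: Matrix) -> List[CooEntry]:
    """Keep only the non-zero weights, each stored with its coordinates."""
    return [
        (r, c, w)
        for r, row in enumerate(weights)
        for c, w in enumerate(row)
        if w != 0.0
    ]


def from_coo(entries: List[CooEntry], rows: int, cols: int) -> Matrix:
    """Rebuild the dense matrix from its COO entries (used as a sanity check)."""
    dense = [[0.0] * cols for _ in range(rows)]
    for r, c, w in entries:
        dense[r][c] = w
    return dense


# Hypothetical 3x3 deconvolution kernel slice.
kernel = [
    [0.02, -0.70, 0.01],
    [0.50, 0.00, -0.03],
    [-0.04, 0.90, 0.05],
]
pruned = prune_by_magnitude(kernel, threshold=0.1)
coo = to_coo(pruned)
print(coo)
```

How much storage COO actually saves depends on the sparsity reached and on the bit width of the stored indices relative to the weights; a 3D kernel would carry one extra coordinate per entry.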


Pages: 13

