Efficient Implementation of 2D and 3D Sparse Deconvolutional Neural Networks with a Uniform Architecture on FPGAs

被引:10
|
作者
Wang, Deguang [1 ]
Shen, Junzhong [1 ]
Wen, Mei [1 ]
Zhang, Chunyuan [1 ]
机构
[1] Natl Univ Def Technol, Coll Comp, Changsha 410073, Hunan, Peoples R China
关键词
DCNN; FPGA; pruning; sparsity; acceleration; 2D; 3D; uniform architecture;
D O I
10.3390/electronics8070803
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Three-dimensional (3D) deconvolution is widely used in many computer vision applications. However, most previous works have only focused on accelerating two-dimensional (2D) deconvolutional neural networks (DCNNs) on Field-Programmable Gate Arrays (FPGAs), while the acceleration of 3D DCNNs has not been well studied in depth as they have higher computational complexity and sparsity than 2D DCNNs. In this paper, we focus on the acceleration of both 2D and 3D sparse DCNNs on FPGAs by proposing efficient schemes for mapping 2D and 3D sparse DCNNs on a uniform architecture. Firstly, a pruning method is used to prune unimportant network connections and increase the sparsity of weights. After being pruned, the number of parameters of DCNNs is reduced significantly without accuracy loss. Secondly, the remaining non-zero weights are encoded in coordinate (COO) format, reducing the memory demands of parameters. Finally, to demonstrate the effectiveness of our work, we implement our accelerator design on the Xilinx VC709 evaluation platform for four real-life 2D and 3D DCNNs. After the first two steps, the storage required of DCNNs is reduced up to 3.9x. Results show that the performance of our method on the accelerator outperforms that of the our prior work by 2.5x to 3.6x in latency.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Towards a Uniform Architecture for the Efficient Implementation of 2D and 3D Deconvolutional Neural Networks on FPGAs
    Wang, Deguang
    Shen, Junzhong
    Wen, Mei
    Zhang, Chunyuan
    2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
  • [2] A Uniform Architecture Design for Accelerating 2D and 3D CNNs on FPGAs
    Liu, Zhiqiang
    Chow, Paul
    Xu, Jinwei
    Jiang, Jingfei
    Dou, Yong
    Zhou, Jie
    ELECTRONICS, 2019, 8 (01)
  • [3] Efficient and flexible MATLAB implementation of 2D and 3D elastoplastic problems
    Cermak, M.
    Sysala, S.
    Valdman, J.
    APPLIED MATHEMATICS AND COMPUTATION, 2019, 355 : 595 - 614
  • [4] Efficient Converted Spiking Neural Network for 3D and 2D Classification
    Lan, Yuxiang
    Zhang, Yachao
    Ma, Xu
    Qu, Yanyun
    Fu, Yun
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 9177 - 9186
  • [5] Faster and efficient tetrahedral mesh generation using generator neural networks for 2D and 3D geometries
    Soman, Sumedh
    Mehendale, Ninad
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (04): : 1815 - 1832
  • [6] Faster and efficient tetrahedral mesh generation using generator neural networks for 2D and 3D geometries
    Sumedh Soman
    Ninad Mehendale
    Neural Computing and Applications, 2024, 36 : 1805 - 1813
  • [7] Evolutionary Neural Architecture Search for 2D and 3D Medical Image Classification
    Ali, Muhammad Junaid
    Moalic, Laurent
    Essaid, Mokhtar
    Idoumghar, Lhassane
    COMPUTATIONAL SCIENCE, ICCS 2024, PT II, 2024, 14833 : 131 - 146
  • [8] Efficient implementation of the rank correlation merit function for 2D/3D registration
    Figl, M.
    Bloch, C.
    Gendrin, C.
    Weber, C.
    Pawiro, S. A.
    Hummel, J.
    Markelj, P.
    Pernus, F.
    Bergmann, H.
    Birkfellner, W.
    PHYSICS IN MEDICINE AND BIOLOGY, 2010, 55 (19): : N465 - N471
  • [9] An Implementation of 3D Electron Tomography on FPGAs
    Gruell, Frederik
    Kunz, Michael
    Hausmann, Michael
    Kebschull, Udo
    2012 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2012,
  • [10] Towards a Uniform Template-based Architecture for Accelerating 2D and 3D CNNs on FPGA
    Shen, Junzhong
    Huang, You
    Wang, Zelong
    Qiao, Yuran
    Wen, Mei
    Zhang, Chunyuan
    PROCEEDINGS OF THE 2018 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA'18), 2018, : 97 - 106