A Unified Hardware Architecture for Convolutions and Deconvolutions in CNN

被引:0
|
作者
Bai, Lin [1 ]
Lyu, Yecheng [1 ]
Huang, Xinming [1 ]
机构
[1] Worcester Polytech Inst, Worcester, MA 01609 USA
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deconvolution plays an important role in the state-of-the-art convolutional neural networks (CNNs) for the tasks like semantic segmentation, image super resolution, etc. In this paper, a scalable neural network hardware architecture for image segmentation is proposed. By sharing the same computing resources, both convolution and deconvolution operations are handled by the same process element array. In addition, access to on-chip and off-chip memories is optimized to alleviate the burden introduced by partial sum. As an example, SegNet-Basic has been implemented using the proposed unified architecture by targeting on Xilinx ZC706 FPGA, which achieves the performance of 151.5 GOPS and 94.3 GOPS for convolution and deconvolution respectively. This unified convolution/deconvolution design is applicable to other CNNs with deconvolution.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] HARDWARE ARCHITECTURE OR THE ARCHITECTURE OF TECHNOLOGY
    SHARP, D
    ARCHITECTURE D AUJOURD HUI, 1980, 212 (DEC): : 2 - 7
  • [32] A New Hardware-Efficient VLSI-Architecture of GoogLeNet CNN-Model Based Hardware Accelerator for Edge Computing Applications
    Islam, Md. Najrul
    Shrestha, Rahul
    Chowdhury, Shubhajit Roy
    2022 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2022), 2022, : 414 - 417
  • [33] ASIC implementation of a unified hardware architecture for non-key based cryptographic hash primitives
    Ganesh, TS
    Sudarshan, TSB
    ITCC 2005: INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: CODING AND COMPUTING, VOL 1, 2005, : 580 - 585
  • [34] Quantized CNN-based efficient hardware architecture for real-time hand gesture recognition
    Jaiswal, Mohita
    Sharma, Vaidehi
    Sharma, Abhishek
    Saini, Sandeep
    Tomar, Raghuvir
    MICROELECTRONICS JOURNAL, 2024, 151
  • [35] CNN with depthwise separable convolutions and combined kernels for rating prediction
    Khan, Zahid Younas
    Niu, Zhendong
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 170
  • [36] Retinal architecture in CNN
    Werblin, FS
    CNNA 98 - 1998 FIFTH IEEE INTERNATIONAL WORKSHOP ON CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS - PROCEEDINGS, 1998, : 11 - 12
  • [37] Hardware Trojan in FPGA CNN Accelerator
    Ye, Jing
    Hu, Yu
    Li, Xiaowei
    2018 IEEE 27TH ASIAN TEST SYMPOSIUM (ATS), 2018, : 68 - 73
  • [38] Heterogeneous Edge CNN Hardware Accelerator
    Moudgill, Mayan
    Glossner, John
    Huang, Wei
    Tian, Chaoyang
    Xu, Chunxia
    Yang, Nianliang
    Wang, Lei
    Liang, Tailin
    Shi, Shaobo
    Zhang, Xiaodong
    Iancu, Daniel
    Nacer, Gary
    Li, Kerry
    2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 636 - 641
  • [39] CoNNA - Compressed CNN Hardware Accelerator
    Struharik, Rastislav
    Vukobratovic, Bogdan
    Erdeljan, Andrea
    Rakanovic, Damjan
    2018 21ST EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD 2018), 2018, : 365 - 372
  • [40] A unified construction of product formulas and convolutions for Sturm–Liouville operators
    Rúben Sousa
    Manuel Guerra
    Semyon Yakubovich
    Analysis and Mathematical Physics, 2021, 11