A Unified Hardware Architecture for Convolutions and Deconvolutions in CNN

被引:0
|
作者
Bai, Lin [1 ]
Lyu, Yecheng [1 ]
Huang, Xinming [1 ]
机构
[1] Worcester Polytech Inst, Worcester, MA 01609 USA
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deconvolution plays an important role in the state-of-the-art convolutional neural networks (CNNs) for the tasks like semantic segmentation, image super resolution, etc. In this paper, a scalable neural network hardware architecture for image segmentation is proposed. By sharing the same computing resources, both convolution and deconvolution operations are handled by the same process element array. In addition, access to on-chip and off-chip memories is optimized to alleviate the burden introduced by partial sum. As an example, SegNet-Basic has been implemented using the proposed unified architecture by targeting on Xilinx ZC706 FPGA, which achieves the performance of 151.5 GOPS and 94.3 GOPS for convolution and deconvolution respectively. This unified convolution/deconvolution design is applicable to other CNNs with deconvolution.
引用
收藏
页数:5
相关论文
共 50 条
  • [11] Contextual-CNN: A Novel Architecture Capturing Unified Meaning for Sentence Classification
    Shin, Joongbo
    Kim, Yanghoon
    Yoon, Seunghyun
    Jung, Kyomin
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 491 - 494
  • [12] Nonlinear CNN: improving CNNs with quadratic convolutions
    Jiang, Yiyang
    Yang, Fan
    Zhu, Hengliang
    Zhou, Dian
    Zeng, Xuan
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (12): : 8507 - 8516
  • [13] Nonlinear CNN: improving CNNs with quadratic convolutions
    Yiyang Jiang
    Fan Yang
    Hengliang Zhu
    Dian Zhou
    Xuan Zeng
    Neural Computing and Applications, 2020, 32 : 8507 - 8516
  • [14] Hardware implementation of CNN
    Veni, S
    Yamuna, B
    2005 International Conference on Intelligent Sensing and Information Processing, Proceedings, 2005, : 320 - 325
  • [15] Unified hardware architecture for 128-bit block ciphers AES and Camellia
    Satoh, A
    Morioka, S
    CRYPTOGRAPHIC HARDWARE AND EMBEDDED SYSTEMS CHES 2003, PROCEEDINGS, 2003, 2779 : 304 - 318
  • [16] A Unified Parallel CORDIC-Based Hardware Architecture for LSTM Network Acceleration
    Mohamed, Nadya A.
    Cavallaro, Joseph R.
    IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (10) : 2752 - 2766
  • [17] Skipping CNN Convolutions Through Efficient Memoization
    de Moura, Rafael Fao
    Santos, Paulo C.
    de Lima, Joao Paulo C.
    Alves, Marco A. Z.
    Beck, Antonio C. S.
    Carro, Luigi
    EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION, SAMOS 2019, 2019, 11733 : 65 - 76
  • [18] Mini Pool : Pooling hardware architecture using minimized local memory for CNN accelerators
    Lee, Eunchong
    Lee, Sang-Seol
    Sung, Minyong
    Jang, Sung-Joon
    Choi, Byoung-Ho
    2021 36TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC), 2021,
  • [19] DiMO-Sparse: Differentiable Modeling and Optimization of Sparse CNN Dataflow and Hardware Architecture
    Song, Jianfeng
    Liang, Rongjian
    Gong, Yu
    Yuan, Bo
    Hu, Jiang
    2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
  • [20] HFNet: A CNN Architecture Co-designed for Neuromorphic Hardware With a Crossbar Array of Synapses
    Gopalakrishnan, Roshan
    Chua, Yansong
    Sun, Pengfei
    Sreejith Kumar, Ashish Jith
    Basu, Arindam
    FRONTIERS IN NEUROSCIENCE, 2020, 14