A Unified Hardware Architecture for Convolutions and Deconvolutions in CNN

被引:0
|
作者
Bai, Lin [1 ]
Lyu, Yecheng [1 ]
Huang, Xinming [1 ]
机构
[1] Worcester Polytech Inst, Worcester, MA 01609 USA
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deconvolution plays an important role in the state-of-the-art convolutional neural networks (CNNs) for the tasks like semantic segmentation, image super resolution, etc. In this paper, a scalable neural network hardware architecture for image segmentation is proposed. By sharing the same computing resources, both convolution and deconvolution operations are handled by the same process element array. In addition, access to on-chip and off-chip memories is optimized to alleviate the burden introduced by partial sum. As an example, SegNet-Basic has been implemented using the proposed unified architecture by targeting on Xilinx ZC706 FPGA, which achieves the performance of 151.5 GOPS and 94.3 GOPS for convolution and deconvolution respectively. This unified convolution/deconvolution design is applicable to other CNNs with deconvolution.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] HARDWARE ARCHITECTURE OF A UNIFIED EMBEDDED ENGINEERING LEARNING PLATFORM
    Dukic, Jovan
    Bogic, Stevan
    Dimovski, Dusanka
    Pilipovic, Milos
    2013 21ST TELECOMMUNICATIONS FORUM (TELFOR), 2013, : 537 - +
  • [2] Customizing CMOS/ReRAM Hybrid Hardware Architecture for Spiking CNN
    Xue, Tianzhi
    Liu, Baicheng
    Sun, Wenhao
    Chen, Song
    Kang, Yi
    Wu, Feng
    2019 IEEE 13TH INTERNATIONAL CONFERENCE ON ASIC (ASICON), 2019,
  • [3] Design of Flexible Hardware Accelerators for Image Convolutions and Transposed Convolutions
    Sestito, Cristian
    Spagnolo, Fanny
    Perri, Stefania
    JOURNAL OF IMAGING, 2021, 7 (10)
  • [4] A Unified and Pipelined Hardware Architecture for Implementing Intra Prediction in HEVC
    Jiang, Yuebing
    Liamocca, Daniel
    Pattichis, Marios
    Esakki, Gangadharan
    2014 IEEE SOUTHWEST SYMPOSIUM ON IMAGE ANALYSIS AND INTERPRETATION (SSIAI 2014), 2014, : 29 - 32
  • [5] A Max Pooling Hardware Architecture Supporting Inference And Training For CNN Accelerators
    Kim, Sanghyun
    Lee, Eunchong
    Lee, Minkyu
    Kim, Kyungho
    Lee, Sang-Seol
    Jang, Sung-Joon
    2023 20TH INTERNATIONAL SOC DESIGN CONFERENCE, ISOCC, 2023, : 313 - 314
  • [6] Hardware Efficient Modified CNN Architecture for Traffic Sign Detection and Recognition
    Vaidya, Bhaumik
    Paunwala, Chirag
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2022, 22 (02)
  • [7] EXPLORING HARDWARE FRIENDLY BOTTLENECK ARCHITECTURE IN CNN FOR EMBEDDED COMPUTING SYSTEMS
    Lei, Xing
    Liu, Longjun
    Zhou, Zhiheng
    Sun, Hongbin
    Zheng, Nanning
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 4180 - 4184
  • [8] UniWiG: Unified Winograd-GEMM Architecture for Accelerating CNN on FPGAs
    Kala, S.
    Mathew, Jimson
    Jose, Babita R.
    Nalesh, S.
    2019 32ND INTERNATIONAL CONFERENCE ON VLSI DESIGN AND 2019 18TH INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS (VLSID), 2019, : 209 - 214
  • [9] A Low-Power Hardware Architecture for Real-Time CNN Computing
    Liu, Xinyu
    Cao, Chenhong
    Duan, Shengyu
    SENSORS, 2023, 23 (04)
  • [10] Hardware implementation of 1D-CNN architecture for ECG arrhythmia classification
    Rawal, Viraj
    Prajapati, Priyank
    Darji, Anand
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 85