A Unified Hardware Architecture for Convolutions and Deconvolutions in CNN

被引：0

作者：

Bai, Lin ^{[1
]}

Lyu, Yecheng ^{[1
]}

Huang, Xinming ^{[1
]}

机构：

[1] Worcester Polytech Inst, Worcester, MA 01609 USA

来源：

2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS) | 2020年

关键词：

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Deconvolution plays an important role in the state-of-the-art convolutional neural networks (CNNs) for the tasks like semantic segmentation, image super resolution, etc. In this paper, a scalable neural network hardware architecture for image segmentation is proposed. By sharing the same computing resources, both convolution and deconvolution operations are handled by the same process element array. In addition, access to on-chip and off-chip memories is optimized to alleviate the burden introduced by partial sum. As an example, SegNet-Basic has been implemented using the proposed unified architecture by targeting on Xilinx ZC706 FPGA, which achieves the performance of 151.5 GOPS and 94.3 GOPS for convolution and deconvolution respectively. This unified convolution/deconvolution design is applicable to other CNNs with deconvolution.

引用

页数：5

共 50 条

[11] Contextual-CNN: A Novel Architecture Capturing Unified Meaning for Sentence Classification
Shin, Joongbo
Kim, Yanghoon
Yoon, Seunghyun
Jung, Kyomin
2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 491 - 494
[12] Nonlinear CNN: improving CNNs with quadratic convolutions
Jiang, Yiyang
Yang, Fan
Zhu, Hengliang
Zhou, Dian
Zeng, Xuan
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (12): : 8507 - 8516
[13] Nonlinear CNN: improving CNNs with quadratic convolutions
Yiyang Jiang
Fan Yang
Hengliang Zhu
Dian Zhou
Xuan Zeng
Neural Computing and Applications, 2020, 32 : 8507 - 8516
[14] Hardware implementation of CNN
Veni, S
Yamuna, B
2005 International Conference on Intelligent Sensing and Information Processing, Proceedings, 2005, : 320 - 325
[15] Unified hardware architecture for 128-bit block ciphers AES and Camellia
Satoh, A
Morioka, S
CRYPTOGRAPHIC HARDWARE AND EMBEDDED SYSTEMS CHES 2003, PROCEEDINGS, 2003, 2779 : 304 - 318
[16] A Unified Parallel CORDIC-Based Hardware Architecture for LSTM Network Acceleration
Mohamed, Nadya A.
Cavallaro, Joseph R.
IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (10) : 2752 - 2766
[17] Skipping CNN Convolutions Through Efficient Memoization
de Moura, Rafael Fao
Santos, Paulo C.
de Lima, Joao Paulo C.
Alves, Marco A. Z.
Beck, Antonio C. S.
Carro, Luigi
EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION, SAMOS 2019, 2019, 11733 : 65 - 76
[18] Mini Pool : Pooling hardware architecture using minimized local memory for CNN accelerators
Lee, Eunchong
Lee, Sang-Seol
Sung, Minyong
Jang, Sung-Joon
Choi, Byoung-Ho
2021 36TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC), 2021,
[19] DiMO-Sparse: Differentiable Modeling and Optimization of Sparse CNN Dataflow and Hardware Architecture
Song, Jianfeng
Liang, Rongjian
Gong, Yu
Yuan, Bo
Hu, Jiang
2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
[20] HFNet: A CNN Architecture Co-designed for Neuromorphic Hardware With a Crossbar Array of Synapses
Gopalakrishnan, Roshan
Chua, Yansong
Sun, Pengfei
Sreejith Kumar, Ashish Jith
Basu, Arindam
FRONTIERS IN NEUROSCIENCE, 2020, 14

← 1 2 3 4 5 →