Efficient Fast Convolution Architectures for Convolutional Neural Network

Cited: 0
Authors
Xu, Weihong [1 ,2 ]
Wang, Zhongfeng [3 ]
You, Xiaohu [2 ]
Zhang, Chuan [1 ,2 ]
Affiliations
[1] Lab Efficient Architectures Digital Commun & Sign, Nanjing, Jiangsu, Peoples R China
[2] Southeast Univ, Natl Mobile Commun Res Lab, Nanjing, Jiangsu, Peoples R China
[3] Nanjing Univ, Sch Elect Sci & Engn, Nanjing, Jiangsu, Peoples R China
Keywords
Fast convolution; parallel processing; data reuse; hardware reconfigurability; convolutional neural network (CNN);
DOI
Not available
Chinese Library Classification (CLC)
TM [Electrical Engineering]; TN [Electronics and Communication Technology];
Subject Classification Codes
0808; 0809;
Abstract
Due to worldwide interest in artificial intelligence, many acceleration architectures for convolutional neural networks (CNNs) have been proposed recently. However, few of them focus on reducing the computational strength of convolution. In this paper, we first present a fast convolution algorithm and its matrix form. Based on this algorithm, a fully parallel architecture with high throughput is then proposed. To further increase efficiency and reduce computational redundancy, an output data reuse scheme tailored to CNNs is also introduced at the cost of affordable adders and buffers. Hardware implementation and complexity comparisons are conducted among different convolution architectures. Implementation results on the Zynq XC7Z045 platform demonstrate the effectiveness of the proposed fast convolution architectures in reducing complexity. Compared with a conventional 2-D convolver, our 3-parallel fast convolution filter reduces hardware resources by 28% and improves throughput by 17%. After deploying the data reuse scheme, our fast convolution architecture is 10.56x faster.
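The abstract does not spell out which fast convolution algorithm is used. As an illustrative sketch only (not the paper's method), the following Python implements Winograd's minimal filtering F(2,3), a well-known fast convolution that computes two outputs of a 3-tap filter with 4 general multiplications instead of the direct method's 6 — the kind of computation-strength reduction the abstract describes. All function names here are hypothetical.

```python
def winograd_f23(d, g):
    """Winograd F(2,3): two outputs of a 3-tap FIR filter, y[i] = sum_k d[i+k]*g[k],
    using 4 general multiplications instead of the direct method's 6.
    d: 4 consecutive input samples, g: 3 filter taps."""
    d0, d1, d2, d3 = d
    g0, g1, g2 = g
    # Filter-side transform; in hardware this is precomputed once per filter,
    # so its multiplications by 1/2 are not counted against the data path.
    t1 = (g0 + g1 + g2) / 2
    t2 = (g0 - g1 + g2) / 2
    # The 4 general multiplications
    m1 = (d0 - d2) * g0
    m2 = (d1 + d2) * t1
    m3 = (d2 - d1) * t2
    m4 = (d1 - d3) * g2
    # Output transform: additions only
    return [m1 + m2 + m3, m2 - m3 - m4]

def direct_fir(d, g):
    """Reference: direct 3-tap correlation, 6 multiplications for 2 outputs."""
    return [sum(d[i + k] * g[k] for k in range(3)) for i in range(2)]
```

Because the filter transform is amortized across all inputs, the per-output multiply count drops from 3 to 2 at the cost of a few extra additions, which is the trade-off that makes such filters attractive for FPGA convolvers.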
Pages: 904 - 907
Page count: 4
Related Papers
50 records
  • [1] Efficient Convolution Architectures for Convolutional Neural Network
    Wang, Jichen
    Lin, Jun
    Wang, Zhongfeng
    [J]. 2016 8TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2016,
  • [2] Bandwidth Efficient Architectures for Convolutional Neural Network
    Wang, Jichen
    Lin, Jun
    Wang, Zhongfeng
    [J]. PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2018, : 94 - 99
  • [3] Efficient Hardware Architectures for Deep Convolutional Neural Network
    Wang, Jichen
    Lin, Jun
    Wang, Zhongfeng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2018, 65 (06) : 1941 - 1953
  • [4] Fast Convolution Algorithm for Convolutional Neural Networks
    Kim, Tae Sun
    Bae, Jihoon
    Sunwoo, Myung Hoon
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS 2019), 2019, : 258 - 261
  • [5] Optimizing Convolutional Neural Network Architectures
    Balderas, Luis
    Lastra, Miguel
    Benitez, Jose M.
    [J]. MATHEMATICS, 2024, 12 (19)
  • [6] Coupled convolution layer for convolutional neural network
    Uchida, Kazutaka
    Tanaka, Masayuki
    Okutomi, Masatoshi
    [J]. NEURAL NETWORKS, 2018, 105 : 197 - 205
  • [7] Coupled Convolution Layer for Convolutional Neural Network
    Uchida, Kazutaka
    Tanaka, Masayuki
    Okutomi, Masatoshi
    [J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3548 - 3553
  • [8] A review of convolutional neural network architectures and their optimizations
    Cong, Shuang
    Zhou, Yang
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (03) : 1905 - 1969
  • [9] A review of convolutional neural network architectures and their optimizations
    Shuang Cong
    Yang Zhou
    [J]. Artificial Intelligence Review, 2023, 56 : 1905 - 1969
  • [10] DSC-Ghost-Conv: A compact convolution module for building efficient neural network architectures
    Tao Wang
    Shiqing Zhang
    [J]. Multimedia Tools and Applications, 2024, 83 : 36767 - 36795