An FPGA-based Accelerator Implementation for Deep Convolutional Neural Networks

被引：0

作者：

Zhou, Yongmei ^{[1
]}

Jiang, Jingfei ^{[1
]}

机构：

[1] Natl Univ Def Technol, Coll Comp, Changsha, Hunan, Peoples R China

来源：

PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015) | 2015年

关键词：

FPGA; Convolutional Neural Network; fixed-point arithmetic; HLS;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Deep convolutional neural networks (CNN) is highly efficient in image recognition tasks such as MNIST digit recognition. Accelerators based on FPGA platform are proposed since general purpose processor is disappointing in terms of performance when dealing with recognition tasks. Recently, an optimized FPGA-based accelerator design (work 1) has been proposed claiming best performance compared with existing implementations. But as the author acknowledged, performance could be better if fixed point presentation and computation elements had been used. Inspired by its methodology in implementing the Alexnet convolutional neural network, we implement a 5-layer accelerator for MNIST digit recognition task using the same Vivado HLS tool but using 11-bits fixed point precision on a Virtex7 FPGA. We compare performance on FPGA platform with the performance of the target CNN on MATLAB/CPU platform; we reach a speedup of 16.42. Our implementation runs at 150MHz and reaches a peak performance of 16.58 GMACS. Since our target CNN is simpler, we use much less resource than work 1 has used.

引用

页码：829 / 832

页数：4

共 50 条

[41] Throughput-Optimized FPGA Accelerator for Deep Convolutional Neural Networks
Liu, Zhiqiang
Dou, Yong
Jiang, Jingfei
Xu, Jinwei
Li, Shijie
Zhou, Yongmei
Xu, Yingnan
[J]. ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2017, 10 (03)
[42] OPU: An FPGA-Based Overlay Processor for Convolutional Neural Networks
Yu, Yunxuan
Wu, Chen
Zhao, Tiandong
Wang, Kun
He, Lei
[J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2020, 28 (01) : 35 - 47
[43] A Scalable FPGA Accelerator for Convolutional Neural Networks
Xu, Ke
Wang, Xiaoyun
Fu, Shihang
Wang, Dong
[J]. ADVANCED COMPUTER ARCHITECTURE, 2018, 908 : 3 - 14
[44] VHDL Generator for A High Performance Convolutional Neural Network FPGA-Based Accelerator
Hamdan, Muhammad K.
Rover, Diane T.
[J]. 2017 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2017,
[45] An FPGA-based accelerator for deep neural network with novel reconfigurable architecture
Jia, Han
Ren, Daming
Zou, Xuecheng
[J]. IEICE ELECTRONICS EXPRESS, 2021, 18 (04):
[46] Acceleration and implementation of convolutional neural networks based on FPGA
Zhao, Sijie
Gao, Shangshang
Wang, Rugang
Wang, Yuanyuan
Zhou, Feng
Guo, Naihong
[J]. DIGITAL SIGNAL PROCESSING, 2023, 141
[47] Energy-Efficient Architecture for FPGA-based Deep Convolutional Neural Networks with Binary Weights
Duan, Yunzhi
Li, Shuai
Zhang, Ruipeng
Wang, Qi
Chen, Jienan
Sobelman, Gerald E.
[J]. 2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
[48] High-Performance FPGA-based Accelerator for Bayesian Neural Networks
Fan, Hongxiang
Ferianc, Martin
Rodrigues, Miguel
Zhou, Hongyu
Niu, Xinyu
Luk, Wayne
[J]. 2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 1063 - 1068
[49] An Energy-Efficient FPGA-based Convolutional Neural Network Implementation
Irmak, Hasan
Alachiotis, Nikolaos
Ziener, Daniel
[J]. 29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
[50] Efficient FPGA-Based Convolutional Neural Network Implementation for Edge Computing
Cuong, Pham-Quoc
Thinh, Tran Ngoc
[J]. JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2023, 14 (03) : 479 - 487

← 1 2 3 4 5 →