Efficient Utilization of FPGA Multipliers for Convolutional Neural Networks

Cited by: 0

Authors:

Boulasikis, M. A. [1]

Birbas, M. [1]

Tsafas, N. [1]

Kanakaris, N. [1]

Affiliation:

[1] Univ Patras, Dept Elect & Comp Engn, Patras, Greece

Source:

2021 10TH INTERNATIONAL CONFERENCE ON MODERN CIRCUITS AND SYSTEMS TECHNOLOGIES (MOCAST) | 2021

Keywords:

Deep Learning; Hardware Optimizations; Embedded Systems; Computer Vision; Quantization; System Level Design

DOI:

10.1109/MOCAST52088.2021.9493366

Chinese Library Classification (CLC) number:

TP39 [Computer applications]

Discipline classification codes:

081203; 0835

Abstract:

Recent advances in computer vision create demand for larger and more complex Deep Convolutional Neural Network (CNN) architectures. As a result, computation time and memory usage become the main bottlenecks in applied deep network inference, particularly in embedded systems implementations. Parameter quantization is often employed in these cases to mitigate these bottlenecks. This paper considers low-level hardware optimizations for fixed-point convolution. Emphasis is placed on the use of Digital Signal Processing units (DSPs) as dual multipliers and on practical considerations. Three-by-three convolution kernels are formulated based on this research and measured as case studies. The experiments show that proper exploitation of dual multipliers can offer significant benefits to the system.
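The abstract does not spell out how a single multiplier serves as a dual multiplier. As a rough illustration only, the sketch below shows one commonly used packing idea for fixed-point operands: two signed 8-bit values that share a common factor are placed in one wide word, multiplied once, and the two products are then separated. The 8-bit operand width, the 18-bit gap (`SHIFT`), and the function name `dual_multiply` are assumptions made for this example and are not taken from the paper.

```python
SHIFT = 18  # assumed gap between the two packed operands (not from the paper)


def dual_multiply(a, b, c):
    """Return (a * c, b * c) using a single wide multiplication.

    a, b, c are assumed to be signed 8-bit integers. Each product then
    fits in 16 bits, so an 18-bit field is wide enough to keep the two
    products from overlapping.
    """
    for value in (a, b, c):
        if not -128 <= value <= 127:
            raise ValueError("operands must be signed 8-bit integers")

    packed = (a << SHIFT) + b   # a in the upper field, b in the lower field
    product = packed * c        # the only multiplication performed

    # The lower field holds b * c in two's complement; sign-extend it.
    low = product & ((1 << SHIFT) - 1)
    if low >= 1 << (SHIFT - 1):
        low -= 1 << SHIFT

    # Remove the lower field's contribution before shifting the upper one down.
    high = (product - low) >> SHIFT
    return high, low


print(dual_multiply(-3, 5, -7))  # (21, -35)
```

In a convolution, the shared factor `c` could be an input pixel and `a`, `b` two kernel weights (or the reverse), so each packed multiplication yields two partial products. Accumulating several packed products before unpacking would need extra guard bits, which this sketch does not model.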


Pages: 5
