Design of high performance and energy efficient convolution array for convolution neural network-based image inference engine

被引：1

作者：

Deepika, S. ^{[1
]}

Arunachalam, V. ^{[1
]}

机构：

[1] Vellore Inst Technol, Dept Micro & Nanoelect, Vellore 632014, India

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2023年 / 126卷

关键词：

Convolutional neural network; Edge computing; Energy efficient convolution array; Inference engine; Sparse accelerator; ACCELERATOR;

D O I：

10.1016/j.engappai.2023.106953

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The energy efficiency of CNN-based inference engines predominately depends upon Giga-operations-per-second and power consumption. The sparse-based accelerator compresses the insignificant inputs (input feature maps & weights), skips the inefficient computations, and improves energy efficiency. A sparse accelerator for weights could impact the accuracy of the inference. Therefore, a sparse network for Input Feature Maps (IFMs) is considered. MATLAB-based sparsity analysis is done layer-wise on the pre-trained CNN models like AlexNet, VGG-16, VGG-19, ResNet-18 & ResNet-34. Layer-wise analysis reveals that similar to 18%-90% of the IFMs are zeros. Besides, IFMs and Weights adopted the 16-bit Fix/Float data format to maintain an accuracy as close as 97% with Single Precision Floating Point (SPFP). A 3 x 1 Convolutional array with improved Zero-detect-Skip (CZS(3x1)) control units for multiplier and adder/subtractor arrays is proposed. The modified rectified linear unit (RELU) converts IFM values <= 0 to zero and sets Detection-Bit (DB) to 1. These DBs decide the mode of effective zero-skip operations in CZS(3x1). A 3 x 3 Compressed Processing Element (CPE) is designed using three CZS(3x1). The 20-CPEs convolution array architecture is implemented in 65 nm technology libraries. The performance of 90 Giga-operations per second (GOP/s) and energy efficiency of 3.42 Tera operations per second per watt (TOPS/W) were attained for the proposed CPE. The CPE with improved control strategy enhanced the performance by a factor of 2.45 while consuming 8.8 times less energy on average than the state-of-the-art CNN accelerators.

引用

页数：13

共 50 条

[31] Research on High Resolution Remote Sensing Image Classification Based on Convolution Neural Network
Gong, Wenwen
Wang, Zhuqing
Liang, Yong
Fan, Xin
Hao, Junmeng
COMPUTER AND COMPUTING TECHNOLOGIES IN AGRICULTURE XI, PT I, 2019, 545 : 87 - 97
[32] An Efficient Method of Histological Cell Image Detection Based on Spatial Information Convolution Neural Network
Qiang, Qi
Hong, Wang
Likang, Peng
ICVIP 2019: PROCEEDINGS OF 2019 3RD INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, 2019, : 69 - 73
[33] Emotional design of bamboo chair based on deep convolution neural network and deep convolution generative adversarial network
Kang, Xinhui
Nagasawa, Shin'ya
Wu, Yixiang
Xiong, Xingfu
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (02) : 1977 - 1989
[34] Design Method for an LUT Network-Based CNN with a Sparse Local Convolution
Soga, Naoto
Nakahara, Hiroki
2020 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2020), 2020, : 294 - 295
[35] IMCE: Energy-Efficient Bit-Wise In-Memory Convolution Engine for Deep Neural Network
Angizi, Shaahin
He, Zhezhi
Parveen, Farhana
Fan, Deliang
2018 23RD ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2018, : 111 - 116
[36] Design of Convolutional Neural Network Based on Reticulated Convolution Module
Li Daihui
Yang Lei
Zeng Shangyou
Ma Chengxu
PROCEEDINGS OF 2019 IEEE 9TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC 2019), 2019, : 256 - 259
[37] Performance Optimization of Neural Network Convolution Based on GPU Platform
Li M.
Qu G.
Wei D.
Jia H.
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (06): : 1181 - 1191
[38] An efficient image dahazing using Googlenet based convolution neural networks
Harish Babu G
Venkatram N
Multimedia Tools and Applications, 2022, 81 : 43897 - 43917
[39] An efficient image dahazing using Googlenet based convolution neural networks
Babu, Harish G.
Venkatram, N.
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (30) : 43897 - 43917
[40] Deep Convolution Neural Network-Based Crack Feature Extraction, Detection and Quantification
Shuai Teng
Gongfa Chen
Journal of Failure Analysis and Prevention, 2022, 22 : 1308 - 1321

← 1 2 3 4 5 →