Design of high performance and energy efficient convolution array for convolution neural network-based image inference engine

被引:1
|
作者
Deepika, S. [1 ]
Arunachalam, V. [1 ]
机构
[1] Vellore Inst Technol, Dept Micro & Nanoelect, Vellore 632014, India
关键词
Convolutional neural network; Edge computing; Energy efficient convolution array; Inference engine; Sparse accelerator; ACCELERATOR;
D O I
10.1016/j.engappai.2023.106953
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The energy efficiency of CNN-based inference engines predominately depends upon Giga-operations-per-second and power consumption. The sparse-based accelerator compresses the insignificant inputs (input feature maps & weights), skips the inefficient computations, and improves energy efficiency. A sparse accelerator for weights could impact the accuracy of the inference. Therefore, a sparse network for Input Feature Maps (IFMs) is considered. MATLAB-based sparsity analysis is done layer-wise on the pre-trained CNN models like AlexNet, VGG-16, VGG-19, ResNet-18 & ResNet-34. Layer-wise analysis reveals that similar to 18%-90% of the IFMs are zeros. Besides, IFMs and Weights adopted the 16-bit Fix/Float data format to maintain an accuracy as close as 97% with Single Precision Floating Point (SPFP). A 3 x 1 Convolutional array with improved Zero-detect-Skip (CZS(3x1)) control units for multiplier and adder/subtractor arrays is proposed. The modified rectified linear unit (RELU) converts IFM values <= 0 to zero and sets Detection-Bit (DB) to 1. These DBs decide the mode of effective zero-skip operations in CZS(3x1). A 3 x 3 Compressed Processing Element (CPE) is designed using three CZS(3x1). The 20-CPEs convolution array architecture is implemented in 65 nm technology libraries. The performance of 90 Giga-operations per second (GOP/s) and energy efficiency of 3.42 Tera operations per second per watt (TOPS/W) were attained for the proposed CPE. The CPE with improved control strategy enhanced the performance by a factor of 2.45 while consuming 8.8 times less energy on average than the state-of-the-art CNN accelerators.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Research on High Resolution Remote Sensing Image Classification Based on Convolution Neural Network
    Gong, Wenwen
    Wang, Zhuqing
    Liang, Yong
    Fan, Xin
    Hao, Junmeng
    COMPUTER AND COMPUTING TECHNOLOGIES IN AGRICULTURE XI, PT I, 2019, 545 : 87 - 97
  • [32] An Efficient Method of Histological Cell Image Detection Based on Spatial Information Convolution Neural Network
    Qiang, Qi
    Hong, Wang
    Likang, Peng
    ICVIP 2019: PROCEEDINGS OF 2019 3RD INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, 2019, : 69 - 73
  • [33] Emotional design of bamboo chair based on deep convolution neural network and deep convolution generative adversarial network
    Kang, Xinhui
    Nagasawa, Shin'ya
    Wu, Yixiang
    Xiong, Xingfu
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (02) : 1977 - 1989
  • [34] Design Method for an LUT Network-Based CNN with a Sparse Local Convolution
    Soga, Naoto
    Nakahara, Hiroki
    2020 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2020), 2020, : 294 - 295
  • [35] IMCE: Energy-Efficient Bit-Wise In-Memory Convolution Engine for Deep Neural Network
    Angizi, Shaahin
    He, Zhezhi
    Parveen, Farhana
    Fan, Deliang
    2018 23RD ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2018, : 111 - 116
  • [36] Design of Convolutional Neural Network Based on Reticulated Convolution Module
    Li Daihui
    Yang Lei
    Zeng Shangyou
    Ma Chengxu
    PROCEEDINGS OF 2019 IEEE 9TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC 2019), 2019, : 256 - 259
  • [37] Performance Optimization of Neural Network Convolution Based on GPU Platform
    Li M.
    Qu G.
    Wei D.
    Jia H.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (06): : 1181 - 1191
  • [38] An efficient image dahazing using Googlenet based convolution neural networks
    Harish Babu G
    Venkatram N
    Multimedia Tools and Applications, 2022, 81 : 43897 - 43917
  • [39] An efficient image dahazing using Googlenet based convolution neural networks
    Babu, Harish G.
    Venkatram, N.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (30) : 43897 - 43917
  • [40] Deep Convolution Neural Network-Based Crack Feature Extraction, Detection and Quantification
    Shuai Teng
    Gongfa Chen
    Journal of Failure Analysis and Prevention, 2022, 22 : 1308 - 1321