Accelerating Low Bit-Width Deep Convolution Neural Network in MRAM

Cited by: 13
Authors
He, Zhezhi [1]
Angizi, Shaahin [1]
Fan, Deliang [1]
Affiliations
[1] Univ Cent Florida, Dept Elect & Comp Engn, Orlando, FL 32816 USA
Funding
National Science Foundation (USA)
Keywords
Neural network acceleration; In-memory computing; Magnetic Random Access Memory;
DOI
10.1109/ISVLSI.2018.00103
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Deep Convolution Neural Network (CNN) has achieved outstanding performance in image recognition over large scale dataset. However, pursuit of higher inference accuracy leads to CNN architecture with deeper layers and denser connections, which inevitably makes its hardware implementation demand more and more memory and computational resources. It can be interpreted as 'CNN power and memory wall'. Recent research efforts have significantly reduced both model size and computational complexity by using low bit-width weights, activations and gradients, while keeping reasonably good accuracy. In this work, we present different emerging nonvolatile Magnetic Random Access Memory (MRAM) designs that could be leveraged to implement 'bit-wise in-memory convolution engine', which could simultaneously store network parameters and compute low bit-width convolution. Such new computing model leverages the 'in-memory computing' concept to accelerate CNN inference and reduce convolution energy consumption due to intrinsic logic-in-memory design and reduction of data communication.
Pages: 533-538
Number of pages: 6