Accelerating Low Bit-Width Deep Convolution Neural Network in MRAM

被引:13
|
作者
He, Zhezhi [1 ]
Angizi, Shaahin [1 ]
Fan, Deliang [1 ]
机构
[1] Univ Cent Florida, Dept Elect & Comp Engn, Orlando, FL 32816 USA
基金
美国国家科学基金会;
关键词
Neural network acceleration; In-memory computing; Magnetic Random Access Memory;
D O I
10.1109/ISVLSI.2018.00103
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Convolution Neural Network (CNN) has achieved outstanding performance in image recognition over large scale dataset. However, pursuit of higher inference accuracy leads to CNN architecture with deeper layers and denser connections, which inevitably makes its hardware implementation demand more and more memory and computational resources. It can be interpreted as 'CNN power and memory wall'. Recent research efforts have significantly reduced both model size and computational complexity by using low bit-width weights, activations and gradients, while keeping reasonably good accuracy. In this work, we present different emerging nonvolatile Magnetic Random Access Memory (MRAM) designs that could be leveraged to implement 'bit-wise in-memory convolution engine', which could simultaneously store network parameters and compute low bit-width convolution. Such new computing model leverages the 'in-memory computing' concept to accelerate CNN inference and reduce convolution energy consumption due to intrinsic logic-in-memory design and reduction of data communication.
引用
收藏
页码:533 / 538
页数:6
相关论文
共 50 条
  • [1] Bit-width Adaptive Accelerator Design for Convolution Neural Network
    Guo, Jianxin
    Yin, Shouyi
    Ouyang, Peng
    Tu, Fengbin
    Tang, Shibin
    Liu, Leibo
    Wei, Shaojun
    [J]. 2018 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2018,
  • [2] Accelerating Low Bit-Width Convolutional Neural Networks With Embedded FPGA
    Jiao, Li
    Luo, Cheng
    Cao, Wei
    Zhou, Xuegong
    Wang, Lingli
    [J]. 2017 27TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2017,
  • [3] Low Bit-Width Convolutional Neural Network on RRAM
    Cai, Yi
    Tang, Tianqi
    Xia, Lixue
    Li, Boxun
    Wang, Yu
    Yang, Huazhong
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (07) : 1414 - 1427
  • [4] Design Flow of Accelerating Hybrid Extremely Low Bit-width Neural Network in Embedded FPGA
    Wang, Junsong
    Lou, Qiuwen
    Zhang, Xiaofan
    Zhu, Chao
    Lin, Yonghua
    Chen, Deming
    [J]. 2018 28TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2018, : 163 - 169
  • [5] Accelerating Low Bit-width Neural Networks at the Edge, PIM or FPGA: A Comparative Study
    Kochar, Nakul
    Ekiert, Lucas
    Najafi, Deniz
    Fan, Deliang
    Angizi, Shaahin
    [J]. PROCEEDINGS OF THE GREAT LAKES SYMPOSIUM ON VLSI 2023, GLSVLSI 2023, 2023, : 625 - 630
  • [6] Residual Quantization for Low Bit-Width Neural Networks
    Li, Zefan
    Ni, Bingbing
    Yang, Xiaokang
    Zhang, Wenjun
    Gao, Wen
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 214 - 227
  • [7] A Low Bit-width Parameter Representation Method for Hardware-oriented Convolution Neural Networks
    Chen, Qiang
    Xin, Chen
    Zou, Chenglong
    Wang, Xinan
    Wang, Bo
    [J]. 2017 IEEE 12TH INTERNATIONAL CONFERENCE ON ASIC (ASICON), 2017, : 148 - 151
  • [8] Direct Quantization for Training Highly Accurate Low Bit-width Deep Neural Networks
    Tuan Hoang
    Thanh-Toan Do
    Nguyen, Tam, V
    Cheung, Ngai-Man
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2111 - 2118
  • [9] Combinatorial optimization for low bit-width neural networks
    Zhou, Han
    Ashrafi, Aida
    Blaschko, Matthew B.
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2246 - 2252
  • [10] Bit-width Reduction and Customized Register for Low Cost Convolutional Neural Network Accelerator
    Choi, Kyungrak
    Choi, Woong
    Shin, Kyungho
    Park, Jongsun
    [J]. 2017 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN (ISLPED), 2017,