Accelerating Low Bit-Width Deep Convolution Neural Network in MRAM

被引:13
|
作者
He, Zhezhi [1 ]
Angizi, Shaahin [1 ]
Fan, Deliang [1 ]
机构
[1] Univ Cent Florida, Dept Elect & Comp Engn, Orlando, FL 32816 USA
基金
美国国家科学基金会;
关键词
Neural network acceleration; In-memory computing; Magnetic Random Access Memory;
D O I
10.1109/ISVLSI.2018.00103
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Convolution Neural Network (CNN) has achieved outstanding performance in image recognition over large scale dataset. However, pursuit of higher inference accuracy leads to CNN architecture with deeper layers and denser connections, which inevitably makes its hardware implementation demand more and more memory and computational resources. It can be interpreted as 'CNN power and memory wall'. Recent research efforts have significantly reduced both model size and computational complexity by using low bit-width weights, activations and gradients, while keeping reasonably good accuracy. In this work, we present different emerging nonvolatile Magnetic Random Access Memory (MRAM) designs that could be leveraged to implement 'bit-wise in-memory convolution engine', which could simultaneously store network parameters and compute low bit-width convolution. Such new computing model leverages the 'in-memory computing' concept to accelerate CNN inference and reduce convolution energy consumption due to intrinsic logic-in-memory design and reduction of data communication.
引用
收藏
页码:533 / 538
页数:6
相关论文
共 50 条
  • [41] Low Dose CT Lung Denoising Model Based on Deep Convolution Neural Network
    Lu Xiaoqi
    Wu Liang
    Gu Yu
    Zhang Ming
    Li Jing
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2018, 40 (06) : 1353 - 1359
  • [42] Bit Fusion: Bit-Level Dynamically Composable Architecture for Accelerating Deep Neural Networks
    Sharma, Hardik
    Park, Jongse
    Suda, Naveen
    Lai, Liangzhen
    Chau, Benson
    Chandra, Vikas
    Esmaeilzadeh, Hadi
    [J]. 2018 ACM/IEEE 45TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2018, : 764 - 775
  • [43] IMCE: Energy-Efficient Bit-Wise In-Memory Convolution Engine for Deep Neural Network
    Angizi, Shaahin
    He, Zhezhi
    Parveen, Farhana
    Fan, Deliang
    [J]. 2018 23RD ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2018, : 111 - 116
  • [44] Commodity Bit-Cell Sponsored MRAM Interaction Design for Binary Neural Network
    Cai, Hao
    Bian, Zhongjian
    Fan, Zhonghua
    Liu, Bo
    Naviner, Lirida
    [J]. IEEE TRANSACTIONS ON ELECTRON DEVICES, 2022, 69 (04) : 1721 - 1726
  • [45] Neurons Detection Employing a Deep Convolution Neural Network
    Hai-Dang To
    Thanh-Hung Nguyen
    Huu-Long Nguyen
    [J]. PROCEEDINGS OF THE 3RD ANNUAL INTERNATIONAL CONFERENCE ON MATERIAL, MACHINES AND METHODS FOR SUSTAINABLE DEVELOPMENT, VOL 2, MMMS 2022, 2024, : 345 - 351
  • [46] Deep Separable Convolution Neural Network for Illumination Estimation
    Wang, Minquan
    Shang, Zhaowei
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 879 - 886
  • [47] Review of Deep Convolution Neural Network in Image Classification
    Al-Saffar, Ahmed Ali Mohammed
    Tao, Hai
    Talab, Mohammed Ahmed
    [J]. 2017 INTERNATIONAL CONFERENCE ON RADAR, ANTENNA, MICROWAVE, ELECTRONICS, AND TELECOMMUNICATIONS (ICRAMET), 2017, : 26 - 31
  • [48] Semantic Segmentation Based on Deep Convolution Neural Network
    Shan, Jichao
    Li, Xiuzhi
    Jia, Songmin
    Zhang, Xiangyin
    [J]. 3RD ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI2018), 2018, 1069
  • [49] Encoding temporal information in deep convolution neural network
    Singh, Avinash Kumar
    Bianchi, Luigi
    [J]. FRONTIERS IN NEUROERGONOMICS, 2024, 5
  • [50] Low-power optimization by smart bit-width allocation in a SystemC-based ASIC design environment
    Mallik, Arindam
    Sinha, Debjit
    Banerjee, Prithviraj
    Zhou, Hai
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2007, 26 (03) : 447 - 455