An In-Memory Analog Computing Co-Processor for Energy-Efficient CNN Inference on Mobile Devices

被引:11
|
作者
Elbtity, Mohammed [1 ]
Singh, Abhishek [2 ]
Reidy, Brendan [1 ]
Guo, Xiaochen [2 ]
Zand, Ramtin [1 ]
机构
[1] Univ South Carolina, Dept Comp Sci & Engn, Columbia, SC 29208 USA
[2] Lehigh Univ, Dept Elect & Comp Engn, Bethlehem, PA 18015 USA
关键词
in-memory computing; magnetic random access memory (MRAM); convolutional neural networks (CNNs); mixed-precision and mixed-signal inference;
D O I
10.1109/ISVLSI51109.2021.00043
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we develop an in-memory analog computing (IMAC) architecture realizing both synaptic behavior and activation functions within non-volatile memory arrays. Spin-orbit torque magnetoresistive random-access memory (SOT-MRAM) devices are leveraged to realize sigmoidal neurons as well as binarized synapses. First, it is shown the proposed IMAC architecture can be utilized to realize a multilayer perceptron (MLP) classifier achieving orders of magnitude performance improvement compared to previous mixed-signal and digital implementations. Next, a heterogeneous mixed-signal and mixed-precision CPU-IMAC architecture is proposed for convolutional neural networks (CNNs) inference on mobile processors, in which IMAC is designed as a co-processor to realize fully-connected (FC) layers whereas convolution layers are executed in CPU. Architecture-level analytical models are developed to evaluate the performance and energy consumption of the CPU-IMAC architecture. Simulation results exhibit 6.5% and 10% energy savings for CPU-IMAC based realizations of LeNet and VGG CNN models, for MNIST and CIFAR-10 pattern recognition tasks, respectively.
引用
收藏
页码:188 / 193
页数:6
相关论文
共 50 条
  • [31] Hardware-software co-exploration with racetrack memory based in-memory computing for CNN inference in embedded systems
    Choong, Benjamin Chen Ming
    Luo, Tao
    Liu, Cheng
    He, Bingsheng
    Zhang, Wei
    Zhou, Joey Tianyi
    JOURNAL OF SYSTEMS ARCHITECTURE, 2022, 128
  • [32] An Energy-efficient Matrix Multiplication Accelerator by Distributed In-memory Computing on Binary RRAM Crossbar
    Ni, Leibin
    Wang, Yuhao
    Yu, Hao
    Yang, Wei
    Weng, Chuliang
    Zhao, Junfeng
    2016 21ST ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2016, : 280 - 285
  • [33] Energy-efficient multimodal zero-shot learning using in-memory reservoir computing
    Nature Computational Science, 2025, 5 (1): : 11 - 12
  • [34] A co-processor design of an energy efficient reconfigurable accelerator CMA
    Izawa, Mai
    Ozaki, Nobuaki
    Koizumi, Yusuke
    Uno, Rie
    Amano, Hideharu
    2013 FIRST INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING (CANDAR), 2013, : 148 - 154
  • [35] High Performance and Energy-Efficient In-Memory Computing Architecture based on SOT-MRAM
    He, Zhezhi
    Angizi, Shaahin
    Parveen, Farhana
    Fan, Deliang
    PROCEEDINGS OF THE IEEE/ACM INTERNATIONAL SYMPOSIUM ON NANOSCALE ARCHITECTURES (NANOARCH 2017), 2017, : 97 - 102
  • [36] An Energy-Efficient and Robust 10T SRAM based In-Memory Computing Architecture
    Srivastava, Noopur
    Rajput, Anil Kumar
    Pattanaik, Manisha
    Kaushal, Gaurav
    2023 36TH INTERNATIONAL CONFERENCE ON VLSI DESIGN AND 2023 22ND INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS, VLSID, 2023, : 133 - 138
  • [37] Efficient large language model with analog in-memory computingNeuromorphic computing
    Anand Subramoney
    Nature Computational Science, 2025, 5 (1): : 5 - 6
  • [38] NEMO: An Energy-Efficient Hybrid Main Memory System for Mobile Devices
    Pourshirazi, Bahareh
    Zhu, Zhichun
    MEMSYS 2017: PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON MEMORY SYSTEMS, 2017, : 351 - 362
  • [39] Energy-Efficient Joint Partitioning and Offloading for Delay-Sensitive CNN Inference in Edge Computing
    Zha, Zhiyong
    Yang, Yifei
    Xia, Yongjun
    Wang, Zhaoyi
    Luo, Bin
    Li, Kaihong
    Ye, Chenkai
    Xu, Bo
    Peng, Kai
    Applied Sciences (Switzerland), 2024, 14 (19):
  • [40] Energy-Efficient Programable Analog Computing: Analog computing in a standard CMOS process
    Hasler, Jennifer
    IEEE Solid-State Circuits Magazine, 2024, 16 (04): : 32 - 40