An In-Memory Analog Computing Co-Processor for Energy-Efficient CNN Inference on Mobile Devices

被引:11
|
作者
Elbtity, Mohammed [1 ]
Singh, Abhishek [2 ]
Reidy, Brendan [1 ]
Guo, Xiaochen [2 ]
Zand, Ramtin [1 ]
机构
[1] Univ South Carolina, Dept Comp Sci & Engn, Columbia, SC 29208 USA
[2] Lehigh Univ, Dept Elect & Comp Engn, Bethlehem, PA 18015 USA
关键词
in-memory computing; magnetic random access memory (MRAM); convolutional neural networks (CNNs); mixed-precision and mixed-signal inference;
D O I
10.1109/ISVLSI51109.2021.00043
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we develop an in-memory analog computing (IMAC) architecture realizing both synaptic behavior and activation functions within non-volatile memory arrays. Spin-orbit torque magnetoresistive random-access memory (SOT-MRAM) devices are leveraged to realize sigmoidal neurons as well as binarized synapses. First, it is shown the proposed IMAC architecture can be utilized to realize a multilayer perceptron (MLP) classifier achieving orders of magnitude performance improvement compared to previous mixed-signal and digital implementations. Next, a heterogeneous mixed-signal and mixed-precision CPU-IMAC architecture is proposed for convolutional neural networks (CNNs) inference on mobile processors, in which IMAC is designed as a co-processor to realize fully-connected (FC) layers whereas convolution layers are executed in CPU. Architecture-level analytical models are developed to evaluate the performance and energy consumption of the CPU-IMAC architecture. Simulation results exhibit 6.5% and 10% energy savings for CPU-IMAC based realizations of LeNet and VGG CNN models, for MNIST and CIFAR-10 pattern recognition tasks, respectively.
引用
收藏
页码:188 / 193
页数:6
相关论文
共 50 条
  • [21] An Energy-Efficient Mixed-Bit CNN Accelerator With Column Parallel Readout for ReRAM-Based In-Memory Computing
    Liu, Dingbang
    Zhou, Haoxiang
    Mao, Wei
    Liu, Jun
    Han, Yuliang
    Man, Changhai
    Wu, Qiuping
    Guo, Zhiru
    Huang, Mingqiang
    Luo, Shaobo
    Lv, Mingsong
    Chen, Quan
    Yu, Hao
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2022, 12 (04) : 821 - 834
  • [22] An Energy-Efficient Nonvolatile In-Memory Computing Architecture for Extreme Learning Machine by Domain-Wall Nanowire Devices
    Wang, Yuhao
    Yu, Hao
    Ni, Leibin
    Huang, Guang-Bin
    Yan, Mei
    Weng, Chuliang
    Yang, Wei
    Zhao, Junfeng
    IEEE TRANSACTIONS ON NANOTECHNOLOGY, 2015, 14 (06) : 998 - 1012
  • [23] One-Transistor-Multiple-RRAM Cells for Energy-Efficient In-Memory Computing
    Uhlmann, Max
    Quesada, Emilio Perez-Bosch
    Fritscher, Markus
    Perez, Eduardo
    Schubert, Markus Andreas
    Reichenbach, Marc
    Ostrovskyy, Philip
    Wenger, Christian
    Kahmen, Gerhard
    2023 21ST IEEE INTERREGIONAL NEWCAS CONFERENCE, NEWCAS, 2023,
  • [24] AdaOper: Energy-efficient and Responsive Concurrent DNN Inference on Mobile Devices
    Lin, Zheng
    Guo, Bin
    Liu, Sicong
    Zhou, Wentao
    Ding, Yasan
    Zhang, Yu
    Yu, Zhiwen
    PROCEEDINGS OF THE 2024 ADAAIOTSYS 2024-WORKSHOP ON ADAPTIVE AIOT SYSTEMS, ADAAIOTSYS 2024, 2024, : 19 - 20
  • [25] Energy-Efficient, Two-Dimensional Analog Memory for Neuromorphic Computing
    Sharbati, Mohammad T.
    Du, Yanhao
    Xiong, Feng
    2018 76TH DEVICE RESEARCH CONFERENCE (DRC), 2018,
  • [26] Digital-Assisted Analog In-Memory Computing with RRAM Devices
    Wang, Zhenyu
    Nalla, Pragnya Sudershan
    Krishnan, Gokul
    Joshi, Rajiv V.
    Cady, Nathaniel C.
    Fan, Deliang
    Seo, Jae-sun
    Cao, Yu
    2023 INTERNATIONAL VLSI SYMPOSIUM ON TECHNOLOGY, SYSTEMS AND APPLICATIONS, VLSI-TSA/VLSI-DAT, 2023,
  • [27] Buffer Sizes Reduction for Memory-efficient CNN Inference on Mobile and Embedded Devices
    Minakova, Svetlana
    Stefanov, Todor
    2020 23RD EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD 2020), 2020, : 133 - 140
  • [28] SOT-MRAM based Analog in-Memory Computing for DNN inference
    Doevenspeck, J.
    Garello, K.
    Verhoef, B.
    Degraeve, R.
    Van Beek, S.
    Crotti, D.
    Yasin, F.
    Couet, S.
    Jayakumar, G.
    Papistas, I. A.
    Debacker, P.
    Lauwereins, R.
    Dehaene, W.
    Kar, G. S.
    Cosemans, S.
    Mallik, A.
    Verkest, D.
    2020 IEEE SYMPOSIUM ON VLSI TECHNOLOGY, 2020,
  • [29] Devices and Architectures for Efficient Computing In-Memory (CIM) Design
    Bengel, Christopher
    Gebregiorgis, Anteneh
    Menzel, Stephan
    Waser, Rainer
    Gaydadjiev, Georgi
    Hamdioui, Said
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2023, 14385 LNCS : 437 - 450
  • [30] Reinforcement Learning Based Energy-Efficient Collaborative Inference for Mobile Edge Computing
    Xiao, Yilin
    Xiao, Liang
    Wan, Kunpeng
    Yang, Helin
    Zhang, Yi
    Wu, Yi
    Zhang, Yanyong
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 71 (02) : 864 - 876