Monolithically Integrated RRAM- and CMOS-Based In-Memory Computing Optimizations for Efficient Deep Learning

被引:63
|
作者
Yin, Shihui [1 ]
Kim, Yulhwa [2 ]
Han, Xu [1 ]
Barnaby, Hugh [1 ]
Yu, Shimeng [3 ]
Luo, Yandong [3 ]
He, Wangxin [1 ]
Sun, Xiaoyu [3 ]
Kim, Jae-Joon [2 ]
Seo, Jae-sun [1 ]
机构
[1] Arizona State Univ, Sch Elect Comp & Energy Engn, Tempe, AZ 85287 USA
[2] Pohang Univ Sci & Technol, Dept Creat IT Engn, Pohang, South Korea
[3] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
基金
美国国家科学基金会; 新加坡国家研究基金会;
关键词
Deep neural networks;
D O I
10.1109/MM.2019.2943047
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Resistive RAM (RRAM) has been presented as a promising memory technology toward deep neural network (DNN) hardware design, with nonvolatility, high density, high ON/OFF ratio, and compatibility with logic process. However, prior RRAM works for DNNs have shown limitations on parallelism for in-memory computing, array efficiency with large peripheral circuits, multilevel analog operation, and demonstration of monolithic integration. In this article, we propose circuit-/device-level optimizations to improve the energy and density of RRAM-based in-memory computing architectures. We report experimental results based on prototype chip design of 128 x 64 RRAM arrays and CMOS peripheral circuits, where RRAM devices are monolithically integrated in a commercial 90-nm CMOS technology. We demonstrate the CMOS peripheral circuit optimization using input-splitting scheme and investigate the implication of higher low resistance state on energy efficiency and robustness. Employing the proposed techniques, we demonstrate RRAM-based in-memory computing with up to 116.0 TOPS/W energy efficiency and 84.2% CIFAR-10 accuracy. Furthermore, we investigate four-level programming with single RRAM device, and report the system-level performance and DNN accuracy results using circuit-level benchmark simulator NeuroSim.
引用
收藏
页码:54 / 63
页数:10
相关论文
共 50 条
  • [1] High-Throughput In-Memory Computing for Binary Deep Neural Networks With Monolithically Integrated RRAM and 90-nm CMOS
    Yin, Shihui
    Sun, Xiaoyu
    Yu, Shimeng
    Seo, Jae-Sun
    [J]. IEEE TRANSACTIONS ON ELECTRON DEVICES, 2020, 67 (10) : 4185 - 4192
  • [2] RRAM-Based In-Memory Computing for Embedded Deep Neural Networks
    Bankman, D.
    Messner, J.
    Gural, A.
    Murmann, B.
    [J]. CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019, : 1511 - 1515
  • [3] Deep learning acceleration based on in-memory computing
    Eleftheriou, E.
    Le Gallo, M.
    Nandakumar, S. R.
    Piveteau, C.
    Boybat, I
    Joshi, V
    Khaddam-Aljameh, R.
    Dazzi, M.
    Giannopoulos, I
    Karunaratne, G.
    Kersting, B.
    Stanisavljevic, M.
    Jonnalagadda, V. P.
    Ioannou, N.
    Kourtis, K.
    Francese, P. A.
    Sebastian, A.
    [J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2019, 63 (06)
  • [4] RRAM-based Analog In-Memory Computing
    Chen, Xiaoming
    Song, Tao
    Han, Yinhe
    [J]. 2021 IEEE/ACM INTERNATIONAL SYMPOSIUM ON NANOSCALE ARCHITECTURES (NANOARCH), 2021,
  • [5] PRIVE: Efficient RRAM Programming with Chip Verification for RRAM-based In-Memory Computing Acceleration
    He, Wangxin
    Meng, Jian
    Gonugondla, Sujan Kumar
    Yu, Shimeng
    Shanbhag, Naresh R.
    Seo, Jae-sun
    [J]. 2023 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2023,
  • [6] Structured Pruning of RRAM Crossbars for Efficient In-Memory Computing Acceleration of Deep Neural Networks
    Meng, Jian
    Yang, Li
    Peng, Xiaochen
    Yu, Shimeng
    Fan, Deliang
    Seo, Jae-Sun
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2021, 68 (05) : 1576 - 1580
  • [7] Sparse and Robust RRAM-based Efficient In-memory Computing for DNN Inference
    Meng, Jian
    Yeo, Injune
    Yang, Li
    Fan, Deliang
    Seo, Jae-sun
    Yu, Shimeng
    Shim, Wonbo
    [J]. 2022 IEEE INTERNATIONAL RELIABILITY PHYSICS SYMPOSIUM (IRPS), 2022,
  • [8] Sparse and Robust RRAM-based Efficient In-memory Computing for DNN Inference
    Meng, Jian
    Yeo, Injune
    Shim, Wonbo
    Yang, Li
    Fan, Deliang
    Yu, Shimeng
    Seo, Jae-Sun
    [J]. 2022 IEEE INTERNATIONAL RELIABILITY PHYSICS SYMPOSIUM (IRPS), 2022,
  • [9] 2-Bit-Per-Cell RRAM-Based In-Memory Computing for Area-/Energy-Efficient Deep Learning
    He, Wangxin
    Yin, Shihui
    Kim, Yulhwa
    Sun, Xiaoyu
    Kim, Jae-Joon
    Yu, Shimeng
    Seo, Jae-Sun
    [J]. IEEE SOLID-STATE CIRCUITS LETTERS, 2020, 3 : 194 - 197
  • [10] Logic Synthesis for RRAM-Based In-Memory Computing
    Shirinzadeh, Saeideh
    Soeken, Mathias
    Gaillardon, Pierre-Emmanuel
    Drechsler, Rolf
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2018, 37 (07) : 1422 - 1435