Architectural Design of 3D NAND Flash based Compute-in-Memory for Inference Engine

被引:5
|
作者
Shim, Wonbo [1 ]
Jiang, Hongwu [1 ]
Peng, Xiaochen [1 ]
Yu, Shimeng [1 ]
机构
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
来源
PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON MEMORY SYSTEMS, MEMSYS 2020 | 2020年
关键词
Deep neural network; hardware accelerator; compute-in-memory; 3D NAND Flash; MACRO; CMOS;
D O I
10.1145/3422575.3422779
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
3D NAND Flash memory has been proposed as an attractive candidate of inference engine for deep neural network (DNN) owing to its ultra-high density and commercially matured fabrication technology. However, the peripheral circuits require to be modified to enable compute-in-memory (CIM) and the chip architectures need to be redesigned for an optimized dataflow. In this work, we present a design of 3D NAND-CIM accelerator based on the macro parameters from an industry-grade prototype chip. The DNN inference performance is evaluated using the DNN+ NeuroSim framework. To exploit the ultra-high density of 3D NAND Flash, both inputs and weights duplication strategies are introduced to improve the throughput. The benchmarking on a variety of VGG and ResNet networks was performed across technological candidates for CIM including SRAM, RRAM and 3D NAND. Compared to similar designs with SRAM or RRAM, the result shows that 3D NAND based CIM design can achieve not only 17-24% chip size but also 1.92.7 times more competitive energy efficiency for 8-bit precision inference.
引用
收藏
页码:77 / 85
页数:9
相关论文
共 50 条
  • [1] System-Technology Codesign of 3-D NAND Flash-Based Compute-in-Memory Inference Engine
    Shim, Wonbo
    Yu, Shimeng
    IEEE JOURNAL ON EXPLORATORY SOLID-STATE COMPUTATIONAL DEVICES AND CIRCUITS, 2021, 7 (01): : 61 - 69
  • [2] A Compute-in-Memory Architecture Compatible with 3D NAND Flash that Parallelly Activates Multi-Layers
    Zhao, Liang
    Yan, Chu
    Yang, Fan
    Gao, Shifan
    Rosca, Gabriel
    Manea, Dan
    Lu, Zhichao
    Zhao, Yi
    2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 193 - 198
  • [3] Technological Design of 3D NAND-Based Compute-in-Memory Architecture for GB-Scale Deep Neural Network
    Shim, Wonbo
    Yu, Shimeng
    IEEE ELECTRON DEVICE LETTERS, 2021, 42 (02) : 160 - 163
  • [4] An Embedded NAND Flash-Based Compute-In-Memory Array Demonstrated in a Standard Logic Process
    Kim, Minsu
    Liu, Muqing
    Everson, Luke R.
    Kim, Chris H.
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2022, 57 (02) : 625 - 638
  • [5] Channel Modeling and Quantization Design for 3D NAND Flash Memory
    Wang, Cheng
    Mei, Zhen
    Li, Jun
    Shu, Feng
    He, Xuan
    Kong, Lingjun
    ENTROPY, 2023, 25 (07)
  • [6] Vertical 3D NAND Flash Memory Technology
    Nitayama, Akihiro
    Aochi, Hideaki
    ULSI PROCESS INTEGRATION 7, 2011, 41 (07): : 15 - 25
  • [7] Error Generation for 3D NAND Flash Memory
    Liu, Weihua
    Wu, Fei
    Meng, Songmiao
    Chen, Xiang
    Xie, Changsheng
    PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 56 - 59
  • [8] Monolithic 3D Compute-in-Memory Accelerator with BEOL Transistor based Reconfigurable Interconnect
    Luo, Yandong
    Dutta, Sourav
    Kaul, Ankit
    Lim, Sung-kyu
    Bakir, Muhannad
    Datta, Suman
    Yu, Shimeng
    2021 IEEE INTERNATIONAL ELECTRON DEVICES MEETING (IEDM), 2021,
  • [9] Compute-in-Memory: From Device Innovation to 3D System Integration
    Yu, Shimeng
    Shim, Wonbo
    Hur, Jae
    Luo, Yuan-chun
    Choe, Gihun
    Li, Wantong
    Lu, Anni
    Peng, Xiaochen
    IEEE 51ST EUROPEAN SOLID-STATE DEVICE RESEARCH CONFERENCE (ESSDERC 2021), 2021, : 21 - 28
  • [10] 3D RRAM DESIGN AND BENCHMARK WITH 3D NAND FLASH
    Chen, Pai-Yu
    Xu, Cong
    Xie, Yuan
    Yu, Shimeng
    2014 12TH IEEE INTERNATIONAL CONFERENCE ON SOLID-STATE AND INTEGRATED CIRCUIT TECHNOLOGY (ICSICT), 2014,