DRAM-Based Processor for Deep Neural Networks Without SRAM Cache

被引:0
|
作者
Tam, Eugene [1 ]
Jiang, Shenfei [1 ]
Duan, Paul [1 ]
Meng, Shawn [1 ]
Pan, Yue [1 ]
Huang, Cayden [1 ]
Han, Yi [1 ]
Xie, Jacke [1 ]
Cui, Yuanjun [1 ]
Yu, Jinsong [1 ]
Lu, Minggui [1 ]
机构
[1] IC League Inc, Haining, Peoples R China
来源
关键词
Neural network; Artificial intelligence; Processor; Deep learning;
D O I
10.1007/978-3-030-80126-7_52
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Modern computing architectures use cache memory as the buffer between high speed computing units and low latency main memory. Higher capacity caches are thought to be critical for deep neural network processors, which handle large amounts of data. However, as cache memory capacity increases, it occupies large die area that can otherwise be used for computing units. This is the inherent trade off between memory capacity and performance. In this work, we present a deep neural network processing chip, with a near-memory computing architecture. We eliminate the SRAM cache and use DRAM only as on-chip memory, delivering high performance and high memory capacity.
引用
收藏
页码:743 / 753
页数:11
相关论文
共 50 条
  • [21] A Unified Programmable Edge Matrix Processor for Deep Neural Networks and Matrix Algebra
    George, Biji
    Omer, Om Ji
    Choudhury, Ziaul
    Anoop, V
    Subramoney, Sreenivas
    [J]. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2022, 21 (05)
  • [22] Similarity based Deep Neural Networks
    Lee, Seungyeon
    Jo, Eunji
    Hwang, Sangheum
    Jung, Gyeong Bok
    Kim, Dohyun
    [J]. INTERNATIONAL JOURNAL OF FUZZY LOGIC AND INTELLIGENT SYSTEMS, 2021, 21 (03) : 205 - 212
  • [23] An FPGA-Based Processor for Training Convolutional Neural Networks
    Liu, Zhiqiang
    Dou, Yong
    Jiang, Jingfei
    Wang, Qiang
    Chow, Paul
    [J]. 2017 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (ICFPT), 2017, : 207 - 210
  • [24] Memory_based processor array for artificial neural networks
    Kim, Y
    Noh, MJ
    Han, TD
    Kim, SD
    Yang, SB
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, 1997, : 969 - 974
  • [25] An Approximate DRAM Design with an Adjustable Refresh Scheme for Low-power Deep Neural Networks
    Duy Thanh Nguyen
    Kim, Hyun
    Lee, Hyuk-Jae
    [J]. JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE, 2021, 21 (02) : 134 - 142
  • [26] Deep Neural Networks Optimization Based On Deconvolutional Networks
    Liu, Zhoufeng
    Zhang, Chi
    Li, Chunlei
    Ding, Shumin
    Liu, Shanliang
    Dong, Yan
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON GRAPHICS AND SIGNAL PROCESSING (ICGSP 2018), 2018, : 7 - 11
  • [27] Estimating neural networks-based algorithm for adaptive cache replacement
    Obaidat, MS
    Khalid, H
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1998, 28 (04): : 602 - 611
  • [28] The Research of Efficient Dual-Port SRAM Data Exchange without Waiting with FIFO-Based Cache
    Qianqian, Alfred Ji
    Zhao Ping
    Cheng Sen
    Tan Jingjing
    Wei Xu
    Wei Yong
    [J]. WEB INFORMATION SYSTEMS AND MINING, 2010, 6318 : 312 - +
  • [29] MACC-SRAM: A Multistep Accumulation Capacitor-Coupling In-Memory Computing SRAM Macro for Deep Convolutional Neural Networks
    Zhang, Bo
    Saikia, Jyotishman
    Meng, Jian
    Wang, Dewei
    Kwon, Soonwan
    Myung, Sungmeen
    Kim, Hyunsoo
    Kim, Sang Joon
    Seo, Jae-Sun
    Seok, Mingoo
    [J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2024, 59 (06) : 1938 - 1949
  • [30] Prediction based Execution on Deep Neural Networks
    Song, Mingcong
    Zhao, Jiechen
    Hu, Yang
    Zhang, Jiaqi
    Li, Tao
    [J]. 2018 ACM/IEEE 45TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2018, : 752 - 763