DRAM-Based Processor for Deep Neural Networks Without SRAM Cache

被引：0

作者：

Tam, Eugene ^{[1
]}

Jiang, Shenfei ^{[1
]}

Duan, Paul ^{[1
]}

Meng, Shawn ^{[1
]}

Pan, Yue ^{[1
]}

Huang, Cayden ^{[1
]}

Han, Yi ^{[1
]}

Xie, Jacke ^{[1
]}

Cui, Yuanjun ^{[1
]}

Yu, Jinsong ^{[1
]}

Lu, Minggui ^{[1
]}

机构：

[1] IC League Inc, Haining, Peoples R China

来源：

INTELLIGENT COMPUTING, VOL 2 | 2021年 / 284卷

关键词：

Neural network; Artificial intelligence; Processor; Deep learning;

D O I：

10.1007/978-3-030-80126-7_52

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Modern computing architectures use cache memory as the buffer between high speed computing units and low latency main memory. Higher capacity caches are thought to be critical for deep neural network processors, which handle large amounts of data. However, as cache memory capacity increases, it occupies large die area that can otherwise be used for computing units. This is the inherent trade off between memory capacity and performance. In this work, we present a deep neural network processing chip, with a near-memory computing architecture. We eliminate the SRAM cache and use DRAM only as on-chip memory, delivering high performance and high memory capacity.

引用

页码：743 / 753

页数：11

共 50 条

[21] A Unified Programmable Edge Matrix Processor for Deep Neural Networks and Matrix Algebra
George, Biji
Omer, Om Ji
Choudhury, Ziaul
Anoop, V
Subramoney, Sreenivas
[J]. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2022, 21 (05)
[22] Similarity based Deep Neural Networks
Lee, Seungyeon
Jo, Eunji
Hwang, Sangheum
Jung, Gyeong Bok
Kim, Dohyun
[J]. INTERNATIONAL JOURNAL OF FUZZY LOGIC AND INTELLIGENT SYSTEMS, 2021, 21 (03) : 205 - 212
[23] An FPGA-Based Processor for Training Convolutional Neural Networks
Liu, Zhiqiang
Dou, Yong
Jiang, Jingfei
Wang, Qiang
Chow, Paul
[J]. 2017 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (ICFPT), 2017, : 207 - 210
[24] Memory_based processor array for artificial neural networks
Kim, Y
Noh, MJ
Han, TD
Kim, SD
Yang, SB
[J]. 1997 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, 1997, : 969 - 974
[25] An Approximate DRAM Design with an Adjustable Refresh Scheme for Low-power Deep Neural Networks
Duy Thanh Nguyen
Kim, Hyun
Lee, Hyuk-Jae
[J]. JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE, 2021, 21 (02) : 134 - 142
[26] Deep Neural Networks Optimization Based On Deconvolutional Networks
Liu, Zhoufeng
Zhang, Chi
Li, Chunlei
Ding, Shumin
Liu, Shanliang
Dong, Yan
[J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON GRAPHICS AND SIGNAL PROCESSING (ICGSP 2018), 2018, : 7 - 11
[27] Estimating neural networks-based algorithm for adaptive cache replacement
Obaidat, MS
Khalid, H
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1998, 28 (04): : 602 - 611
[28] The Research of Efficient Dual-Port SRAM Data Exchange without Waiting with FIFO-Based Cache
Qianqian, Alfred Ji
Zhao Ping
Cheng Sen
Tan Jingjing
Wei Xu
Wei Yong
[J]. WEB INFORMATION SYSTEMS AND MINING, 2010, 6318 : 312 - +
[29] MACC-SRAM: A Multistep Accumulation Capacitor-Coupling In-Memory Computing SRAM Macro for Deep Convolutional Neural Networks
Zhang, Bo
Saikia, Jyotishman
Meng, Jian
Wang, Dewei
Kwon, Soonwan
Myung, Sungmeen
Kim, Hyunsoo
Kim, Sang Joon
Seo, Jae-Sun
Seok, Mingoo
[J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2024, 59 (06) : 1938 - 1949
[30] Prediction based Execution on Deep Neural Networks
Song, Mingcong
Zhao, Jiechen
Hu, Yang
Zhang, Jiaqi
Li, Tao
[J]. 2018 ACM/IEEE 45TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2018, : 752 - 763

← 1 2 3 4 5 →