Deep Learning Acceleration using Digital-based Processing In-Memory

Cited by: 1

Authors:

Imani, Mohsen ^{[1]}

Gupta, Saransh ^{[3]}

Kim, Yeseong ^{[2]}

Rosing, Tajana ^{[3]}

Affiliations:

[1] UC Irvine, Dept Comp Sci, Irvine, CA 92697 USA

[2] DGIST, Dept Informat & Commun Engn, Daegu, South Korea

[3] Univ Calif San Diego, Dept Comp Sci & Engn, San Diego, CA USA

Source:

2020 IEEE 33RD INTERNATIONAL SYSTEM-ON-CHIP CONFERENCE (SOCC) | 2020

Keywords:

NEURAL-NETWORK;

DOI:

10.1109/SOCC49529.2020.9524776

Chinese Library Classification number:

TM [Electrical Engineering]; TN [Electronics and Communication Technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

Processing In-Memory (PIM) has shown great potential to accelerate inference tasks of Convolutional Neural Networks (CNNs). However, existing PIM architectures do not support high-precision computation, e.g., floating-point precision, which is essential for training accurate CNN models. In addition, most existing PIM approaches require analog/mixed-signal circuits, which do not scale, and exploit insufficiently reliable multi-bit Non-Volatile Memory (NVM). In this paper, we propose FloatPIM, a fully digital, scalable PIM architecture that accelerates CNNs in both training and testing phases. FloatPIM natively supports floating-point representation, thus enabling accurate CNN training. FloatPIM also enables fast communication between neighboring memory blocks to reduce internal data movement of the PIM architecture. We break the CNN computation into computing and data transfer modes. In computing mode, all blocks process a part of CNN training/testing in parallel, while in data transfer mode FloatPIM enables fast, row-parallel communication between neighboring blocks. Our evaluation shows that FloatPIM training is on average 303.2x faster and 48.6x more energy efficient than a GTX 1080 GPU, and 4.3x faster and 15.8x more energy efficient than the PipeLayer [1] PIM accelerator.


Pages: 123-128

Page count: 6

Related records: 50 in total

[1] DUAL: Acceleration of Clustering Algorithms using Digital-based Processing In-Memory
Imani, Mohsen
Pampana, Saikishan
Gupta, Saransh
Zhou, Minxuan
Kim, Yeseong
Rosing, Tajana
[J]. 2020 53RD ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO 2020), 2020, : 356 - 371
[2] DigitalPIM: Digital-based Processing In-Memory for Big Data Acceleration
Imani, Mohsen
Gupta, Saransh
Kim, Yeseong
Zhou, Minxuan
Rosing, Tajana
[J]. GLSVLSI '19 - PROCEEDINGS OF THE 2019 ON GREAT LAKES SYMPOSIUM ON VLSI, 2019, : 429 - 434
[3] Deep learning acceleration based on in-memory computing
Eleftheriou, E.
Le Gallo, M.
Nandakumar, S. R.
Piveteau, C.
Boybat, I
Joshi, V
Khaddam-Aljameh, R.
Dazzi, M.
Giannopoulos, I
Karunaratne, G.
Kersting, B.
Stanisavljevic, M.
Jonnalagadda, V. P.
Ioannou, N.
Kourtis, K.
Francese, P. A.
Sebastian, A.
[J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2019, 63 (06)
[4] Digital-based Processing In-Memory: A Highly-Parallel Accelerator for Data Intensive Applications
Imani, Mohsen
Gupta, Saransh
Rosing, Tajana
[J]. MEMSYS 2019: PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON MEMORY SYSTEMS, 2019, : 38 - 40
[5] ALPINE: Analog In-Memory Acceleration With Tight Processor Integration for Deep Learning
Klein, Joshua
Boybat, Irem
Qureshi, Yasir Mahmood
Dazzi, Martino
Levisse, Alexandre
Ansaloni, Giovanni
Zapater, Marina
Sebastian, Abu
Atienza, David
[J]. IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (07) : 1985 - 1998
[6] Digital In-Memory Computing to Accelerate Deep Learning Inference on the Edge
Perri, Stefania
Zambelli, Cristian
Ielmini, Daniele
Silvano, Cristina
[J]. 2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, IPDPSW 2024, 2024, : 130 - 133
[7] Acceleration of HadoopMapReduce using in-memory Computing
Seelam, Siva Kumar
Pattabiraman, V
[J]. PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ADVANCED COMPUTING (ICRTAC-CPS 2018), 2018, : 91 - 96
[8] NNPIM: A Processing In-Memory Architecture for Neural Network Acceleration
Gupta, Saransh
Imani, Mohsen
Kaur, Harveen
Rosing, Tajana Simunic
[J]. IEEE TRANSACTIONS ON COMPUTERS, 2019, 68 (09) : 1325 - 1337
[9] In-Memory Computing for Machine Learning and Deep Learning
Lepri, N.
Glukhov, A.
Cattaneo, L.
Farronato, M.
Mannocci, P.
Ielmini, D.
[J]. IEEE JOURNAL OF THE ELECTRON DEVICES SOCIETY, 2023, 11 : 587 - 601
[10] Optimizing for In-Memory Deep Learning With Emerging Memory Technology
Wang, Zhehui
Luo, Tao
Goh, Rick Siow Mong
Zhang, Wei
Wong, Weng-Fai
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 15
