Optimization of DRAM based PIM Architecture for Energy-Efficient Deep Neural Network Training

Cited by: 3
Authors
Sudarshan, Chirag [1]
Sadi, Mohammad Hassani [1]
Weis, Christian [1]
Wehn, Norbert [1]
Affiliations
[1] Tech Univ Kaiserslautern, Microelect Syst Design Res Grp, Kaiserslautern, Germany
Keywords
Processing-in-Memory; PIM; Compute-in-Memory; DRAM; DNN training; Memory
DOI
10.1109/ISCAS48785.2022.9937832
CLC Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline Codes
0808; 0809
Abstract
Deep Neural Network (DNN) training consumes a large amount of energy. At the same time, DNNs deployed on edge devices demand very high energy efficiency. In this context, Processing-in-Memory (PIM) is an emerging compute paradigm that bridges the memory-computation gap to improve energy efficiency. DRAM is one memory type employed for designing energy-efficient PIM architectures for DNN training. A major issue of DRAM-PIM architectures designed for DNN training is the high number of internal data accesses within a bank between the memory arrays and the PIM computation units (e.g., 51% more than for inference). In state-of-the-art DRAM-PIM architectures, these internal data accesses consume far more energy than the computation units. Hence, reducing the internal data access energy within the DRAM bank is essential to further improve the energy efficiency of DRAM-PIM architectures. We present three novel optimizations that together reduce the internal data access energy by up to 81.54%. Our first optimization modifies the bank data access circuit to enable partial accesses of data instead of the conventional fixed-granularity accesses, thereby exploiting the sparsity available during training. The second optimization adds a dedicated low-energy region within the DRAM bank with a lower capacitive load on the global wires and shorter data movement distances. Finally, we propose a 12-bit high-dynamic-range floating-point format called TinyFloat that reduces the total data access energy by 20% compared to IEEE 754 half and single precision.
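The abstract does not specify TinyFloat's exponent/mantissa split. The Python sketch below illustrates the idea under an assumed (hypothetical) 1-bit sign, 6-bit exponent, 5-bit mantissa layout: one exponent bit more than IEEE 754 half precision (1-5-10) widens the dynamic range at the cost of mantissa precision. The bit split and the helper names are illustrative assumptions, not the paper's definition.

import math

# Hypothetical 1-6-5 sign/exponent/mantissa split -- the real TinyFloat
# layout is defined in the paper; 6 exponent bits (vs. 5 in FP16) are
# assumed here only to illustrate the "high dynamic range" property.
EXP_BITS, MANT_BITS = 6, 5
BIAS = (1 << (EXP_BITS - 1)) - 1          # 31, IEEE-754-style exponent bias

def tinyfloat_encode(x: float) -> int:
    """Round x to the nearest 12-bit code (no NaN/Inf/subnormal handling)."""
    if x == 0.0:
        return 0
    sign = 1 if x < 0 else 0
    m, e = math.frexp(abs(x))             # abs(x) = m * 2**e with 0.5 <= m < 1
    mant = round((2 * m - 1) * (1 << MANT_BITS))   # fraction after implicit 1
    exp = e - 1 + BIAS
    if mant == (1 << MANT_BITS):          # rounding carried into the exponent
        mant, exp = 0, exp + 1
    exp = max(0, min((1 << EXP_BITS) - 1, exp))    # clamp instead of Inf
    return (sign << (EXP_BITS + MANT_BITS)) | (exp << MANT_BITS) | mant

def tinyfloat_decode(code: int) -> float:
    """Expand a 12-bit code back to a Python float."""
    if code == 0:
        return 0.0
    sign = -1.0 if code >> (EXP_BITS + MANT_BITS) else 1.0
    exp = (code >> MANT_BITS) & ((1 << EXP_BITS) - 1)
    mant = code & ((1 << MANT_BITS) - 1)
    return sign * (1 + mant / (1 << MANT_BITS)) * 2.0 ** (exp - BIAS)

print(tinyfloat_decode(tinyfloat_encode(0.3141)))  # ~0.3125, within 1 mantissa ULP

Note that the 20% data-access energy saving quoted in the abstract comes from moving 12 bits per operand across the bank's internal datapath instead of 16 or 32; the sketch above only models the numerics, not the access circuit.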
Pages: 1472-1476
Number of pages: 5