Efficient Management of Scratch-Pad Memories in Deep Learning Accelerators

被引：1

作者：

Pal, Subhankar ^{[1
]}

Venkataramani, Swagath ^{[2
]}

Srinivasan, Viji ^{[2
]}

Gopalakrishnan, Kailash ^{[2
]}

机构：

[1] Univ Michigan, Ann Arbor, MI 48109 USA

[2] IBM TJ Watson Res Ctr, Yorktown Hts, NY USA

来源：

2021 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS 2021) | 2021年

关键词：

PERFORMANCE;

D O I：

10.1109/ISPASS51385.2021.00046

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

A prevalent challenge for Deep Learning (DL) accelerators is how they are programmed to sustain utilization without impacting end-user productivity. Little prior effort has been devoted to the effective management of their on-chip Scratch-Pad Memory (SPM) across the DL operations of a Deep Neural Network (DNN). This is especially critical due to trends in complex network topologies and the emergence of eager execution. This work demonstrates that there exists up to a 5.2x performance gap in DL inference to be bridged using SPM management, on a set of image, object and language networks. We propose OnSRAM, a novel SPM management framework integrated with a DL accelerator runtime. OnSRAM has two variants, viz. OnSRAM-Static, which works on static graphs to identify data structures that should be held on-chip based on their properties, and OnSRAM-Eager, which targets an eager execution model (no graph) and uses a speculative scheme to hold/discard data structures. On a prototypical DL accelerator, OnSRAM-Static and OnSRAM-Eager achieve reductions in inference latency (batch size of 1) of 1.02-4.8x and 1.02-3.1x, respectively, over a baseline with no SPM management.

引用

页码：240 / 242

页数：3

共 50 条

[1] MAGNETIC FILM SCRATCH-PAD MEMORIES
POHM, AV
IEEE TRANSACTIONS ON ELECTRONIC COMPUTERS, 1966, EC15 (04): : 452 - &
[2] Architecture Extensions for Efficient Management of Scratch-Pad Memory
Busquets-Mataix, Jose V.
Catala, Carlos
Marti-Campoy, Antonio
INTEGRATED CIRCUIT AND SYSTEM DESIGN: POWER AND TIMING MODELING, OPTIMIZATION, AND SIMULATION, 2011, 6951 : 43 - 52
[3] Efficient utilization of scratch-pad memory banks
State Key Laboratory of Microwave and Digital Communication, Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
Qinghua Daxue Xuebao, 2006, 1 (31-34):
[4] Dataflow analysis for energy-efficient scratch-pad memory management
Chen, GY
Kandemir, M
ISLPED '05: PROCEEDINGS OF THE 2005 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, 2005, : 327 - 330
[5] Dynamic management of scratch-pad memory space
Kandemir, M
Ramanujam, J
Irwin, MJ
Vijaykrishnan, N
Kadayif, I
Parikh, A
38TH DESIGN AUTOMATION CONFERENCE PROCEEDINGS 2001, 2001, : 690 - 695
[6] Optimal Data Placement for Memory Architectures with Scratch-Pad Memories
Guo, Yibo
Zhuge, Qingfeng
Hu, Jingtong
Sha, Edwin H. -M.
TRUSTCOM 2011: 2011 INTERNATIONAL JOINT CONFERENCE OF IEEE TRUSTCOM-11/IEEE ICESS-11/FCST-11, 2011, : 1045 - 1050
[7] Shared scratch-pad memory space management
Ozturk, Ozcan
Kandemir, Mahmut
Kolcu, Ibrahim
ISQED 2006: PROCEEDINGS OF THE 7TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN, 2006, : 576 - +
[8] DRDU: A data reuse analysis technique for efficient scratch-pad memory management
Issenin, Ilya
Brockmeyer, Erik
Miranda, Miguel
Dutt, Nikil
ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2007, 12 (02)
[9] Efficient Dynamic Heap Allocation of Scratch-Pad Memory
Mcllroy, Ross
Dickman, Peter
Sventek, Joe
ISMM'08: PROCEEDINGS OF THE 2008 INTERNATIONAL SYMPOSIUM ON MEMORY MANAGEMENT, 2008, : 31 - +
[10] Efficient Utilization of Scratch-Pad Memory for Embedded Systems
Hu, Wei
Chen, Tianzhou
Shi, Qingsong
Sha, Feng
2009 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS (PERCOM), VOLS 1 AND 2, 2009, : 442 - 447

← 1 2 3 4 5 →