Efficient Management of Scratch-Pad Memories in Deep Learning Accelerators

被引:1
|
作者
Pal, Subhankar [1 ]
Venkataramani, Swagath [2 ]
Srinivasan, Viji [2 ]
Gopalakrishnan, Kailash [2 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] IBM TJ Watson Res Ctr, Yorktown Hts, NY USA
关键词
PERFORMANCE;
D O I
10.1109/ISPASS51385.2021.00046
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A prevalent challenge for Deep Learning (DL) accelerators is how they are programmed to sustain utilization without impacting end-user productivity. Little prior effort has been devoted to the effective management of their on-chip Scratch-Pad Memory (SPM) across the DL operations of a Deep Neural Network (DNN). This is especially critical due to trends in complex network topologies and the emergence of eager execution. This work demonstrates that there exists up to a 5.2x performance gap in DL inference to be bridged using SPM management, on a set of image, object and language networks. We propose OnSRAM, a novel SPM management framework integrated with a DL accelerator runtime. OnSRAM has two variants, viz. OnSRAM-Static, which works on static graphs to identify data structures that should be held on-chip based on their properties, and OnSRAM-Eager, which targets an eager execution model (no graph) and uses a speculative scheme to hold/discard data structures. On a prototypical DL accelerator, OnSRAM-Static and OnSRAM-Eager achieve reductions in inference latency (batch size of 1) of 1.02-4.8x and 1.02-3.1x, respectively, over a baseline with no SPM management.
引用
收藏
页码:240 / 242
页数:3
相关论文
共 50 条
  • [1] MAGNETIC FILM SCRATCH-PAD MEMORIES
    POHM, AV
    IEEE TRANSACTIONS ON ELECTRONIC COMPUTERS, 1966, EC15 (04): : 452 - &
  • [2] Architecture Extensions for Efficient Management of Scratch-Pad Memory
    Busquets-Mataix, Jose V.
    Catala, Carlos
    Marti-Campoy, Antonio
    INTEGRATED CIRCUIT AND SYSTEM DESIGN: POWER AND TIMING MODELING, OPTIMIZATION, AND SIMULATION, 2011, 6951 : 43 - 52
  • [3] Efficient utilization of scratch-pad memory banks
    State Key Laboratory of Microwave and Digital Communication, Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
    Qinghua Daxue Xuebao, 2006, 1 (31-34):
  • [4] Dataflow analysis for energy-efficient scratch-pad memory management
    Chen, GY
    Kandemir, M
    ISLPED '05: PROCEEDINGS OF THE 2005 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, 2005, : 327 - 330
  • [5] Dynamic management of scratch-pad memory space
    Kandemir, M
    Ramanujam, J
    Irwin, MJ
    Vijaykrishnan, N
    Kadayif, I
    Parikh, A
    38TH DESIGN AUTOMATION CONFERENCE PROCEEDINGS 2001, 2001, : 690 - 695
  • [6] Optimal Data Placement for Memory Architectures with Scratch-Pad Memories
    Guo, Yibo
    Zhuge, Qingfeng
    Hu, Jingtong
    Sha, Edwin H. -M.
    TRUSTCOM 2011: 2011 INTERNATIONAL JOINT CONFERENCE OF IEEE TRUSTCOM-11/IEEE ICESS-11/FCST-11, 2011, : 1045 - 1050
  • [7] Shared scratch-pad memory space management
    Ozturk, Ozcan
    Kandemir, Mahmut
    Kolcu, Ibrahim
    ISQED 2006: PROCEEDINGS OF THE 7TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN, 2006, : 576 - +
  • [8] DRDU: A data reuse analysis technique for efficient scratch-pad memory management
    Issenin, Ilya
    Brockmeyer, Erik
    Miranda, Miguel
    Dutt, Nikil
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2007, 12 (02)
  • [9] Efficient Dynamic Heap Allocation of Scratch-Pad Memory
    Mcllroy, Ross
    Dickman, Peter
    Sventek, Joe
    ISMM'08: PROCEEDINGS OF THE 2008 INTERNATIONAL SYMPOSIUM ON MEMORY MANAGEMENT, 2008, : 31 - +
  • [10] Efficient Utilization of Scratch-Pad Memory for Embedded Systems
    Hu, Wei
    Chen, Tianzhou
    Shi, Qingsong
    Sha, Feng
    2009 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS (PERCOM), VOLS 1 AND 2, 2009, : 442 - 447