Efficient Management of Scratch-Pad Memories in Deep Learning Accelerators

Cited by: 1
Authors
Pal, Subhankar [1]
Venkataramani, Swagath [2]
Srinivasan, Viji [2]
Gopalakrishnan, Kailash [2]
Affiliations
[1] University of Michigan, Ann Arbor, MI 48109, USA
[2] IBM T. J. Watson Research Center, Yorktown Heights, NY, USA
Keywords
PERFORMANCE;
DOI
10.1109/ISPASS51385.2021.00046
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
A prevalent challenge for Deep Learning (DL) accelerators is how they are programmed to sustain utilization without impacting end-user productivity. Little prior effort has been devoted to the effective management of their on-chip Scratch-Pad Memory (SPM) across the DL operations of a Deep Neural Network (DNN). This is especially critical due to trends in complex network topologies and the emergence of eager execution. This work demonstrates that there exists up to a 5.2x performance gap in DL inference to be bridged using SPM management, on a set of image, object and language networks. We propose OnSRAM, a novel SPM management framework integrated with a DL accelerator runtime. OnSRAM has two variants, viz. OnSRAM-Static, which works on static graphs to identify data structures that should be held on-chip based on their properties, and OnSRAM-Eager, which targets an eager execution model (no graph) and uses a speculative scheme to hold/discard data structures. On a prototypical DL accelerator, OnSRAM-Static and OnSRAM-Eager achieve reductions in inference latency (batch size of 1) of 1.02-4.8x and 1.02-3.1x, respectively, over a baseline with no SPM management.
Pages: 240-242
Page count: 3