Analysis of the Effect of Off-chip Memory Access on the Performance of an NPU System

被引:2
|
作者
Lee, Keonjoo [1 ]
Kang, Donghyun [1 ]
Kang, Duseok [1 ]
Ha, Soonhoi [1 ]
机构
[1] Seoul Natl Univ, 1 Gwanak Ro, Seoul, South Korea
关键词
Off-chip memory access; NPU performance;
D O I
10.1109/ISQED54688.2022.9806203
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Numerous CNN accelerators, called neural processing units (NPUs), have been proposed and developed recently to accelerate CNN computation with a customized chip. To minimize the DRAM access volume, NPUs commonly have a large on-chip memory and try to reuse the fetched data from the off-chip DRAM maximally While extensive researches have been conducted to minimize the effect of off-chip DRAM access on the performance in the NPU design, little attention is paid to the detailed analysis of the off-chip DRAM access overhead on the NPU performance. In this paper, we analyze the effects of off-chip DRAM access latency on the NPU performance and how the off-chip SRAM changes the DRAM access latency based on a cycle-accurate system simulation environment.
引用
收藏
页码:13 / 18
页数:6
相关论文
共 50 条
  • [1] Exploring Wireless Technology for Off-Chip Memory Access
    Sikder, Md Ashif I.
    DiTomaso, Dominic
    Kodi, Avinash
    Kaya, Savas
    Rayess, William
    Matolak, David
    2016 IEEE 24TH ANNUAL SYMPOSIUM ON HIGH-PERFORMANCE INTERCONNECTS (HOTI), 2016, : 92 - 99
  • [2] Minimizing Off-Chip Memory Access for CNN Accelerators
    Tewari, Saurabh
    Kumar, Anshul
    Paul, Kolin
    IEEE CONSUMER ELECTRONICS MAGAZINE, 2022, 11 (03) : 95 - 104
  • [3] Understanding How Off-Chip Memory Bandwidth Partitioning in Chip Multiprocessors Affects System Performance
    Liu, Fang
    Jiang, Xiaowei
    Solihin, Yan
    HPCA-16 2010: SIXTEENTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 2010, : 57 - 68
  • [4] EGCN: An Efficient GCN Accelerator for Minimizing Off-Chip Memory Access
    Han, Yunki
    Park, Kangkyu
    Jung, Youngbeom
    Kim, Lee-Sup
    IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (12) : 3127 - 3139
  • [5] Optimizing Off-Chip Memory Access for Deep Neural Network Accelerator
    Zheng, Yong
    Yang, Haigang
    Shu, Yi
    Jia, Yiping
    Huang, Zhihong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (04) : 2316 - 2320
  • [6] An Analytical Performance Model for Partitioning Off-Chip Memory Bandwidth
    Wang, Ruisheng
    Chen, Lizhong
    Pinkston, Timothy Mark
    IEEE 27TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2013), 2013, : 165 - 176
  • [7] Exploiting off-chip memory access modes in high-level synthesis
    Panda, PR
    Dutt, ND
    Nicolau, A
    1997 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN - DIGEST OF TECHNICAL PAPERS, 1997, : 333 - 340
  • [8] Design and implementation of a novel off-chip memory access path for graph computing
    Zhang X.
    Chang Y.
    Zhang K.
    Chen M.
    Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2020, 42 (02): : 13 - 22
  • [9] Bus Width Aware Off-Chip Memory Access Minimization for CNN Accelerators
    Tewari, Saurabh
    Kumar, Anshul
    Paul, Kolin
    2020 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2020), 2020, : 240 - 245
  • [10] OSM: Off-Chip Shared Memory for GPUs
    Darabi, Sina
    Yousefzadeh-Asl-Miandoab, Ehsan
    Akbarzadeh, Negar
    Falahati, Hajar
    Lotfi-Kamran, Pejman
    Sadrosadati, Mohammad
    Sarbazi-Azad, Hamid
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (12) : 3415 - 3429