Compile Time Modeling of Off-Chip Memory Bandwidth for Parallel Loops

被引:0
|
作者
Tolubaeva, Munara [1 ]
Yan, Yonghong [1 ]
Chapman, Barbara [1 ]
机构
[1] Univ Houston, Dept Comp Sci, Houston, TX 77204 USA
关键词
Off-chip memory bandwidth; Performance modeling; Parallel loops; Contentions;
D O I
10.1007/978-3-319-09967-5_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a statistical model to predict the off-chip memory bandwidth required by a parallel loop during its execution. It is a compile-time modeling technique that derives the correlations between memory bandwidth requirement and data access patterns of multithreaded applications. This model could be used by the compiler and performance tools to predict when the sustainable memory bandwidth of the system will be reached by the application during execution, and to determine an optimal number of threads that should be configured to execute a specific parallel loop according to its memory reference patterns. Awareness of the performance impact of oversubscribed memory bandwidth can also help programmers to take into account the additional latency caused by the contention, and to minimize the overhead by tuning the memory access behavior of applications. We evaluated this model in terms of both technical accuracy and prediction accuracy by comparing the modeling results with the measured results. The evaluation demonstrates its accuracy in both system bandwidth modeling and application bandwidth modeling.
引用
收藏
页码:292 / 306
页数:15
相关论文
共 50 条
  • [21] Practical Off-chip Meta-data for Temporal Memory Streaming
    Wenisch, Thomas F.
    Ferdman, Michael
    Ailamaki, Anastasia
    Falsafi, Babak
    Moshovos, Andreas
    HPCA-15 2009: FIFTEENTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 2009, : 79 - +
  • [22] Improving the Off-chip Bandwidth Utilization of Chip Multiprocessors (CMP) Using Early Write-Back
    Al-Tarawneh, Mutaz
    Botros, Nazeih
    WMSCI 2010: 14TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL I, 2010, : 130 - 135
  • [23] IVEC: Off-Chip Memory Integrity Protection for Both Security and Reliability
    Huang, Ruirui
    Suh, G. Edward
    ISCA 2010: THE 37TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, 2010, : 395 - 406
  • [24] EGCN: An Efficient GCN Accelerator for Minimizing Off-Chip Memory Access
    Han, Yunki
    Park, Kangkyu
    Jung, Youngbeom
    Kim, Lee-Sup
    IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (12) : 3127 - 3139
  • [25] A high security and efficiency protection of confidentiality and integrity for off-chip memory
    Yang Su
    Jun-Wei Shen
    Min-Qing Zhang
    Journal of Ambient Intelligence and Humanized Computing, 2019, 10 : 2633 - 2643
  • [26] Analysis of the Effect of Off-chip Memory Access on the Performance of an NPU System
    Lee, Keonjoo
    Kang, Donghyun
    Kang, Duseok
    Ha, Soonhoi
    PROCEEDINGS OF THE TWENTY THIRD INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2022), 2022, : 13 - 18
  • [27] Optimizing Off-Chip Memory Access for Deep Neural Network Accelerator
    Zheng, Yong
    Yang, Haigang
    Shu, Yi
    Jia, Yiping
    Huang, Zhihong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (04) : 2316 - 2320
  • [28] SmartShuttle: Optimizing Off-Chip Memory Accesses for Deep Learning Accelerators
    Li, Jiajun
    Yan, Guihai
    Lu, Wenyan
    Jiang, Shuhao
    Gong, Shijun
    Wu, Jingya
    Li, Xiaowei
    PROCEEDINGS OF THE 2018 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2018, : 343 - 348
  • [29] Centaur: Hybrid Processing in On/Off-chip Memory Architecture for Graph Analytics
    Addisie, Abraham
    Bertacco, Valeria
    PROCEEDINGS OF THE 2020 57TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2020,
  • [30] Dynamic Power Management of Off-Chip Links for Hybrid Memory Cubes
    Ahn, Junwhan
    Yoo, Sungjoo
    Choi, Kiyoung
    2014 51ST ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2014,