Compile Time Modeling of Off-Chip Memory Bandwidth for Parallel Loops

被引:0
|
作者
Tolubaeva, Munara [1 ]
Yan, Yonghong [1 ]
Chapman, Barbara [1 ]
机构
[1] Univ Houston, Dept Comp Sci, Houston, TX 77204 USA
关键词
Off-chip memory bandwidth; Performance modeling; Parallel loops; Contentions;
D O I
10.1007/978-3-319-09967-5_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a statistical model to predict the off-chip memory bandwidth required by a parallel loop during its execution. It is a compile-time modeling technique that derives the correlations between memory bandwidth requirement and data access patterns of multithreaded applications. This model could be used by the compiler and performance tools to predict when the sustainable memory bandwidth of the system will be reached by the application during execution, and to determine an optimal number of threads that should be configured to execute a specific parallel loop according to its memory reference patterns. Awareness of the performance impact of oversubscribed memory bandwidth can also help programmers to take into account the additional latency caused by the contention, and to minimize the overhead by tuning the memory access behavior of applications. We evaluated this model in terms of both technical accuracy and prediction accuracy by comparing the modeling results with the measured results. The evaluation demonstrates its accuracy in both system bandwidth modeling and application bandwidth modeling.
引用
收藏
页码:292 / 306
页数:15
相关论文
共 50 条
  • [31] A high security and efficiency protection of confidentiality and integrity for off-chip memory
    Su, Yang
    Shen, Jun-Wei
    Zhang, Min-Qing
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2019, 10 (07) : 2633 - 2643
  • [32] On the Off-chip Memory Latency of Real-Time Systems: Is DDR DRAM Really the Best Option?
    Hassan, Mohamed
    2018 39TH IEEE REAL-TIME SYSTEMS SYMPOSIUM (RTSS 2018), 2018, : 495 - 505
  • [33] Effects and modeling of simultaneous switching noise for BiCMOS off-chip drivers
    Secker, DA
    Prince, JL
    IEEE TRANSACTIONS ON COMPONENTS PACKAGING AND MANUFACTURING TECHNOLOGY PART B-ADVANCED PACKAGING, 1996, 19 (03): : 473 - 480
  • [34] Bandwidth-aware Last-level Caching: Efficiently Coordinating Off-chip Read and Write Bandwidth
    Chaudhuri, Mainak
    Gaur, Jayesh
    Subramoney, Sreenivas
    2019 IEEE 37TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2019), 2019, : 109 - 118
  • [35] Effects and modeling of simultaneous switching noise for BiCMOS off-chip drivers
    Rambus Inc, Mountain View, United States
    IEEE Trans Compon Packag Manuf Technol Part B Adv Packag, 3 (473-480):
  • [36] Exploiting off-chip memory access modes in high-level synthesis
    Panda, PR
    Dutt, ND
    Nicolau, A
    1997 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN - DIGEST OF TECHNICAL PAPERS, 1997, : 333 - 340
  • [37] COMPILE-TIME PARTITIONING OF ITERATIVE PARALLEL LOOPS TO REDUCE CACHE COHERENCY TRAFFIC
    ABRAHAM, SG
    HUDAK, DE
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1991, 2 (03) : 318 - 328
  • [38] Single-ended coding techniques for off-chip interconnects to commodity memory
    Choudhury, Mihir
    Ringgenberg, Kyle
    Rixner, Scott
    Mohanram, Kartik
    2007 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, VOLS 1-3, 2007, : 1072 - 1077
  • [39] PageVault: Securing Off-Chip Memory using Page-Based Authentication
    Tine, Blaise-Pascal
    Yalamanchili, Sudhakar
    MEMSYS 2017: PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON MEMORY SYSTEMS, 2017, : 293 - 304
  • [40] Design and implementation of a novel off-chip memory access path for graph computing
    Zhang X.
    Chang Y.
    Zhang K.
    Chen M.
    Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2020, 42 (02): : 13 - 22