Compile Time Modeling of Off-Chip Memory Bandwidth for Parallel Loops

被引:0
|
作者
Tolubaeva, Munara [1 ]
Yan, Yonghong [1 ]
Chapman, Barbara [1 ]
机构
[1] Univ Houston, Dept Comp Sci, Houston, TX 77204 USA
关键词
Off-chip memory bandwidth; Performance modeling; Parallel loops; Contentions;
D O I
10.1007/978-3-319-09967-5_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a statistical model to predict the off-chip memory bandwidth required by a parallel loop during its execution. It is a compile-time modeling technique that derives the correlations between memory bandwidth requirement and data access patterns of multithreaded applications. This model could be used by the compiler and performance tools to predict when the sustainable memory bandwidth of the system will be reached by the application during execution, and to determine an optimal number of threads that should be configured to execute a specific parallel loop according to its memory reference patterns. Awareness of the performance impact of oversubscribed memory bandwidth can also help programmers to take into account the additional latency caused by the contention, and to minimize the overhead by tuning the memory access behavior of applications. We evaluated this model in terms of both technical accuracy and prediction accuracy by comparing the modeling results with the measured results. The evaluation demonstrates its accuracy in both system bandwidth modeling and application bandwidth modeling.
引用
收藏
页码:292 / 306
页数:15
相关论文
共 50 条
  • [41] Low-power, transparent optical network interface for high bandwidth off-chip interconnects
    Liboiron-Ladouceur, Odile
    Wang, Howard
    Garg, Ajay S.
    Bergman, Keren
    OPTICS EXPRESS, 2009, 17 (08): : 6550 - 6561
  • [42] Bus Width Aware Off-Chip Memory Access Minimization for CNN Accelerators
    Tewari, Saurabh
    Kumar, Anshul
    Paul, Kolin
    2020 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2020), 2020, : 240 - 245
  • [43] MODELING THE MEMORY OF THE CRAY2 FOR COMPILE TIME OPTIMIZATION
    EISENBEIS, C
    JALBY, W
    LICHNEWSKY, A
    SUPERCOMPUTING /, 1989, 62 : 157 - 171
  • [44] Analysis of high-bandwidth low-power microring links for off-chip interconnects
    Ophir, Noam
    Bergman, Keren
    OPTOELECTRONIC INTEGRATED CIRCUITS XV, 2013, 8628
  • [45] Compile-time energy optimization for parallel applications in on-chip multiprocessors
    Chen, Juan
    Yi, Huizhan
    Yang, Xuejun
    Qian, Liang
    COMPUTATIONAL SCIENCE - ICCS 2006, PT 2, PROCEEDINGS, 2006, 3992 : 904 - 911
  • [46] Modeling and electrical analysis of seamless high off-chip connectivity (SHOCC) interconnects
    Afonso, S
    Schaper, LW
    Parkerson, JP
    Brown, WD
    Ang, SS
    Naseem, HA
    1999 INTERNATIONAL CONFERENCE ON HIGH DENSITY PACKAGING AND MCMS, PROCEEDINGS, 1999, 3830 : 327 - 331
  • [47] Modeling and electrical analysis of seamless high off-chip connectivity (SHOCC) interconnects
    Afonso, S
    Schaper, LW
    Parkerson, JP
    Brown, WD
    Ang, SS
    Naseem, HA
    IEEE TRANSACTIONS ON ADVANCED PACKAGING, 1999, 22 (03): : 309 - 320
  • [48] A practical approach of memory access parallelization to exploit multiple off-chip DDR memories
    Kwon, Woo-Cheol
    Yoo, Sungjoo
    Hong, Sung-Min
    Min, Byeong
    Choi, Kyu-Myung
    Eo, Soo-Kwan
    2008 45TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, VOLS 1 AND 2, 2008, : 447 - 452
  • [49] Live Demonstration: Real-Time Image Classification on a Neuromorphic Computing System with Zero Off-chip Memory Access
    Shin, Taehwan
    Kang, Yongshin
    Yang, Seungho
    Kim, Seban
    Chung, Jaeyong
    2016 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2016, : 449 - 449
  • [50] Modeling and Co-Simulation of I/O Interconnects for On-Chip and Off-Chip EMI Prediction
    Kwak, SangKeun
    Jo, Jeongmin
    Kim, SoYoung
    2012 ASIA-PACIFIC INTERNATIONAL SYMPOSIUM ON ELECTROMAGNETIC COMPATIBILITY (APEMC), 2012, : 821 - 824