A Scalable Analytical Memory Model for CPU Performance Prediction

被引:5
|
作者
Chennupati, Gopinath [1 ]
Santhi, Nandakishore [1 ]
Bird, Robert [1 ]
Thulasidasan, Sunil [1 ]
Badawy, Abdel-Hameed A. [2 ]
Misra, Satyajayant [3 ]
Eidenbenz, Stephan [1 ]
机构
[1] Los Alamos Natl Lab, SM 30, Los Alamos, NM 87545 USA
[2] New Mexico State Univ, Klipsch Sch Elect & Comp Engn, Las Cruces, NM 88003 USA
[3] New Mexico State Univ, Comp Sci Dept, Las Cruces, NM 88003 USA
关键词
Performance modeling; Cache hierarchies; Reuse distance; Probabilistic models; LLVM; Basic blocks;
D O I
10.1007/978-3-319-72971-8_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the US Department of Energy (DOE) invests in exascale computing, performance modeling of physics codes on CPUs remain a challenge in computational co-design due to the complex design of processors including memory hierarchies, instruction pipelining, and speculative execution. We present Analytical Memory Model (AMM), a model of cache hierarchies, embedded in the Performance Prediction Toolkit (PPT) - a suite of discrete-event-simulation-based co-design hardware and software models. AMM enables PPT to significantly improve the quality of its runtime predictions of scientific codes. AMM uses a computationally efficient, stochastic method to predict the reuse distance profiles, where reuse distance is a hardware architecture-independent measure of the patterns of virtual memory accesses. AMM relies on a stochastic, static basic block-level analysis of reuse profiles measured from the memory traces of applications on small instances. The analytical reuse profile is useful to estimate the effective latency and throughput of memory access, which in turn are used to predict the overall runtime of an application. Our experimental results demonstrate the scalability of AMM, where we report the error-rates of three benchmarks on two different hardware models.
引用
收藏
页码:114 / 135
页数:22
相关论文
共 50 条
  • [1] AN ANALYTICAL MEMORY HIERARCHY MODEL FOR PERFORMANCE PREDICTION
    Chennupati, Gopinath
    Santhi, Nandakishore
    Eidenbenz, Stephan
    Thulasidasan, Sunil
    [J]. 2017 WINTER SIMULATION CONFERENCE (WSC), 2017, : 908 - 919
  • [2] Scalable Performance Prediction of Codes with Memory Hierarchy and Pipelines
    Chennupati, Gopinath
    Santhi, Nandakishore
    Eidenbenz, Stephan
    [J]. PROCEEDINGS OF THE 2019 ACM SIGSIM CONFERENCE ON PRINCIPLES OF ADVANCED DISCRETE SIMULATION (SIGSIM-PADS'19), 2019, : 13 - 24
  • [3] Toward an Analytical Performance Model to Select between GPU and CPU Execution
    Chikin, Artem
    Amaral, Jose Nelson
    Ali, Karim
    Tiotto, Ettore
    [J]. 2019 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2019, : 353 - 362
  • [4] Analytical model for performance prediction of linear resolver
    Saneie, Hamid
    Nasiri-Gheidari, Zahra
    Tootoonchian, Farid
    [J]. IET ELECTRIC POWER APPLICATIONS, 2017, 11 (08) : 1457 - 1465
  • [5] Performance Analysis of Cache Memory in CPU
    Mankad, Viraj
    Shah, Virag
    Gajjar, Sachin
    Shah, Dhaval
    [J]. SOFT COMPUTING AND ITS ENGINEERING APPLICATIONS, ICSOFTCOMP 2022, 2023, 1788 : 170 - 181
  • [6] Development of an analytical model for performance prediction of chemical FOR methods
    El-Tayeb, M.
    Abu El Ela, M.
    El-Banbi, A.
    Sayyouh, M. H.
    [J]. OIL GAS-EUROPEAN MAGAZINE, 2019, 45 (04): : 201 - 207
  • [7] Modified analytical model for prediction of steam flood performance
    Dutt, Ankit
    Mandal, Ajay
    [J]. JOURNAL OF PETROLEUM EXPLORATION AND PRODUCTION TECHNOLOGY, 2012, 2 (03) : 117 - 123
  • [8] AMM: Scalable Memory Reuse Model to Predict the Performance of Physics Codes
    Chennupati, Gopinath
    Santhi, Nandakishore
    Eidenbenz, Stephan
    Thulasidasan, Sunil
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2017, : 649 - 650
  • [9] Performance prediction - A case study using a scalable shared-virtual-memory machine
    Sun, XH
    Zhu, JP
    [J]. IEEE PARALLEL & DISTRIBUTED TECHNOLOGY, 1996, 4 (04): : 36 - &
  • [10] Falcon: A Scalable Analytical Cache Model
    Pitchanathan, Arjun
    Grover, Kunwar
    Grosser, Tobias
    [J]. PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES-PACMPL, 2024, 8 (PLDI):