A Scalable Analytical Memory Model for CPU Performance Prediction

被引：5

作者：

Chennupati, Gopinath ^{[1
]}

Santhi, Nandakishore ^{[1
]}

Bird, Robert ^{[1
]}

Thulasidasan, Sunil ^{[1
]}

Badawy, Abdel-Hameed A. ^{[2
]}

Misra, Satyajayant ^{[3
]}

Eidenbenz, Stephan ^{[1
]}

机构：

[1] Los Alamos Natl Lab, SM 30, Los Alamos, NM 87545 USA

[2] New Mexico State Univ, Klipsch Sch Elect & Comp Engn, Las Cruces, NM 88003 USA

[3] New Mexico State Univ, Comp Sci Dept, Las Cruces, NM 88003 USA

来源：

HIGH PERFORMANCE COMPUTING SYSTEMS: PERFORMANCE MODELING, BENCHMARKING, AND SIMULATION (PMBS 2017) | 2018年 / 10724卷

关键词：

Performance modeling; Cache hierarchies; Reuse distance; Probabilistic models; LLVM; Basic blocks;

D O I：

10.1007/978-3-319-72971-8_6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

As the US Department of Energy (DOE) invests in exascale computing, performance modeling of physics codes on CPUs remain a challenge in computational co-design due to the complex design of processors including memory hierarchies, instruction pipelining, and speculative execution. We present Analytical Memory Model (AMM), a model of cache hierarchies, embedded in the Performance Prediction Toolkit (PPT) - a suite of discrete-event-simulation-based co-design hardware and software models. AMM enables PPT to significantly improve the quality of its runtime predictions of scientific codes. AMM uses a computationally efficient, stochastic method to predict the reuse distance profiles, where reuse distance is a hardware architecture-independent measure of the patterns of virtual memory accesses. AMM relies on a stochastic, static basic block-level analysis of reuse profiles measured from the memory traces of applications on small instances. The analytical reuse profile is useful to estimate the effective latency and throughput of memory access, which in turn are used to predict the overall runtime of an application. Our experimental results demonstrate the scalability of AMM, where we report the error-rates of three benchmarks on two different hardware models.

引用

页码：114 / 135

页数：22

共 50 条

[1] AN ANALYTICAL MEMORY HIERARCHY MODEL FOR PERFORMANCE PREDICTION
Chennupati, Gopinath
Santhi, Nandakishore
Eidenbenz, Stephan
Thulasidasan, Sunil
[J]. 2017 WINTER SIMULATION CONFERENCE (WSC), 2017, : 908 - 919
[2] Scalable Performance Prediction of Codes with Memory Hierarchy and Pipelines
Chennupati, Gopinath
Santhi, Nandakishore
Eidenbenz, Stephan
[J]. PROCEEDINGS OF THE 2019 ACM SIGSIM CONFERENCE ON PRINCIPLES OF ADVANCED DISCRETE SIMULATION (SIGSIM-PADS'19), 2019, : 13 - 24
[3] Toward an Analytical Performance Model to Select between GPU and CPU Execution
Chikin, Artem
Amaral, Jose Nelson
Ali, Karim
Tiotto, Ettore
[J]. 2019 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2019, : 353 - 362
[4] Analytical model for performance prediction of linear resolver
Saneie, Hamid
Nasiri-Gheidari, Zahra
Tootoonchian, Farid
[J]. IET ELECTRIC POWER APPLICATIONS, 2017, 11 (08) : 1457 - 1465
[5] Performance Analysis of Cache Memory in CPU
Mankad, Viraj
Shah, Virag
Gajjar, Sachin
Shah, Dhaval
[J]. SOFT COMPUTING AND ITS ENGINEERING APPLICATIONS, ICSOFTCOMP 2022, 2023, 1788 : 170 - 181
[6] Development of an analytical model for performance prediction of chemical FOR methods
El-Tayeb, M.
Abu El Ela, M.
El-Banbi, A.
Sayyouh, M. H.
[J]. OIL GAS-EUROPEAN MAGAZINE, 2019, 45 (04): : 201 - 207
[7] Modified analytical model for prediction of steam flood performance
Dutt, Ankit
Mandal, Ajay
[J]. JOURNAL OF PETROLEUM EXPLORATION AND PRODUCTION TECHNOLOGY, 2012, 2 (03) : 117 - 123
[8] AMM: Scalable Memory Reuse Model to Predict the Performance of Physics Codes
Chennupati, Gopinath
Santhi, Nandakishore
Eidenbenz, Stephan
Thulasidasan, Sunil
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2017, : 649 - 650
[9] Performance prediction - A case study using a scalable shared-virtual-memory machine
Sun, XH
Zhu, JP
[J]. IEEE PARALLEL & DISTRIBUTED TECHNOLOGY, 1996, 4 (04): : 36 - &
[10] Falcon: A Scalable Analytical Cache Model
Pitchanathan, Arjun
Grover, Kunwar
Grosser, Tobias
[J]. PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES-PACMPL, 2024, 8 (PLDI):

← 1 2 3 4 5 →