AFEC: An Analytical Framework for Evaluating Cache Performance in Out-of-Order Processors

被引:0
|
作者
Ji, Kecheng [1 ]
Ling, Ming [1 ]
Wang, Qin [1 ]
Shi, Longxing [1 ]
Pan, Jianping [2 ]
机构
[1] Southeast Univ, Natl ASIC Syst Engn Technol Res Ctr, Nanjing 210096, Jiangsu, Peoples R China
[2] Univ Victoria, Dept Comp Sci, Victoria, BC, Canada
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Evaluating cache performance is becoming critically important to predict the overall performance of out-of-order processors. Non-blocking caches, which are very common in out-of-order CPUs, can reduce the average cache miss penalty by overlapping multiple outstanding memory requests and merging different cache misses with the same cacheline address into one memory request. Normally, memory-level-parallelism (MLP) has been used as a metric to describe the concurrency of memory access. Unfortunately, due to the extremely dynamic dependences among the program memory references, it is very difficult to quantify MLP without time-consuming simulations. Moreover, the merging of multiple cache misses, which makes the average cache miss service time less than the physical DDR access latency, is seldom considered in the existing researches. In this paper, we propose a cache performance evaluation framework based on program trace analysis and analytical models to fast estimate MLP and the effective cache miss service time without simulations. Comparing with the results by Gem5 simulations of MobyBench 2.0, Mibench 1.0 and Mediabench II, the average accuracy of the modeled MLP and the average cache miss service time is higher than 91% and 92%, respectively. Combined with cache misses calculated by the stack distance theory, the average absolute error of CPU stall time (due to cache misses) is lower than 10%, while the evaluation time can be sped up by 35 times relative to the Gem5 full simulations.
引用
收藏
页码:55 / 60
页数:6
相关论文
共 50 条
  • [1] An Analytical Cache Performance Evaluation Framework for Embedded Out-of-Order Processors Using Software Characteristics
    Ji, Kecheng
    Ling, Ming
    Shi, Longxing
    Pan, Jianping
    [J]. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2018, 17 (04)
  • [2] System performance analyses of out-of-order superscalar processors using analytical method
    Kim, HJ
    Kim, SM
    Choi, SB
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 1999, E82A (06) : 927 - 938
  • [3] Out-of-order commit processors
    Cristal, A
    Ortega, D
    Llosa, J
    Valero, M
    [J]. 10TH INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 2004, : 48 - 59
  • [4] A Mechanistic Performance Model for Superscalar Out-of-Order Processors
    Eyerman, Stijn
    Eeckhout, Lieven
    Karkhanis, Tejas
    Smith, James E.
    [J]. ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2009, 27 (02):
  • [5] Fast and Precise Cache Performance Estimation for Out-Of-Order Execution
    Douma, Roeland J.
    Altmeyer, Sebastian
    Pimentel, Andy D.
    [J]. 2015 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2015, : 1132 - 1137
  • [6] Evaluating register allocation and instruction scheduling techniques in out-of-order issue processors
    Valluri, Madhavi Gopal
    Govindarajan, R.
    [J]. Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT, 1999, : 78 - 83
  • [7] Enabling Intermittent Computing on High-Performance Out-of-Order Processors
    Sliper, Sivert T.
    Balsamo, Domenico
    Weddell, Alex S.
    Merrett, Geoff V.
    [J]. PROCEEDINGS OF THE 2018 INTERNATIONAL WORKSHOP ON ENERGY HARVESTING & ENERGY-NEUTRAL SENSING SYSTEMS (ENSSYS '18), 2018, : 19 - 25
  • [8] High Performance Instruction Scheduling Circuits for Out-of-Order Soft Processors
    Wong, Henry
    Betz, Vaughn
    Rose, Jonathan
    [J]. 2016 IEEE 24TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2016, : 9 - 16
  • [9] Modeling out-of-order processors for WCET analysis
    Li, Xianfeng
    Roychoudhury, Abhik
    Mitra, Tulika
    [J]. REAL-TIME SYSTEMS, 2006, 34 (03) : 195 - 227
  • [10] Using Condition Flag Prediction to Improve the Performance of Out-of-Order Processors
    Hsu, Tzu-Hsuan
    Lin, Ching-Wen
    Chen, Chung-Ho
    [J]. 2013 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2013, : 1240 - 1243