Runahead execution: An alternative to very large instruction windows for out-of-order processors

被引:152
|
作者
Mutlu, O [1 ]
Stark, J [1 ]
Wilkerson, C [1 ]
Patt, YN [1 ]
机构
[1] Univ Texas, ECE Dept, Austin, TX 78712 USA
关键词
D O I
10.1109/HPCA.2003.1183532
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Today's high performance processors tolerate long latency operations by, means of out-of-order execution. However as latencies increase, the size of the instruction window must increase even faster if we are to continue to tolerate these latencies. We have already reached the point where the size of an instruction window that can handle these latencies is prohibitively large, in terms of both design complexity and power consumption. And, the problem is getting worse. This paper proposes runahead execution as an effective way to increase memory latency tolerance in an out-of-order processor, without requiring an unreasonably large instruction window. Runahead execution unblocks the instruction window blocked by long latency operations allowing the processor to execute far ahead in the program path. This results in data being prefetched into caches long before it is needed. On a machine model based on the Intel((R)) Pentium((R)) 4 processor having a 128-entry instruction window, adding runahead execution improves the IPC (Instructions Per Cycle) by 22% across a wide range of memory, intensive applications. Also, for the same machine model, runahead execution combined with a 128-entry window performs within 1% of a machine with no runahead execution and a 384-entry instruction window.
引用
下载
收藏
页码:129 / 140
页数:12
相关论文
共 50 条
  • [21] A Taxonomy of Out-of-Order Instruction Commit
    Alipour, Mehdi
    Carlson, Trevor E.
    Kaxiras, Stefanos
    2017 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2017, : 135 - 136
  • [22] High-Performance Instruction Scheduling Circuits for Superscalar Out-of-Order Soft Processors
    Wong, Henry
    Betz, Vaughn
    Rose, Jonathan
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2018, 11 (01)
  • [23] Modeling out-of-order processors for WCET analysis
    Li, Xianfeng
    Roychoudhury, Abhik
    Mitra, Tulika
    REAL-TIME SYSTEMS, 2006, 34 (03) : 195 - 227
  • [24] Modeling out-of-order processors for WCET analysis
    Xianfeng Li
    Abhik Roychoudhury
    Tulika Mitra
    Real-Time Systems, 2006, 34 : 195 - 227
  • [25] Unrolling shape for out-of-order processors.
    Sato, H
    INNOVATIVE ARCHITECTURE FOR FUTURE GENERATION HIGH-PERFORMANCE PROCESSORS AND SYSTEMS, 2003, : 88 - 97
  • [26] An analysis of the performance impact of wrong-path memory references on out-of order and runahead execution processors
    Mutlu, O
    Kim, H
    Armstrong, DN
    Patt, YN
    IEEE TRANSACTIONS ON COMPUTERS, 2005, 54 (12) : 1556 - 1571
  • [27] Efficient Methods for Out-of-Order Load/Store Execution for High-Performance Soft Processors
    Wong, Henry
    Betz, Vaughn
    Rose, Jonathan
    PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), 2013, : 442 - 445
  • [28] OoOJava']Java: Software Out-of-Order Execution
    Jenista, James C.
    Eom, Yong Hun
    Demsky, Brian
    ACM SIGPLAN NOTICES, 2011, 46 (08) : 57 - 67
  • [29] Efficient Out-of-Order Execution of Guarded ISAs
    Premillieu, Nathanael
    Seznec, Andre
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2014, 11 (04)
  • [30] A Novel Register Renaming Technique for Out-of-Order Processors
    Tabani, Hamid
    Arnau, Jose-Maria
    Tubella, Jordi
    Gonzalez, Antonio
    2018 24TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2018, : 259 - 270