Runahead execution: An alternative to very large instruction windows for out-of-order processors

被引：152

作者：

Mutlu, O ^{[1
]}

Stark, J ^{[1
]}

Wilkerson, C ^{[1
]}

Patt, YN ^{[1
]}

机构：

[1] Univ Texas, ECE Dept, Austin, TX 78712 USA

来源：

NINTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS | 2003年

关键词：

D O I：

10.1109/HPCA.2003.1183532

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Today's high performance processors tolerate long latency operations by, means of out-of-order execution. However as latencies increase, the size of the instruction window must increase even faster if we are to continue to tolerate these latencies. We have already reached the point where the size of an instruction window that can handle these latencies is prohibitively large, in terms of both design complexity and power consumption. And, the problem is getting worse. This paper proposes runahead execution as an effective way to increase memory latency tolerance in an out-of-order processor, without requiring an unreasonably large instruction window. Runahead execution unblocks the instruction window blocked by long latency operations allowing the processor to execute far ahead in the program path. This results in data being prefetched into caches long before it is needed. On a machine model based on the Intel((R)) Pentium((R)) 4 processor having a 128-entry instruction window, adding runahead execution improves the IPC (Instructions Per Cycle) by 22% across a wide range of memory, intensive applications. Also, for the same machine model, runahead execution combined with a 128-entry window performs within 1% of a machine with no runahead execution and a 384-entry instruction window.

引用

页码：129 / 140

页数：12

共 50 条

[1] Runahead execution: An effective alternative to large instruction windows
Mutlu, O
Stark, J
Wilkerson, C
Patt, YN
[J]. IEEE MICRO, 2003, 23 (06) : 20 - 25
[2] Data-flow prescheduling for large instruction windows in out-of-order processors
Michaud, P
Seznec, A
[J]. HPCA: SEVENTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTING ARCHITECTURE, PROCEEDINGS, 2001, : 27 - 36
[3] INTERRUPT HANDLING FOR OUT-OF-ORDER EXECUTION PROCESSORS
TORNG, HC
DAY, M
[J]. IEEE TRANSACTIONS ON COMPUTERS, 1993, 42 (01) : 122 - 127
[4] Direct instruction wakeup for out-of-order processors
Ramírez, MA
Cristal, A
Veidenbaum, AV
Villa, L
Valero, M
[J]. INNOVATIVE ARCHITECTURE FOR FUTURE GENERATION HIGH-PERFORMANCE PROCESSORS AND SYSTEMS, PROCEEDINGS, 2004, : 2 - 9
[5] Improving branch prediction and predicated execution in out-of-order processors
Quinones, Eduardo
Parcerisa, Joan-Manuel
Gonzalez, Antonio
[J]. THIRTEENTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 2007, : 75 - +
[6] An exploration of instruction fetch requirement in out-of-order superscalar processors
Michaud, P
Seznec, A
Jourdan, S
[J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2001, 29 (01) : 35 - 58
[7] An Exploration of Instruction Fetch Requirement in Out-of-Order Superscalar Processors
Pierre Michaud
André Seznec
Stéphan Jourdan
[J]. International Journal of Parallel Programming, 2001, 29 : 35 - 58
[8] Out-of-order commit processors
Cristal, A
Ortega, D
Llosa, J
Valero, M
[J]. 10TH INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 2004, : 48 - 59
[9] High Performance Instruction Scheduling Circuits for Out-of-Order Soft Processors
Wong, Henry
Betz, Vaughn
Rose, Jonathan
[J]. 2016 IEEE 24TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2016, : 9 - 16
[10] Can out-of-order instruction execution in multiprocessors be made sequentially consistent?
Higham, L
Kawash, J
[J]. NETWORK AND PARALLEL COMPUTING, PROCEEDINGS, 2005, 3779 : 261 - 265

← 1 2 3 4 5 →