Multiple branch prediction for wide-issue superscalar

被引:1
|
作者
Hwang, SL [1 ]
Chen, CC
机构
[1] Natl Taiwan Univ, Dept Elect Engn, Taipei 10764, Taiwan
[2] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei 10764, Taiwan
关键词
branch prediction; wide-issue superscalar; branch target buffer; branch penalty;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Modern micro-architectures employ superscalar techniques to enhance system performance. Since the superscalar microprocessors must fetch at least one instruction cache line at a time to support high issue rate and large amount speculative executions. There are cases that multiple branches are often encountered in one cycle. And in practical implementation this would cause serious problem while there are variable number of instruction addresses that look up the Branch Target Buffer simultaneously. In this paper, we propose a Range Associative Branch Target Buffer (RABTB) that can recognize and predict multiple branches in the same instruction cache line for a wide-issue micro-architecture. Several configurations of the RABTB are simulated and compared using the SPECint95 benchmarks. We show that with a reasonable size of prediction scope, branch prediction can be improved by supporting multiple / up to 8 branch predictions in one cache line in one cycle. Our simulation results show that the optimal RABTB should be 2048 entry, 8-column range-associate and 8-entry modified ring buffer architecture using PAs prediction algorithm. It has an average 5.2 IPC_f and branch penalty per branch of 0.54 cycles. This is almost two times better than a mechanism that makes prediction only on the first encountered branch.
引用
收藏
页码:1154 / 1166
页数:13
相关论文
共 50 条
  • [31] Superscalar instruction issue in an asynchronous microprocessor
    Endecott, PB
    IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 1996, 143 (05): : 266 - 272
  • [32] Using rewriting rules and Positive Equality to formally verify wide-issue out-of-order microprocessors with a Reorder Buffer
    Velev, MN
    DESIGN, AUTOMATION AND TEST IN EUROPE CONFERENCE AND EXHIBITION, 2002 PROCEEDINGS, 2002, : 28 - 35
  • [33] Prediction caches for superscalar processors
    Bennett, JE
    Flynn, MJ
    THIRTIETH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, PROCEEDINGS, 1997, : 81 - 90
  • [34] SPECULATIVE EXECUTION AND REDUCING BRANCH PENALTY ON A SUPERSCALAR PROCESSOR
    ANDO, H
    NAKANISHI, C
    MACHIDA, H
    HARA, T
    NAKAYA, M
    IEICE TRANSACTIONS ON ELECTRONICS, 1993, E76C (07) : 1080 - 1093
  • [35] On the design complexity of the issue logic of superscalar machines
    Cotofana, S
    Vassiliadis, S
    24TH EUROMICRO CONFERENCE - PROCEEDING, VOLS 1 AND 2, 1998, : 277 - 284
  • [36] Simplifying instruction issue logic in superscalar processors
    Sato, T
    Arita, I
    EUROMICRO SYMPOSIUM ON DIGITAL SYSTEM DESIGN, PROCEEDINGS: ARCHITECTURES, METHODS AND TOOLS, 2002, : 341 - 346
  • [37] Circuits for wide-window superscalar processors
    Henry, DS
    Kuszmaul, BC
    Loh, GH
    Sami, R
    PROCEEDING OF THE 27TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, 2000, : 236 - 247
  • [38] Explicitly declared delayed-branch mechanism for a superscalar architecture
    Collins, Roger, 1600, Elsevier Science B.V., Amsterdam, Netherlands (40): : 10 - 12
  • [39] AN EXPLICITLY DECLARED DELAYED-BRANCH MECHANISM FOR A SUPERSCALAR ARCHITECTURE
    COLLINS, R
    STEVEN, G
    MICROPROCESSING AND MICROPROGRAMMING, 1994, 40 (10-12): : 677 - 680
  • [40] Multiple context multithreaded superscalar processor architecture
    Loh, KS
    Wong, WF
    JOURNAL OF SYSTEMS ARCHITECTURE, 2000, 46 (03) : 243 - 258