Exploiting procedure level locality to reduce instruction cache misses

被引:0
|
作者
Batchu, RV [1 ]
Jiménez, DA [1 ]
机构
[1] Rutgers State Univ, Dept Comp Sci, Piscataway, NJ 08855 USA
关键词
D O I
10.1109/INTERA.2004.1299512
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
High instruction fetch bandwidth is essential for high performance in today's wide-issue out-of-order processors. Instruction caches must provide a low miss rate as well as low latency. We introduce Procedure Level Relocation, a class of dynamic feedback-directed optimizations that substantially reduce the instruction cache miss rate by exploiting the temporal locality of procedure usage. Based on the observation that half of all procedures executed are at most 128 bytes in length, we present a Small Procedure Cache, a small and fast explicitly managed memory for storing small procedures. We show that Procedure Level Relocation into a Small Procedure Cache reduces the instruction cache miss rate by an average of 15%.
引用
收藏
页码:75 / 84
页数:10
相关论文
共 50 条
  • [1] Code positioning to reduce instruction cache misses in signal processing applications on multimedia risc processors
    Stolberg, HJ
    Ikekawa, M
    Kuroda, I
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 699 - 702
  • [2] A large, fast instruction window for tolerating cache misses
    Lebeck, AR
    Koppanalil, J
    Li, T
    Patwardhan, J
    Rotenberg, E
    29TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, PROCEEDINGS, 2002, : 59 - 70
  • [3] Exploiting Cache Locality to Speedup Register Clustering
    Fontana, Tiago Augusto
    Almeida, Sheiny
    Netto, Renan
    Livramento, Vinicius
    Laercio Pilla, Chrystian Guth
    Guntzel, Jose Luis
    2017 30TH SYMPOSIUM ON INTEGRATED CIRCUITS AND SYSTEMS DESIGN (SBCCI 2017): CHOP ON SANDS, 2017, : 191 - 197
  • [4] PREFETCHING PACING BUFFER TO REDUCE CACHE MISSES.
    Pomerene, J.H.
    Puzak, T.R.
    Rechtschaffen, R.N.
    Sparacio, F.J.
    IBM technical disclosure bulletin, 1984, 27 (05): : 2773 - 2774
  • [5] An effective instruction cache prefetch policy by exploiting cache history information
    Shin, SH
    Kim, CH
    Jhon, CS
    EMBEDDED AND UBIQUITOUS COMPUTING - EUC 2005, 2005, 3824 : 57 - 66
  • [6] Instruction prefetching of systems codes with layout optimized for reduced cache misses
    Xia, C
    Torrellas, J
    23RD ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, PROCEEDINGS, 1996, : 271 - 282
  • [7] Reshaping Cache Misses to Improve Row-Buffer Locality in Multicore Systems
    Ding, Wei
    Liu, Jun
    Kandemir, Mahmut
    Irwin, Mary Jane
    2013 22ND INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT), 2013, : 235 - 244
  • [8] Exploiting image processing locality in cache pre-fetching
    Cucchiara, R
    Piccardi, M
    FIFTH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, PROCEEDINGS, 1998, : 466 - 472
  • [9] DECOUPLED COMPRESSED CACHE: EXPLOITING SPATIAL LOCALITY FOR ENERGY OPTIMIZATION
    Sardashti, Somayeh
    Wood, David A.
    IEEE MICRO, 2014, 34 (03) : 91 - 99
  • [10] Cache decay: Exploiting generational behavior to reduce cache leakage power
    Kaxiras, S
    Hu, ZG
    Martonosi, M
    28TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, PROCEEDINGS, 2001, : 240 - 251