Exploiting procedure level locality to reduce instruction cache misses

被引:0
|
作者
Batchu, RV [1 ]
Jiménez, DA [1 ]
机构
[1] Rutgers State Univ, Dept Comp Sci, Piscataway, NJ 08855 USA
关键词
D O I
10.1109/INTERA.2004.1299512
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
High instruction fetch bandwidth is essential for high performance in today's wide-issue out-of-order processors. Instruction caches must provide a low miss rate as well as low latency. We introduce Procedure Level Relocation, a class of dynamic feedback-directed optimizations that substantially reduce the instruction cache miss rate by exploiting the temporal locality of procedure usage. Based on the observation that half of all procedures executed are at most 128 bytes in length, we present a Small Procedure Cache, a small and fast explicitly managed memory for storing small procedures. We show that Procedure Level Relocation into a Small Procedure Cache reduces the instruction cache miss rate by an average of 15%.
引用
收藏
页码:75 / 84
页数:10
相关论文
共 50 条
  • [31] ICFP: TOLERATING ALL-LEVEL CACHE MISSES IN IN-ORDER PROCESSORS
    Hilton, Andrew
    Nagarakatte, Santosh
    Roth, Amir
    IEEE MICRO, 2010, 30 (01) : 12 - 19
  • [32] Effective Data Placement to Reduce Cache Thrashing in Last Level Cache
    Ross, William
    Lee, Byeong Kil
    16TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY-NEW GENERATIONS (ITNG 2019), 2019, 800 : 291 - 296
  • [33] Fat Loads: Exploiting Locality Amongst Contemporaneous Load Operations to Optimize Cache Accesses
    Baoni, Vanshika
    Mittal, Adarsh
    Sohi, Gurindar S.
    PROCEEDINGS OF 54TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, MICRO 2021, 2021, : 366 - 379
  • [34] Combining optimization for cache and instruction-level parallelism
    Carr, S
    PROCEEDINGS OF THE 1996 CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT '96), 1996, : 238 - 247
  • [35] Exploiting Data Locality in Memory for ORAM to Reduce Memory Access Overheads
    Kuang, Jinxi
    Shen, Minghua
    Lu, Yutong
    Xiao, Nong
    PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022, : 703 - 708
  • [36] Reducing the performance impact of instruction cache misses by writing instructions into the reservation stations out-of-order
    Stark, J
    Racunas, P
    Patt, YN
    THIRTIETH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, PROCEEDINGS, 1997, : 34 - 43
  • [37] Exploiting Locality to Improve Circuit-level Timing Speculation
    Xin, Jing
    Joseph, Russ
    IEEE COMPUTER ARCHITECTURE LETTERS, 2009, 8 (02) : 40 - 43
  • [38] Temporal-based procedure reordering for improved instruction cache performance
    Kalamatianos, J
    Kaeli, DR
    1998 FOURTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 1998, : 244 - 253
  • [39] Reducing the second-level cache conflict misses using a set folding technique
    Shatnawi, Ali
    Alsaedeen, Mohammad
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (02): : 970 - 993
  • [40] Reducing the second-level cache conflict misses using a set folding technique
    Ali Shatnawi
    Mohammad Alsaedeen
    The Journal of Supercomputing, 2018, 74 : 970 - 993