Exploiting procedure level locality to reduce instruction cache misses

被引：0

作者：

Batchu, RV ^{[1
]}

Jiménez, DA ^{[1
]}

机构：

[1] Rutgers State Univ, Dept Comp Sci, Piscataway, NJ 08855 USA

来源：

EIGHTH WORKSHOP ON INTERACTION BETWEEN COMPILERS AND COMPUTER ARCHITECTURES, PROCEEDINGS | 2004年

关键词：

D O I：

10.1109/INTERA.2004.1299512

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

High instruction fetch bandwidth is essential for high performance in today's wide-issue out-of-order processors. Instruction caches must provide a low miss rate as well as low latency. We introduce Procedure Level Relocation, a class of dynamic feedback-directed optimizations that substantially reduce the instruction cache miss rate by exploiting the temporal locality of procedure usage. Based on the observation that half of all procedures executed are at most 128 bytes in length, we present a Small Procedure Cache, a small and fast explicitly managed memory for storing small procedures. We show that Procedure Level Relocation into a Small Procedure Cache reduces the instruction cache miss rate by an average of 15%.

引用

页码：75 / 84

页数：10

共 50 条

[31] ICFP: TOLERATING ALL-LEVEL CACHE MISSES IN IN-ORDER PROCESSORS
Hilton, Andrew
Nagarakatte, Santosh
Roth, Amir
IEEE MICRO, 2010, 30 (01) : 12 - 19
[32] Effective Data Placement to Reduce Cache Thrashing in Last Level Cache
Ross, William
Lee, Byeong Kil
16TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY-NEW GENERATIONS (ITNG 2019), 2019, 800 : 291 - 296
[33] Fat Loads: Exploiting Locality Amongst Contemporaneous Load Operations to Optimize Cache Accesses
Baoni, Vanshika
Mittal, Adarsh
Sohi, Gurindar S.
PROCEEDINGS OF 54TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, MICRO 2021, 2021, : 366 - 379
[34] Combining optimization for cache and instruction-level parallelism
Carr, S
PROCEEDINGS OF THE 1996 CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT '96), 1996, : 238 - 247
[35] Exploiting Data Locality in Memory for ORAM to Reduce Memory Access Overheads
Kuang, Jinxi
Shen, Minghua
Lu, Yutong
Xiao, Nong
PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022, : 703 - 708
[36] Reducing the performance impact of instruction cache misses by writing instructions into the reservation stations out-of-order
Stark, J
Racunas, P
Patt, YN
THIRTIETH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, PROCEEDINGS, 1997, : 34 - 43
[37] Exploiting Locality to Improve Circuit-level Timing Speculation
Xin, Jing
Joseph, Russ
IEEE COMPUTER ARCHITECTURE LETTERS, 2009, 8 (02) : 40 - 43
[38] Temporal-based procedure reordering for improved instruction cache performance
Kalamatianos, J
Kaeli, DR
1998 FOURTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 1998, : 244 - 253
[39] Reducing the second-level cache conflict misses using a set folding technique
Shatnawi, Ali
Alsaedeen, Mohammad
JOURNAL OF SUPERCOMPUTING, 2018, 74 (02): : 970 - 993
[40] Reducing the second-level cache conflict misses using a set folding technique
Ali Shatnawi
Mohammad Alsaedeen
The Journal of Supercomputing, 2018, 74 : 970 - 993

← 1 2 3 4 5 →