Zero cost indexing for improved processor cache performance

Cited by: 6
Authors
Givargis, T [1]
Affiliations
[1] Univ Calif Irvine, Dept Comp Sci, Ctr Embedded Comp Syst, Irvine, CA 92697 USA
Keywords
design; performance; cache optimization; design exploration; index hashing
DOI
10.1145/1124713.1124715
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The increasing use of microprocessor cores in embedded systems as well as mobile and portable devices creates an opportunity for customizing the cache subsystem for improved performance. In traditional cache design, the index portion of the memory address bus consists of the K least significant bits, where $K = \log_2 D$ and D is the depth of the cache. However, in devices where the application set is known and characterized (e.g., systems that execute a fixed application set) there is an opportunity to improve cache performance by choosing a near-optimal set of bits used as index into the cache. This technique does not add any overhead in terms of area or delay. In this article, we present an efficient heuristic algorithm for selecting K index bits for improved cache performance. We show the feasibility of our algorithm by applying it to a large number of embedded system applications as well as the integer SPEC CPU 2000 benchmarks. Specifically, for data traces, we show up to 45% reduction in cache misses. Likewise, for instruction traces, we show up to 31% reduction in cache misses. When a unified data/instruction cache architecture is considered, our results show an average improvement of 14.5% for the Powerstone benchmarks and an average improvement of 15.2% for the SPEC'00 benchmarks.
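The idea of indexing a cache with bits other than the K least significant ones can be illustrated with a small simulation. The sketch below is not the article's heuristic, which the abstract does not describe. It assumes a direct-mapped cache with one-word lines and substitutes a brute-force search over all K-bit subsets, which is practical only for toy address widths. The function names `extract_index`, `count_misses`, and `best_index_bits` are hypothetical.

```python
from itertools import combinations


def extract_index(addr, bit_positions):
    """Gather the chosen address bits into a cache index (first position = index bit 0)."""
    index = 0
    for i, pos in enumerate(bit_positions):
        index |= ((addr >> pos) & 1) << i
    return index


def count_misses(trace, bit_positions):
    """Count misses in a direct-mapped cache with one-word lines.

    Assumption: the full address is stored as the tag, which gives the same
    hit/miss behavior as storing only the non-index bits.
    """
    lines = {}  # cache index -> address currently held in that line
    misses = 0
    for addr in trace:
        idx = extract_index(addr, bit_positions)
        if lines.get(idx) != addr:
            misses += 1
            lines[idx] = addr
    return misses


def best_index_bits(trace, k, address_bits):
    """Brute-force stand-in for a selection heuristic: try every k-bit subset."""
    return min(
        combinations(range(address_bits), k),
        key=lambda bits: count_misses(trace, bits),
    )


# Two addresses that differ only in bit 2, accessed alternately.
trace = [0, 4] * 8
conventional = count_misses(trace, (0, 1))  # K = 2 least significant bits
chosen = best_index_bits(trace, 2, 4)
print(conventional, chosen, count_misses(trace, chosen))
```

With the conventional index both addresses map to the same line and evict each other on every access, while an index that includes bit 2 separates them. Because the chosen bits are simply wired to the index port, the selection changes which address lines are used rather than adding logic, which is consistent with the abstract's claim of no area or delay overhead.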
Pages: 3-25
Page count: 23