Efficient integration of compiler-directed cache coherence and data prefetching

被引:1
|
作者
Lim, HB [1 ]
Yew, PC
机构
[1] Hewlett Packard Corp, Cupertino, CA 95014 USA
[2] Univ Minnesota, Dept Comp Sci & Engn, Minneapolis, MN 55455 USA
基金
美国国家科学基金会;
关键词
compiler-directed cache coherence; data prefetching; memory latency hiding; shared-memory multiprocessors;
D O I
10.1006/jpdc.2001.1784
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Cache coherence enforcement and memory latency reduction and hiding are very important and challenging problems in the design of large-scale distributed shared-memory (DSM) multiprocessors. We propose an integrated approach to solve these problems through a compiler-directed cache coherence scheme called the Cache Coherence with Data Prefetching (CCDP) scheme. The CCDP scheme enforces cache coherence by prefetching the potentially stale references in a parallel program. It also prefetches the non-stale references to hide their memory latencies. To optimize the performance of the CCDP scheme, some prefetch hardware support is provided to efficiently handle these two forms of data prefetching operations. We also developed the compiler techniques utilized by the CCDP scheme for stale reference detection, prefetch target analysis, and prefetch scheduling. We evaluated the performance of the CCDP' scheme via execution-driven simulations of several numerical applications from the SPEC CFP95 and the Perfect benchmark suites. The simulation results show that the CCDP scheme provides significant performance improvements for the applications studied, comparable to that obtained with a full-map hardware cache coherence scheme. (C) 2001 Academic Press.
引用
收藏
页码:1775 / 1802
页数:28
相关论文
共 50 条
  • [1] Maintaining cache coherence through compiler-directed data prefetching
    Lim, HB
    Yew, PC
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1998, 53 (02) : 144 - 173
  • [2] An integrated framework for compiler-directed Cache Coherence and Data Prefetching
    Lim, HB
    Yew, PC
    LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, 1999, 1656 : 51 - 67
  • [3] A compiler-directed cache coherence scheme using data prefetching
    Lim, HB
    Yew, PC
    11TH INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM, PROCEEDINGS, 1997, : 643 - 649
  • [4] Techniques for compiler-directed cache coherence
    Choi, L
    Lim, HB
    Yew, PC
    IEEE PARALLEL & DISTRIBUTED TECHNOLOGY, 1996, 4 (04): : 23 - &
  • [5] A Compiler-Directed Data Prefetching Scheme for Chip Multiprocessors
    Son, Seung Woo
    Kandemir, Mahmut
    Karakoy, Mustafa
    Chakrabarti, Dhruva
    ACM SIGPLAN NOTICES, 2009, 44 (04) : 209 - 218
  • [6] EFFECTIVENESS OF COMPILER-DIRECTED PREFETCHING ON DATA MINING BENCHMARKS
    Natarajan, Ragavendra
    Mekkat, Vineeth
    Hsu, Wei-Chung
    Zhai, Antonia
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2012, 21 (02)
  • [7] Compiler-directed data cache leakage reduction
    Zhang, W
    VLSI 2004: IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, PROCEEDINGS, 2004, : 305 - 306
  • [8] Compiler-directed cache polymorphism
    Hu, JS
    Kandemir, M
    Vijaykrishnan, N
    Irwin, MJ
    Saputra, H
    Zhang, W
    ACM SIGPLAN NOTICES, 2002, 37 (07) : 165 - 174
  • [9] Compiler-directed cache assist adaptivity
    Ji, XM
    Nicolaescu, D
    Veidenbaum, A
    Nicolau, A
    Gupta, R
    HIGH PERFORMANCE COMPUTING, PROCEEDINGS, 2000, 1940 : 88 - 104
  • [10] COMPILER-DIRECTED CACHE MANAGEMENT IN MULTIPROCESSORS
    CHEONG, H
    VEIDENBAUM, AV
    COMPUTER, 1990, 23 (06) : 39 - 47