An evaluation of memory consistency models for shared-memory systems with ILP processors

被引:5
|
作者
Pai, VS
Ranganathan, P
Adve, SV
Harton, T
机构
[1] Dept. of Elec. and Comp. Engineering, Rice University, Houston
关键词
D O I
10.1145/248209.237142
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Relaxed consistency models have been shown to significantly outperform sequential consistency for single-issue, statically scheduled processors with blocking reads. However, current microprocessors aggressively exploit instruction-level parallelism (ILP) using methods such as multiple issue, dynamic scheduling, and non-blocking reads. Researchers have conjectured that two techniques, hardware-controlled non-binding prefetching and speculative loads, have the potential to equalize the hardware performance of memory consistency models on such processors, This paper performs the first detailed quantitative comparison of several implementations of sequential consistency and release consistency optimized for aggressive ILP processors. Our results indicate that hardware prefetching and speculative loads dramatically improve the performance of sequential consistency. However, the gap between sequential consistency and release consistency depends on the cache write policy and the complexity of the cache-coherence protocol implementation. In most cases, release consistency significantly outperforms sequential consistency, but for two applications, the use of a write-back primary cache and a more complex cache-coherence protocol nearly equalizes the performance of the two models. We also observe that the existing techniques, which require on-chip hardware modifications, enhance the performance of release consistency only to a smell extent. We propose two new software techniques - fuzzy acquires and selective acquires - to achieve more overlap than allowed by the previous implementations of release consistency. To enhance methods for overlapping acquires, we also propose a technique to eliminate control dependences caused by an acquire loop, using a small amount of off-chip hardware called the synchronization buffer.
引用
收藏
页码:12 / 23
页数:12
相关论文
共 50 条
  • [1] Analytic evaluation of shared-memory systems with ILP processors
    Sorin, DJ
    Pai, VS
    Adve, SV
    Vernon, MK
    Wood, DA
    [J]. 25TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, PROCEEDINGS, 1998, : 380 - 391
  • [2] The interaction of software prefetching with ILP processors in shared-memory systems
    Ranganathan, P
    Pai, VS
    AbdelShafi, H
    Adve, SV
    [J]. 24TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, CONFERENCE PROCEEDINGS, 1997, : 144 - 156
  • [3] Rsim: Simulating shared-memory multiprocessors with ILP processors
    Hughes, CJ
    Pai, VS
    Ranganathan, P
    Adve, SV
    [J]. COMPUTER, 2002, 35 (02) : 40 - +
  • [4] PERFORMANCE EVALUATION OF MEMORY CONSISTENCY MODELS FOR SHARED-MEMORY MULTIPROCESSORS
    GHARACHORLOO, K
    GUPTA, A
    HENNESSY, J
    [J]. SIGPLAN NOTICES, 1991, 26 (04): : 245 - 257
  • [5] Specification and verification of memory consistency models for shared-memory multiprocessor systems
    Takata, S
    Taguchi, K
    Joe, K
    Fukuda, A
    [J]. INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-IV, PROCEEDINGS, 1998, : 923 - 930
  • [6] SHARED-MEMORY CONTROLLERS LINK PROCESSORS
    AOUIZERAT, R
    [J]. MINI-MICRO SYSTEMS, 1983, 16 (11): : 272 - 274
  • [7] Improving the accuracy vs. speed tradeoff for simulating shared-memory multiprocessors with ILP processors
    Durbhakula, M
    Pai, VS
    Adve, S
    [J]. FIFTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 1999, : 23 - 32
  • [8] A pipelined shared-memory architecture for FFT processors
    Jia, LH
    Gao, YH
    Tenhunen, H
    [J]. 42ND MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, PROCEEDINGS, VOLS 1 AND 2, 1999, : 804 - 807
  • [9] Grouping memory consistency model for parallel-multithreaded shared-memory multiprocessor systems
    Wu, CC
    Chen, C
    [J]. INTERNATIONAL JOURNAL OF HIGH SPEED COMPUTING, 1999, 10 (01): : 53 - 81
  • [10] Memory consistency models for shared memory multiprocessors and DSM systems
    Protic, J
    Tartalja, I
    Tomasevic, M
    [J]. MELECON '96 - 8TH MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, PROCEEDINGS, VOLS I-III: INDUSTRIAL APPLICATIONS IN POWER SYSTEMS, COMPUTER SCIENCE AND TELECOMMUNICATIONS, 1996, : 1112 - 1115