Memory latency in distributed shared-memory multiprocessors

被引:0
|
作者
Motlagh, BS [1 ]
DeMara, RF [1 ]
机构
[1] Univ Cent Florida, Dept Engn Technol, Orlando, FL 32826 USA
关键词
D O I
10.1109/SECON.1998.673311
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Analytical models were developed and simulations of memory latency were performed for Uniform Memory Access (UMA), Non-Uniform Memory Access (NUMA), Local-Remote-Global (LRG), and replicated Concurrent-Read (RCR) architectures for hit rates from 0.1 to 0.9 in steps of 0.1, memory access times of 10 nsec to 100 nsec, proportions of read/write access from 0.01 to 0.1, and block sites of 8 to 64 words. The RCR architecture based on redundant inexpensive DRAM is shown to provide favorable performance over UMA and NUMA architectures for application and system parameters in the range evaluated. RCR outperforms LRC: architectures when the hit rates of the processor cache exceed 80% and hit rates of replicated memory exceed 25%. Inclusion of a small replicated memory at each processor significantly reduces expected access time since all replicated memory READ access hits become independent of global traffic. For configurations of up to 32 processors, results show that latency is further reduced by distinguishing burst-mode transfers between isolated memory accesses and those which are incrementally outside the working set.
引用
收藏
页码:134 / 137
页数:4
相关论文
共 50 条
  • [41] Parallel classification for data mining on shared-memory multiprocessors
    Zaki, MJ
    Ho, CT
    Agrawal, R
    15TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1999, : 198 - 205
  • [42] Linear and extended linear transformations for shared-memory multiprocessors
    Kulkarni, D
    Stumm, M
    COMPUTER JOURNAL, 1997, 40 (06): : 373 - 387
  • [43] An architecture for tolerating processor failures in shared-memory multiprocessors
    Banatre, M
    Gefflaut, A
    Joubert, P
    Morin, C
    Lee, PA
    IEEE TRANSACTIONS ON COMPUTERS, 1996, 45 (10) : 1101 - 1115
  • [44] Token coherence: A new framework for shared-memory multiprocessors
    Martin, MMK
    Hill, MD
    Wood, DA
    IEEE MICRO, 2003, 23 (06) : 108 - 116
  • [45] Perceptron based consumer prediction in shared-memory multiprocessors
    Leventhal, Sean
    Franklin, Manoj
    PROCEEDINGS 2006 INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, 2007, : 148 - 154
  • [46] A SUPERNODAL CHOLESKY FACTORIZATION ALGORITHM FOR SHARED-MEMORY MULTIPROCESSORS
    NG, E
    PEYTON, BW
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1993, 14 (04): : 761 - 769
  • [47] Matrix multiplication performance on commodity shared-memory multiprocessors
    Tsilikas, G
    Fleury, M
    INTERNATIONAL CONFERENCE ON PARALLEL COMPUTING IN ELECTRICAL ENGINEERING, 2004, : 13 - 18
  • [48] Parallel intersection counting on shared-memory multiprocessors and GPUs
    Marzolla, Moreno
    Birolo, Giovanni
    D'Angelo, Gabriele
    Fariselli, Piero
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 159 : 423 - 431
  • [49] Interactive volume rendering on clusters of shared-memory multiprocessors
    Palmer, ME
    Taylor, S
    Totty, B
    PARALLEL COMPUTATIONAL FLUID DYNAMICS: IMPLEMENTATIONS AND RESULTS USING PARALLEL COMPUTERS, 1996, : 323 - 330
  • [50] Computing global virtual time in shared-memory multiprocessors
    Georgia Inst of Technology, Atlanta, United States
    ACM Trans Model Comput Simul, 4 (425-446):