MRPB: Memory Request Prioritization for Massively Parallel Processors

Cited by: 0
Authors
Jia, Wenhao [1]
Shaw, Kelly A. [2]
Martonosi, Margaret [1]
Affiliations
[1] Princeton Univ, Princeton, NJ 08544 USA
[2] Univ Richmond, Richmond, VA 23173 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Massively parallel, throughput-oriented systems such as graphics processing units (GPUs) offer high performance for a broad range of programs. They are, however, complex to program, especially because of their intricate memory hierarchies with multiple address spaces. In response, modern GPUs have widely adopted caches, hoping to provide smoother reductions in memory access traffic and latency. Unfortunately, GPU caches often have a mixed or unpredictable performance impact due to cache contention that results from the high thread counts in GPUs. We propose the memory request prioritization buffer (MRPB) to ease GPU programming and improve GPU performance. This hardware structure improves caching efficiency of massively parallel workloads by applying two prioritization methods, request reordering and cache bypassing, to memory requests before they access a cache. MRPB then releases requests into the cache in a more cache-friendly order. The result is drastically reduced cache contention and improved use of the limited per-thread cache capacity. For a simulated 16 KB L1 cache, MRPB improves the average performance of the entire PolyBench and Rodinia suites by 2.65x and 1.27x respectively, outperforming a state-of-the-art GPU cache management technique.
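The reordering idea in the abstract can be illustrated with a toy software model. The sketch below is not the MRPB hardware design and does not reproduce any result from the paper. It assumes a hypothetical 4-line fully associative LRU cache, a hypothetical request stream in which four warps each sweep their own private lines, and it uses the warp ID as the grouping key, which is an assumption made here for illustration. Cache bypassing is not modeled.

```python
from collections import OrderedDict, deque


class LRUCache:
    """Toy fully associative LRU cache that tracks line addresses only."""

    def __init__(self, num_lines):
        self.num_lines = num_lines
        self.lines = OrderedDict()

    def access(self, addr):
        """Return True on a hit; on a miss, insert addr and evict the LRU line."""
        if addr in self.lines:
            self.lines.move_to_end(addr)
            return True
        if len(self.lines) >= self.num_lines:
            self.lines.popitem(last=False)
        self.lines[addr] = True
        return False


def count_hits(requests, num_lines):
    """Replay (warp, addr) requests through a fresh cache and count hits."""
    cache = LRUCache(num_lines)
    return sum(cache.access(addr) for _, addr in requests)


def reorder_by_warp(requests):
    """Sort requests into one FIFO queue per warp ID (assumed key),
    then drain the queues one at a time in warp-ID order."""
    queues = {}
    for warp, addr in requests:
        queues.setdefault(warp, deque()).append((warp, addr))
    drained = []
    for warp in sorted(queues):
        drained.extend(queues[warp])
    return drained


def make_interleaved_stream(num_warps=4, lines_per_warp=4, passes=3):
    """Hypothetical workload: each warp sweeps its own private lines
    `passes` times, with the warps interleaved round-robin."""
    stream = []
    for _ in range(passes):
        for i in range(lines_per_warp):
            for w in range(num_warps):
                stream.append((w, w * 100 + i))
    return stream


stream = make_interleaved_stream()
baseline_hits = count_hits(stream, num_lines=4)
reordered_hits = count_hits(reorder_by_warp(stream), num_lines=4)
print(baseline_hits, reordered_hits)  # 0 32
```

In this toy setting the interleaved stream cycles through 16 distinct lines in a 4-line cache and never hits, while the grouped stream lets each warp reuse its 4 lines before the next warp's requests arrive. Real GPU behavior depends on many factors this model leaves out.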
Pages: 272-283
Number of pages: 12