Equidistant Memory Access Coalescing on GPGPU

被引:0
|
作者
Pei, Yulong [1 ]
Yu, Licheng [1 ]
Wu, Minghui [2 ]
Chen, Tianzhou [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China
[2] Zhejiang Univ City Coll, Hangzhou, Zhejiang, Peoples R China
关键词
memory access coalescing; memory access distance; GPGPU;
D O I
10.1109/HPCC-CSS-ICESS.2015.14
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
With the massive processing power, GPGPU can execute thousands of threads in parallel at the cost of high memory bandwidth to support the large number of concurrent memory requests. To alleviate the demands, GPGPU adopts memory access coalescing to reduce the memory requests issued to memory system. In this paper, we first introduced the concept of memory access distance, and classify GPGPU programs into three types according to their memory access distances. we discovered that programs with large but equal memory access distance fare popular in GPGPU, which, however, cannot be optimized by the original memory access coalescing. Thus, we proposed equidistant memory access coalescing, which is able to merge requests with any equal memory access distance. We evaluated our method with 30 benchmarks. Compared with original memory access coalescing, equidistant memory access coalescing can improve performance of 19 benchmarks among them. For the benchmarks with equal and large memory access distance, the average speedup is 151% and the maximum speedup is 200%. The memory access requests are reduced to 32% on average.
引用
收藏
页码:272 / 277
页数:6
相关论文
共 50 条
  • [1] Two methods for combining original memory access coalescing and equivalent memory access coalescing on GPGPU
    Pei, Yulong
    Yu, Licheng
    Wu, Minghui
    Chen, Tianzhou
    Lou, Xueqing
    Zhang, Tiefei
    [J]. 2016 13TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS) - PROCEEDINGS, 2016, : 48 - 53
  • [2] MEMORY ACCESS COALESCING - A TECHNIQUE FOR ELIMINATING REDUNDANT MEMORY ACCESSES
    DAVIDSON, JW
    JINTURKAR, S
    [J]. SIGPLAN NOTICES, 1994, 29 (06): : 186 - 195
  • [3] Energy-Efficient Stream Compaction Through Filtering and Coalescing Accesses in GPGPU Memory Partitions
    Segura, Albert
    Arnau, Jose-Maria
    Gonzalez, Antonio
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (07) : 1711 - 1723
  • [4] Collaborative Coalescing of Redundant Memory Access for GPU System
    Jiang, Fan
    Li, Chengeng
    Zhang, Wei
    Xu, Jiang
    [J]. 29TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2024, 2024, : 195 - 200
  • [5] Irregular accesses reorder unit: improving GPGPU memory coalescing for graph-based workloads
    Albert Segura
    Jose Maria Arnau
    Antonio Gonzalez
    [J]. The Journal of Supercomputing, 2023, 79 : 762 - 787
  • [6] Irregular accesses reorder unit: improving GPGPU memory coalescing for graph-based workloads
    Segura, Albert
    Arnau, Jose Maria
    Gonzalez, Antonio
    [J]. JOURNAL OF SUPERCOMPUTING, 2023, 79 (01): : 762 - 787
  • [7] An access pattern based adaptive mapping function for GPGPU scratchpad memory
    Han, Feng
    Li, Li
    Wang, Kun
    Feng, Fan
    Pan, Hongbing
    Sha, Jin
    Lin, Jun
    [J]. IEICE ELECTRONICS EXPRESS, 2017, 14 (12):
  • [8] Analyzing Memory Access on CPU-GPGPU Shared LLC Architecture
    Ma, Jianliang
    Yu, Licheng
    Chen, Tianzhou
    Wu, Minghui
    [J]. 2015 14TH INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC), 2015, : 90 - 99
  • [9] Control Flow Coalescing on a Hybrid Dataflow/von Neumann GPGPU
    Voitsechov, Dani
    Etsion, Yoav
    [J]. PROCEEDINGS OF THE 48TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-48), 2015, : 216 - 227
  • [10] Memory Coalescing for Hybrid Memory Cube
    Wang, Xi
    Leidel, John D.
    Chen, Yong
    [J]. PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2018,