Equidistant Memory Access Coalescing on GPGPU

被引：0

作者：

Pei, Yulong ^{[1
]}

Yu, Licheng ^{[1
]}

Wu, Minghui ^{[2
]}

Chen, Tianzhou ^{[1
]}

机构：

[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China

[2] Zhejiang Univ City Coll, Hangzhou, Zhejiang, Peoples R China

来源：

2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS) | 2015年

关键词：

memory access coalescing; memory access distance; GPGPU;

D O I：

10.1109/HPCC-CSS-ICESS.2015.14

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

With the massive processing power, GPGPU can execute thousands of threads in parallel at the cost of high memory bandwidth to support the large number of concurrent memory requests. To alleviate the demands, GPGPU adopts memory access coalescing to reduce the memory requests issued to memory system. In this paper, we first introduced the concept of memory access distance, and classify GPGPU programs into three types according to their memory access distances. we discovered that programs with large but equal memory access distance fare popular in GPGPU, which, however, cannot be optimized by the original memory access coalescing. Thus, we proposed equidistant memory access coalescing, which is able to merge requests with any equal memory access distance. We evaluated our method with 30 benchmarks. Compared with original memory access coalescing, equidistant memory access coalescing can improve performance of 19 benchmarks among them. For the benchmarks with equal and large memory access distance, the average speedup is 151% and the maximum speedup is 200%. The memory access requests are reduced to 32% on average.

引用

页码：272 / 277

页数：6

共 50 条

[1] Two methods for combining original memory access coalescing and equivalent memory access coalescing on GPGPU
Pei, Yulong
Yu, Licheng
Wu, Minghui
Chen, Tianzhou
Lou, Xueqing
Zhang, Tiefei
[J]. 2016 13TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS) - PROCEEDINGS, 2016, : 48 - 53
[2] MEMORY ACCESS COALESCING - A TECHNIQUE FOR ELIMINATING REDUNDANT MEMORY ACCESSES
DAVIDSON, JW
JINTURKAR, S
[J]. SIGPLAN NOTICES, 1994, 29 (06): : 186 - 195
[3] Energy-Efficient Stream Compaction Through Filtering and Coalescing Accesses in GPGPU Memory Partitions
Segura, Albert
Arnau, Jose-Maria
Gonzalez, Antonio
[J]. IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (07) : 1711 - 1723
[4] Collaborative Coalescing of Redundant Memory Access for GPU System
Jiang, Fan
Li, Chengeng
Zhang, Wei
Xu, Jiang
[J]. 29TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2024, 2024, : 195 - 200
[5] Irregular accesses reorder unit: improving GPGPU memory coalescing for graph-based workloads
Albert Segura
Jose Maria Arnau
Antonio Gonzalez
[J]. The Journal of Supercomputing, 2023, 79 : 762 - 787
[6] Irregular accesses reorder unit: improving GPGPU memory coalescing for graph-based workloads
Segura, Albert
Arnau, Jose Maria
Gonzalez, Antonio
[J]. JOURNAL OF SUPERCOMPUTING, 2023, 79 (01): : 762 - 787
[7] An access pattern based adaptive mapping function for GPGPU scratchpad memory
Han, Feng
Li, Li
Wang, Kun
Feng, Fan
Pan, Hongbing
Sha, Jin
Lin, Jun
[J]. IEICE ELECTRONICS EXPRESS, 2017, 14 (12):
[8] Analyzing Memory Access on CPU-GPGPU Shared LLC Architecture
Ma, Jianliang
Yu, Licheng
Chen, Tianzhou
Wu, Minghui
[J]. 2015 14TH INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC), 2015, : 90 - 99
[9] Control Flow Coalescing on a Hybrid Dataflow/von Neumann GPGPU
Voitsechov, Dani
Etsion, Yoav
[J]. PROCEEDINGS OF THE 48TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-48), 2015, : 216 - 227
[10] Memory Coalescing for Hybrid Memory Cube
Wang, Xi
Leidel, John D.
Chen, Yong
[J]. PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2018,

← 1 2 3 4 5 →