Impact of GPU Memory Access Patterns on FDTD

被引：0

作者：

Livesey, Matthew ^{[1
]}

Stack, James F., Jr. ^{[1
]}

Costen, Fumie ^{[1
]}

Nanri, Takeshi ^{[1
]}

Nakashima, Norimasa ^{[1
]}

Fujino, Seiji ^{[1
]}

机构：

[1] Accenture, Manchester M21 9HE, Lancs, England

来源：

2012 IEEE ANTENNAS AND PROPAGATION SOCIETY INTERNATIONAL SYMPOSIUM (APSURSI) | 2012年

关键词：

GRAPHICS;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The application of General Purpose computing on a GPU is an effective way to accelerate the FDTD method. This work explores the different domain decomposition techniques from the literature and extends the theoretically best approach with additional flexibility. We examine the performance on both Tesla and Fermi architecture GPUs and identify the best way to determine the GPU parameters for the proposed method.

引用

页数：2

共 50 条

[21] A Novel Parallel FDTD Algorithm on Non-Uniform Memory Access Multiprocessors
Guo, Xiaomei
2016 IEEE/ACIS 15TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2016, : 1161 - 1163
[22] Recovering memory access patterns of executable programs
Ketterlin, Alain
Clauss, Philippe
SCIENCE OF COMPUTER PROGRAMMING, 2014, 80 : 440 - 456
[23] A Simple GPU Implementation of FDTD/PBC Algorithm
Demir, Veysel
2015 31st International Review of Progress in Applied Computational Electromagnetics (ACES) Vol 31, 2015,
[24] The Research of Parallel FDTD Method Based on GPU
Zhou, Wei
Li, Lizheng
Cheng, Xinming
2015 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND INTELLIGENT CONTROL (ISIC 2015), 2015, : 404 - 408
[25] GPU based FDTD solver with CPML boundaries
Inman, Matthew J.
Elsherbeni, Atef Z.
Maloney, James G.
Baker, Bradford N.
2007 IEEE ANTENNAS AND PROPAGATION SOCIETY INTERNATIONAL SYMPOSIUM, VOLS 1-12, 2007, : 4785 - +
[26] A GPU approach to FDTD for Radio Coverage Prediction
Valcarce, Alvaro
De La Roche, Guillaume
Zhang, Jie
2008 11TH IEEE SINGAPORE INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS (ICCS), VOLS 1-3, 2008, : 1585 - 1590
[27] Memory access protocols: certified data-race freedom for GPU kernels
Cogumbreiro, Tiago
Lange, Julien
Liew, Dennis
Zicarelli, Hannah
FORMAL METHODS IN SYSTEM DESIGN, 2024, 63 (1-3) : 134 - 171
[28] A Memory Access Reduced Sort on Multi-core GPU<bold> </bold>
Guo, Chengxin
Chen, Hong
Li, Cuiping
Wu, Tianzhen
IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 586 - 593
[29] Accelerating computation of Euclidean distance map using the GPU with efficient memory access
Man, Duhu
Uda, Kenji
Ito, Yasuaki
Nakano, Koji
INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2013, 28 (05) : 383 - 406
[30] Optimizing non-coalesced memory access for irregular applications with GPU computing
Zheng, Ran
Liu, Yuan-dong
Jin, Hai
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (09) : 1285 - 1301

← 1 2 3 4 5 →