Impact of GPU Memory Access Patterns on FDTD

Cited by: 0
Authors
Livesey, Matthew [1]
Stack, James F., Jr. [1]
Costen, Fumie [1]
Nanri, Takeshi [1]
Nakashima, Norimasa [1]
Fujino, Seiji [1]
Affiliations
[1] Accenture, Manchester M21 9HE, Lancs, England
Keywords
GRAPHICS;
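DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808; 0809;
Abstract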
The application of General Purpose computing on a GPU is an effective way to accelerate the FDTD method. This work explores the different domain decomposition techniques from the literature and extends the theoretically best approach with additional flexibility. We examine the performance on both Tesla and Fermi architecture GPUs and identify the best way to determine the GPU parameters for the proposed method.
Pages: 2
Related papers
50 in total
  • [21] A Novel Parallel FDTD Algorithm on Non-Uniform Memory Access Multiprocessors
    Guo, Xiaomei
    2016 IEEE/ACIS 15TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2016, : 1161 - 1163
  • [22] Recovering memory access patterns of executable programs
    Ketterlin, Alain
    Clauss, Philippe
    SCIENCE OF COMPUTER PROGRAMMING, 2014, 80 : 440 - 456
  • [23] A Simple GPU Implementation of FDTD/PBC Algorithm
    Demir, Veysel
    2015 31st International Review of Progress in Applied Computational Electromagnetics (ACES) Vol 31, 2015
  • [24] The Research of Parallel FDTD Method Based on GPU
    Zhou, Wei
    Li, Lizheng
    Cheng, Xinming
    2015 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND INTELLIGENT CONTROL (ISIC 2015), 2015, : 404 - 408
  • [25] GPU based FDTD solver with CPML boundaries
    Inman, Matthew J.
    Elsherbeni, Atef Z.
    Maloney, James G.
    Baker, Bradford N.
    2007 IEEE ANTENNAS AND PROPAGATION SOCIETY INTERNATIONAL SYMPOSIUM, VOLS 1-12, 2007, : 4785 - +
  • [26] A GPU approach to FDTD for Radio Coverage Prediction
    Valcarce, Alvaro
    De La Roche, Guillaume
    Zhang, Jie
    2008 11TH IEEE SINGAPORE INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS (ICCS), VOLS 1-3, 2008, : 1585 - 1590
  • [27] Memory access protocols: certified data-race freedom for GPU kernels
    Cogumbreiro, Tiago
    Lange, Julien
    Liew, Dennis
    Zicarelli, Hannah
    FORMAL METHODS IN SYSTEM DESIGN, 2024, 63 (1-3) : 134 - 171
  • [28] A Memory Access Reduced Sort on Multi-core GPU
    Guo, Chengxin
    Chen, Hong
    Li, Cuiping
    Wu, Tianzhen
    IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 586 - 593
  • [29] Accelerating computation of Euclidean distance map using the GPU with efficient memory access
    Man, Duhu
    Uda, Kenji
    Ito, Yasuaki
    Nakano, Koji
    INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2013, 28 (05) : 383 - 406
  • [30] Optimizing non-coalesced memory access for irregular applications with GPU computing
    Zheng, Ran
    Liu, Yuan-dong
    Jin, Hai
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (09) : 1285 - 1301