Detection and GPU Accelerationof 3D FDTD Algorithms Based on Memory Access Patterns

被引:0
|
作者
Shao, Ran [1 ]
Linton, David [1 ]
Spence, Ivor [1 ]
Milligan, Peter [1 ]
Zheng, Ning [1 ]
机构
[1] Queens Univ Belfast, Sch Elect Elect Engn & Comp Sci, Belfast, Antrim, North Ireland
关键词
FDTD; Memory access pattern; LLVM; CUDA; 5.0; RADIATION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A semi-automatic tool is reported that first analyzes the sequential FDTD program to obtain memory access patterns and related features, and then optimizes the FDTD program with combined use of several types of CUDA memory on both Fermi and Kepler architecture GPUs. The experiments show a 13% and 18% speedup using Fermi and Kepler GPUs respectively compared to the GPU version program without optimization. Up to 142 times speedup is achieved compared to the sequential FDTD C program at a FDTD 3D mesh size of 250*250*250 (15.625 million mesh cells) with 10 layers CPML boundary conditions in 4096 time steps.
引用
收藏
页码:2520 / 2526
页数:7
相关论文
共 50 条
  • [1] Impact of GPU Memory Access Patterns on FDTD
    Livesey, Matthew
    Stack, James F., Jr.
    Costen, Fumie
    Nanri, Takeshi
    Nakashima, Norimasa
    Fujino, Seiji
    2012 IEEE ANTENNAS AND PROPAGATION SOCIETY INTERNATIONAL SYMPOSIUM (APSURSI), 2012,
  • [2] Efficient Memory Access Patterns for Solving 3D Laplace Equation on GPU
    Akhtar, Muhammad Naveed
    Durad, Muhammad Hanif
    Usman, Anila
    Mughal, Muhammad Abid
    IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY TRANSACTION A-SCIENCE, 2018, 42 (A2): : 623 - 633
  • [3] Efficient Memory Access Patterns for Solving 3D Laplace Equation on GPU
    Muhammad Naveed Akhtar
    Muhammad Hanif Durad
    Anila Usman
    Muhammad Abid Mughal
    Iranian Journal of Science and Technology, Transactions A: Science, 2018, 42 : 623 - 633
  • [4] GPU-based accelerated 2D and 3D FDTD solvers
    Price, Daniel K.
    Humphrey, John R.
    Kelmelis, Eric J.
    PHYSICS AND SIMULATION OF OPTOELECTRONIC DEVICES XV, 2007, 6468
  • [5] Aspects of GPU Perfomance in Algorithms with Random Memory Access
    Kashkovsky, Alexander V.
    Shershnev, Anton A.
    Vashchenkov, Pavel V.
    PROCEEDINGS OF THE XXV CONFERENCE ON HIGH-ENERGY PROCESSES IN CONDENSED MATTER (HEPCM 2017), 2017, 1893
  • [6] FDTD Analysis of 3D Lightning Problems with Material Uncertainties on GPU Architecture
    Pyrialakos, Georgios
    Zygiridis, Theodoros
    Kantarzis, Nikolaos
    Tsiboukis, Theodoros
    2014 INTERNATIONAL SYMPOSIUM ON ELECTROMAGNETIC COMPATIBILITY (EMC EUROPE), 2014, : 577 - 582
  • [7] Efficient Parallel Algorithms for 3D Laplacian Smoothing on the GPU
    Xiao, Lei
    Yang, Guoxiang
    Zhao, Kunyang
    Mei, Gang
    APPLIED SCIENCES-BASEL, 2019, 9 (24):
  • [8] Access devices for 3D crosspoint memory
    Burr, Geoffrey W.
    Shenoy, Rohit S.
    Virwani, Kumar
    Narayanan, Pritish
    Padilla, Alvaro
    Kurdi, Buelent
    Hwang, Hyunsang
    JOURNAL OF VACUUM SCIENCE & TECHNOLOGY B, 2014, 32 (04):
  • [9] GPU-based Collision Detection and Response for Particles on 3D Models
    Hsieh, Hsien-Hsi
    Tai, Wen-Kai
    Chang, Chin-Chen
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2010, 26 (05) : 1619 - 1635
  • [10] 3D FDTD算法的GPU实现及优化技术研究
    宋庆增
    张二路
    韩冬
    吕华阳
    武继刚
    计算机与数字工程, 2015, 43 (10) : 1775 - 1779+1823