PARALLEL 3D FINITE-DIFFERENCE TIME-DOMAIN METHOD ON MULTI-GPU SYSTEMS

被引:5
|
作者
Du, Liu-Ge [1 ]
Li, Kang [1 ]
Kong, Fan-Min [1 ]
Hu, Yuan [1 ]
机构
[1] Shandong Univ, Sch Informat Sci & Engn, Jinan 250100, Peoples R China
来源
关键词
Finite-difference time-domain; graphics processing unit; convolutional perfect match layer; compute unified device architecture; PERFECTLY MATCHED LAYER; FDTD; BOUNDARY;
D O I
10.1142/S012918311101618X
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Finite-difference time-domain (FDTD) is a popular but computational intensive method to solve Maxwell's equations for electrical and optical devices simulation. This paper presents implementations of three-dimensional FDTD with convolutional perfect match layer (CPML) absorbing boundary conditions on graphics processing unit (GPU). Electromagnetic fields in Yee cells are calculated in parallel millions of threads arranged as a grid of blocks with compute unified device architecture (CUDA) programming model and considerable speedup factors are obtained versus sequential CPU code. We extend the parallel algorithm to multiple GPUs in order to solve electrically large structures. Asynchronous memory copy scheme is used in data exchange procedure to improve the computation efficiency. We successfully use this technique to simulate pointwise source radiation and validate the result by comparison to high precision computation, which shows favorable agreements. With four commodity GTX295 graphics cards on a single personal computer, more than 4000 million Yee cells can be updated in one second, which is hundreds of times faster than traditional CPU computation.
引用
收藏
页码:107 / 121
页数:15
相关论文
共 50 条
  • [1] Multi-GPU Accelerated Finite-difference Time-domain Solver in Open Computing Language
    Stefanski, T. P.
    Chavannes, N.
    Kuster, N.
    [J]. PIERS 2011 MARRAKESH: PROGRESS IN ELECTROMAGNETICS RESEARCH SYMPOSIUM, 2011, : 1850 - 1853
  • [2] The parallel algorithms of the 3D space decomposition in the finite-difference time-domain method
    Walendziuk, Wojciech
    [J]. PROCEEDINGS OF THE XVII INTERNATIONAL CONFERENCE ON ELECTROMAGNETIC DISTURBANCES, EMD 2007, 2007, : 266 - +
  • [3] Parallel Spatially Dispersive Finite-Difference Time-Domain Method Based on GPU
    Song, Wei
    Bao, Meng
    Sheng, Xin-Qing
    [J]. 2012 10TH INTERNATIONAL SYMPOSIUM ON ANTENNAS, PROPAGATION & EM THEORY (ISAPE), 2012, : 1003 - 1007
  • [4] Hardware acceleration of the 3D finite-difference time-domain method
    Durbano, JP
    Humphrey, JR
    Ortiz, FE
    Curt, PF
    Prather, DW
    Mirotznik, MS
    [J]. IEEE ANTENNAS AND PROPAGATION SOCIETY SYMPOSIUM, VOLS 1-4 2004, DIGEST, 2004, : 77 - 80
  • [5] A New Parallel Frequency-Domain Finite-Difference Algorithm Using Multi-GPU
    Wang, Yijing
    He, Xinbo
    Wei, Bin
    [J]. IEEE MICROWAVE AND WIRELESS TECHNOLOGY LETTERS, 2024, 34 (08): : 971 - 974
  • [6] Multi-GPU Implementation of a 3D Finite Difference Time Domain Earthquake Code on Heterogeneous Supercomputers
    Zhou, Jun
    Cui, Yifeng
    Poyraz, Efecan
    Choi, Dong Ju
    Guest, Clark C.
    [J]. 2013 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2013, 18 : 1255 - 1264
  • [7] 3D marine magnetotelluric modeling and inversion with the finite-difference time-domain method
    de Ryhove, Sebastien de la Kethulle
    Mittet, Rune
    [J]. GEOPHYSICS, 2014, 79 (06) : E269 - E286
  • [8] FPGA-based acceleration of the 3D finite-difference time-domain method
    Durbano, JP
    Ortiz, FE
    Humphrey, JR
    Curt, PF
    Prather, DW
    [J]. 12TH ANNUAL IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, PROCEEDINGS, 2004, : 156 - 163
  • [9] A Stable Subgridding Finite Difference Time Domain Method on Multi-GPU Cluster
    Ritter, Jan
    Benjes, Magnus
    Murso, Martin
    Wulf, Daniela
    Lange, Sebastian
    [J]. 2015 IEEE INTERNATIONAL SYMPOSIUM ON ELECTROMAGNETIC COMPATIBILITY (EMC), 2015, : 507 - 511
  • [10] Multi-time-step finite-difference time-domain method
    Zheng, Yang-Ming
    Chu, Qing-Xin
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2004, 32 (09): : 1504 - 1506