Detection and GPU Accelerationof 3D FDTD Algorithms Based on Memory Access Patterns

被引:0
|
作者
Shao, Ran [1 ]
Linton, David [1 ]
Spence, Ivor [1 ]
Milligan, Peter [1 ]
Zheng, Ning [1 ]
机构
[1] Queens Univ Belfast, Sch Elect Elect Engn & Comp Sci, Belfast, Antrim, North Ireland
关键词
FDTD; Memory access pattern; LLVM; CUDA; 5.0; RADIATION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A semi-automatic tool is reported that first analyzes the sequential FDTD program to obtain memory access patterns and related features, and then optimizes the FDTD program with combined use of several types of CUDA memory on both Fermi and Kepler architecture GPUs. The experiments show a 13% and 18% speedup using Fermi and Kepler GPUs respectively compared to the GPU version program without optimization. Up to 142 times speedup is achieved compared to the sequential FDTD C program at a FDTD 3D mesh size of 250*250*250 (15.625 million mesh cells) with 10 layers CPML boundary conditions in 4096 time steps.
引用
收藏
页码:2520 / 2526
页数:7
相关论文
共 50 条
  • [21] An Efficient GPU Cache Architecture for Applications with Irregular Memory Access Patterns
    Li, Bingchao
    Wei, Jizeng
    Sun, Jizhou
    Annavaram, Murali
    Kim, Nam Sung
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2019, 16 (03) : 1 - 24
  • [22] Memory Access Patterns: The Missing Piece of the Multi-GPU Puzzle
    Ben-Nun, Tal
    Levy, Ely
    Barak, Amnon
    Rubin, Eri
    PROCEEDINGS OF SC15: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2015,
  • [23] Uncertainty Characterization for 3D Object Detection Algorithms
    Ding, Bao Ming
    Huangfu, Yixin
    Habibi, Saeid
    2023 IEEE TRANSPORTATION ELECTRIFICATION CONFERENCE & EXPO, ITEC, 2023,
  • [24] Optimizing power efficiency for 3D stacked GPU-in-memory architecture
    Wen, Wen
    Yang, Jun
    Zhang, Youtao
    MICROPROCESSORS AND MICROSYSTEMS, 2017, 49 : 44 - 53
  • [25] Novel 3D GPU based numerical parallel diffusion algorithms in cylindrical coordinates for health care simulation
    Jiang, Beini
    Dai, Weizhong
    Khaliq, Abdul
    Carey, Michelle
    Zhou, Xiaobo
    Zhang, Le
    MATHEMATICS AND COMPUTERS IN SIMULATION, 2015, 109 : 1 - 19
  • [26] A Fast Runtime Visualization of a GPU-Based 3D-FDTD Electromagnetic Simulation
    Aoki, Kota
    Dohi, Keisuke
    Shibata, Yuichiro
    Oguri, Kiyoshi
    Fujimoto, Takafumi
    2013 FIRST INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING (CANDAR), 2013, : 30 - 37
  • [27] GPU-based 3D wavelet reconstruction with tileboarding
    Garcia, A
    Shen, HW
    VISUAL COMPUTER, 2005, 21 (8-10): : 755 - 763
  • [28] GPU-based 3D wavelet reconstruction with tileboarding
    Antonio Garcia
    Han-Wei Shen
    The Visual Computer, 2005, 21 : 755 - 763
  • [29] 3D solid models rendering based on GPU acceleration
    School of Computer Science and Software, Hangzhou Dianzi University, Hangzhou 310018, China
    Tien Tzu Hsueh Pao, 2008, SUPPL. (144-146):
  • [30] A GPU-based Parallel Slicer for 3D Printing
    Zhang, Xipeng
    Xiong, Gang
    Shen, Zhen
    Zhao, Yiyao
    Guo, Chao
    Dong, Xisong
    2017 13TH IEEE CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2017, : 55 - 60