Simulation and reconstruction for 3D elastic wave using multi-GPU and CUDA-aware MPI

Authors
Cai, Wei [1]
Zhu, Peimin [1]
Li, Ziang [1]
Affiliations
[1] China Univ Geosci, Sch Geophys & Geomat, Wuhan, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
3D elastic FDTD; Wavefield reconstruction; High performance computing; Domain decomposition technique; Multi-GPU parallel computing; CUDA-aware MPI; REVERSE TIME MIGRATION; PROPAGATION; ACCELERATION; COMPUTATION; INVERSION; FIELD;
DOI
10.1016/j.cageo.2024.105616
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
3D finite-difference time-domain numerical simulation and reconstruction based on the domain decomposition technique are essential parts of high-performance computation for reverse-time migration and full-waveform inversion. However, the low GPU utilization in computing for small-sized models and the tremendous memory consumption for large-sized models may result in low computational efficiency and high memory costs. This paper proposes a contiguous memory management (CMM) method and a variable-order wavefield reconstruction (VWR) method. The CMM allocates the memory of many small-sized arrays used for MPI communications on a larger-sized contiguous memory block, which aims to reduce the number of MPI communications between subdomains and improve the communication bandwidth, thus reducing the MPI time overhead and improving the GPU utilization. Meanwhile, the VWR can flexibly set the number of layers of boundary wavefield used for source wavefield reconstruction according to the host memory capacity and accuracy requirements. Since one layer of boundary wavefield could be stored using the VWR, the memory consumption of host memory can be significantly alleviated. Numerical experiments show that GPU utilization in computing for the model with a size of 121^3 can be improved from 25% to 90% using the CMM method, and the VWR method can reduce memory consumption by about 86% while maintaining good accuracy in wavefield reconstruction. In addition, the issue of how to obtain a domain decomposition scheme with optimal performance is discussed in this paper.
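The contiguous-memory idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it uses plain Python on the host rather than CUDA device memory, and the function name `allocate_contiguous` and the sizes are hypothetical. It only shows the general pattern of carving several small communication arrays out of one larger block, so that the whole block could be handed to a single send instead of one send per array.

```python
from array import array


def allocate_contiguous(sizes, typecode="f"):
    """Allocate one contiguous block and return it with per-array views.

    `sizes` lists the element counts of the small arrays (hypothetical
    halo buffers). Each returned view aliases a slice of the block, so
    writing through a view writes directly into the shared block.
    """
    total = sum(sizes)
    block = array(typecode, [0.0]) * total  # single contiguous allocation
    whole = memoryview(block)
    views = []
    start = 0
    for n in sizes:
        views.append(whole[start:start + n])  # zero-copy sub-view
        start += n
    return block, views


# Two small "halo" arrays of 2 and 3 elements share one 5-element block.
block, views = allocate_contiguous([2, 3])
views[0][0] = 1.0   # first element of the first small array
views[1][2] = 4.0   # last element of the second small array
```

In a real multi-GPU code the block would live in device memory and be passed to a CUDA-aware MPI call; here the point is only that one buffer replaces several, which is why the number of messages can drop.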
Pages: 10
Related papers
50 in total
  • [31] Adaptive multi-GPU Exchange Monte Carlo for the 3D Random Field Ising Model
    Navarro, Cristobal A.
    Huang, Wei
    Deng, Youjin
    COMPUTER PHYSICS COMMUNICATIONS, 2016, 205 : 48 - 60
  • [32] GPU-ACCELERATED INTERACTIVE VISUALIZATION OF 3D VOLUMETRIC DATA USING CUDA
    Kumar, Piyush
    Agrawal, Anupam
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2013, 13 (02)
  • [33] Parallel implementation of 3D protein structure similarity searches using a GPU and the CUDA
    Dariusz Mrozek
    Miłosz Brożek
    Bożena Małysiak-Mrozek
    Journal of Molecular Modeling, 2014, 20
  • [34] PARALLEL STRATEGY OF FMBEM FOR 3D ELASTOSTATICS AND ITS GPU IMPLEMENTATION USING CUDA
    Xia, Zhaohui
    Wang, Qifu
    Huang, Yunbao
    Wei Yixiong
    Wang Yingjun
    PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2014, VOL 1A, 2014
  • [35] Parallel implementation of 3D protein structure similarity searches using a GPU and the CUDA
    Mrozek, Dariusz
    Brozek, Milosz
    Malysiak-Mrozek, Bozena
    JOURNAL OF MOLECULAR MODELING, 2014, 20 (02)
  • [36] Efficient Multi-GPU Calculation of Local Radiomic Features From 2D and 3D Images
    Neph, R.
    Sheng, K.
    MEDICAL PHYSICS, 2018, 45 (06) : E233 - E233
  • [37] Solving 3D anisotropic elastic wave equations on parallel GPU devices
    Weiss, Robin M.
    Shragge, Jeffrey
    GEOPHYSICS, 2013, 78 (02) : F7 - F15
  • [38] Accelerating 3-D Acoustic Full Waveform Inversion Using a Multi-GPU Cluster
    Chen, Yanling
    Zhu, Pei-Min
    Wen, Wudi
    Jiang, Jinpeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [39] Real-Time Visualize the 3D Reconstruction Procedure Using CUDA
    Bi, Wenyuan
    Chen, Zhiqiang
    Zhang, Li
    Xing, Yuxiang
    Wang, Yajie
    2009 IEEE NUCLEAR SCIENCE SYMPOSIUM CONFERENCE RECORD, VOLS 1-5, 2009, : 883 - +
  • [40] PARALLEL 3D FINITE-DIFFERENCE TIME-DOMAIN METHOD ON MULTI-GPU SYSTEMS
    Du, Liu-Ge
    Li, Kang
    Kong, Fan-Min
    Hu, Yuan
    INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2011, 22 (02) : 107 - 121