GPU-accelerated MART and concurrent cross-correlation for tomographic PIV

Cited by: 18
Authors
Zeng, Xin [1, 2]
He, Chuangxin [1, 2]
Liu, Yingzheng [1, 2]
Affiliations
[1] Shanghai Jiao Tong Univ, Sch Mech Engn, Key Lab Educ, Minist Power Machinery & Engn, 800 Dongchuan Rd, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Gas Turbine Res Inst, 800 Dongchuan Rd, Shanghai 200240, Peoples R China
Funding
Natural Science Foundation of Shanghai; National Natural Science Foundation of China
Keywords
GRAPH ALGORITHMS; IMAGE; IMPLEMENTATION; LAYER;
DOI
10.1007/s00348-022-03444-3
Chinese Library Classification number
TH [Machinery and Instrument Industry]
Discipline classification code
0802
Abstract
This paper presents a Graphics Processing Unit (GPU)-accelerated method for large-scale data processing in tomographic particle image velocimetry (PIV). The multiplicative algebraic reconstruction technique (MART) is used to reconstruct three-dimensional (3D) particle fields, and cross-correlation based on the fast Fourier transform (FFT) is used to generate the displacement vectors. The velocity field reconstruction is ported from CPU code to GPU code with the Compute Unified Device Architecture (CUDA) C programming model to improve efficiency. For similar reconstruction tasks, a dedicated thread grid hierarchy is designed to construct the corresponding computational kernel functions, and each task is launched in a single thread. A modified pixel batch-processing strategy is then used to manage GPU memory access. Subsequently, asynchronous stream concurrency is used to generate the velocity field with the GPU cuFFT library. A synthetic 3D experiment with a ring vortex is carried out to verify the accuracy and efficiency of the developed method. The parallel results agree well with the generated data and with conclusions reported in the literature. With OpenMP on a multi-core CPU (Intel Xeon Platinum 8168), the speed-up ratio converges to 2.5× for MFG-MART and 3.0× for cross-correlation. Relative to a 24-core CPU implementation, a GPU (NVIDIA Tesla V100S, 32 GB) at maximum memory usage achieves a speed-up ratio of over 20× for parallel MFG-MART and 4× for concurrent cross-correlation. Measurements of turbulent flow in a circular jet at a Reynolds number of 3,000 are used to examine the efficiency gain of the parallelized framework in a real experimental setting. For the synthetic volume reconstruction of 700 × 700 × 140 voxels with cross-correlation using a 41³-voxel window at 75% overlap, and for the experimental volume reconstruction of 550 × 1100 × 550 voxels with cross-correlation using a 32³-voxel window at 50% overlap, one velocity field frame can be completed within 2 min in each case.
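For readers unfamiliar with MART, the multiplicative update mentioned in the abstract can be illustrated with a minimal serial sketch. This is the textbook MART iteration, not the authors' MFG-MART variant or their CUDA kernels. The function name `mart_reconstruct`, the dense weight matrix, the relaxation factor `mu`, and the toy data are all assumptions made for illustration.

```python
def mart_reconstruct(weights, intensities, n_voxels, mu=1.0, n_iter=5):
    """Textbook MART sketch (hypothetical helper, not the paper's code).

    weights[i][j]  : contribution of voxel j to pixel i (assumed dense here)
    intensities[i] : recorded intensity of pixel i
    mu             : relaxation factor, assumed to satisfy 0 < mu <= 1
    """
    E = [1.0] * n_voxels  # uniform, strictly positive first guess
    for _ in range(n_iter):
        for i, pixel_intensity in enumerate(intensities):
            row = weights[i]
            # Project the current voxel field onto pixel i.
            projection = sum(w * e for w, e in zip(row, E))
            if projection <= 0.0:
                continue  # nothing along this line of sight to correct
            ratio = pixel_intensity / projection
            # Multiplicative correction, applied only to voxels seen by pixel i.
            for j, w in enumerate(row):
                if w > 0.0:
                    E[j] *= ratio ** (mu * w)
    return E


# Toy case: 3 voxels, two 2-pixel "views"; the true field is [0, 3, 0].
weights = [
    [1.0, 1.0, 0.0],  # view A, pixel 0 sees voxels 0 and 1
    [0.0, 0.0, 1.0],  # view A, pixel 1 sees voxel 2
    [1.0, 0.0, 0.0],  # view B, pixel 0 sees voxel 0
    [0.0, 1.0, 1.0],  # view B, pixel 1 sees voxels 1 and 2
]
intensities = [3.0, 0.0, 0.0, 3.0]
print(mart_reconstruct(weights, intensities, n_voxels=3))  # prints [0.0, 3.0, 0.0]
```

Each pixel's update touches only the voxels along its line of sight, which is the kind of independent, similar task that lends itself to mapping onto GPU threads; how the paper actually partitions this work is not specified beyond what the abstract states.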
Pages: 18