GPU-accelerated MART and concurrent cross-correlation for tomographic PIV

被引:18
|
作者
Zeng, Xin [1 ,2 ]
He, Chuangxin [1 ,2 ]
Liu, Yingzheng [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Mech Engn, Key Lab Educ, Minist Power Machinery & Engn, 800 Dongchuan Rd, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Gas Turbine Res Inst, 800 Dongchuan Rd, Shanghai 200240, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
GRAPH ALGORITHMS; IMAGE; IMPLEMENTATION; LAYER;
D O I
10.1007/s00348-022-03444-3
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
This paper presents a novel Graphics Processing Unit (GPU)-accelerated method for large-scale data processing of tomographic particle image velocimetry. The multiplicative algebraic reconstruction technique (MART) is utilized to reconstruct three-dimensional (3D) particle fields, and cross-correlation with fast Fourier transform is used to generate the displacement vectors. The Compute Unified Device Architecture (CUDA) C programming model is used to port the velocity field reconstruction from CPU code to GPU code to improve efficiency. For similar reconstruction tasks, a particular thread grid hierarchy is designed to construct the corresponding computational kernel functions, and each task is launched in a single thread. A modified strategy of pixel batch processing is then used to manage the GPU memory access. Subsequently, the asynchronous stream concurrency is used to generate the velocity field with the GPU cuFFT library. A synthetic 3D experiment with a ring vortex is carried out to verify the accuracy and efficiency of the developed method. The parallel results agree well with the generated data and other research conclusions reported in the literature. The speed-up ratio by multi-core CPU (Intel (R) Xeon (R) Platinum 8168) parallel implementation with OpenMP converges to 2.5 x in MFG-MART and 3.0 x in cross-correlation. In contrast to a 24-core CPU implementation, a GPU (NVIDIA Tesla V100S, 32 GB) under maximum memory usage achieves an impressive speed-up ratio of over 20 x in parallel MFG-MART and 4 x in concurrent cross-correlation. The measurement of turbulent flow in a circular jet flow at Reynolds 3,000 is used to examine the efficiency promotion of the parallelized framework in real experimental settings. For the synthetic volume reconstruction of 700 x 700 x 140 voxels and cross-correlation with 41(3) voxels window in a 75% overlap, and the experimental volume reconstruction of 550 x 1100 x 550 voxels and cross-correlation with 32 3 voxels window in a 50% overlap, a frame of velocity field can be completed within 2 min in each domain. [GRAPHICS] .
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Conditional averaging of PIV plane wake data using a cross-correlation approach
    E. Konstantinidis
    S. Balabani
    M. Yianneskis
    Experiments in Fluids, 2005, 39 : 38 - 47
  • [22] Conditional averaging of PIV plane wake data using a cross-correlation approach
    Konstantinidis, E
    Balabani, S
    Yianneskis, M
    EXPERIMENTS IN FLUIDS, 2005, 39 (01) : 38 - 47
  • [23] Panda: A Compiler Framework for Concurrent CPUGPU Execution of 3D Stencil Computations on GPU-accelerated Supercomputers
    Sourouri, Mohammed
    Baden, Scott B.
    Cai, Xing
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2017, 45 (03) : 711 - 729
  • [24] Accelerating a cross-correlation score function to search modifications using a single GPU
    Kim, Hyunwoo
    Han, Sunggeun
    Um, Jung-Ho
    Park, Kyongseok
    BMC BIOINFORMATICS, 2018, 19
  • [25] Accelerating a cross-correlation score function to search modifications using a single GPU
    Hyunwoo Kim
    Sunggeun Han
    Jung-Ho Um
    Kyongseok Park
    BMC Bioinformatics, 19
  • [26] The effect of a discrete window offset on the accuracy of cross-correlation analysis of digital PIV recordings
    Westerweel, J
    Dabiri, D
    Gharib, M
    EXPERIMENTS IN FLUIDS, 1997, 23 (01) : 20 - 28
  • [27] Evaluation of FFT-based cross-correlation algorithms for PIV in a periodic grooved channel
    Gilbert, R
    Johnson, DA
    EXPERIMENTS IN FLUIDS, 2003, 34 (04) : 473 - 483
  • [28] The effect of a discrete window offset on the accuracy of cross-correlation analysis of digital PIV recordings
    J. Westerweel
    D. Dabiri
    M. Gharib
    Experiments in Fluids, 1997, 23 : 20 - 28
  • [29] F2DPR: a fast and robust cross-correlation technique for volumetric PIV
    Earl, Thomas
    Jeon, Young Jin
    Lecordier, Bertrand
    David, Laurent
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2016, 27 (08)
  • [30] KiDS-450: tomographic cross-correlation of galaxy shear with Planck lensing
    Harnois-Deraps, Joachim
    Troster, Tilman
    Chisari, Nora Elisa
    Heymans, Catherine
    van Waerbeke, Ludovic
    Asgari, Marika
    Bilicki, Maciej
    Choi, Ami
    Erben, Thomas
    Hildebrandt, Hendrik
    Hoekstra, Henk
    Joudaki, Shahab
    Kuijken, Konrad
    Merten, Julian
    Miller, Lance
    Robertson, Naomi
    Schneider, Peter
    Viola, Massimo
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2017, 471 (02) : 1619 - 1633