GPU-accelerated MART and concurrent cross-correlation for tomographic PIV

被引:18
|
作者
Zeng, Xin [1 ,2 ]
He, Chuangxin [1 ,2 ]
Liu, Yingzheng [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Mech Engn, Key Lab Educ, Minist Power Machinery & Engn, 800 Dongchuan Rd, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Gas Turbine Res Inst, 800 Dongchuan Rd, Shanghai 200240, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
GRAPH ALGORITHMS; IMAGE; IMPLEMENTATION; LAYER;
D O I
10.1007/s00348-022-03444-3
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
This paper presents a novel Graphics Processing Unit (GPU)-accelerated method for large-scale data processing of tomographic particle image velocimetry. The multiplicative algebraic reconstruction technique (MART) is utilized to reconstruct three-dimensional (3D) particle fields, and cross-correlation with fast Fourier transform is used to generate the displacement vectors. The Compute Unified Device Architecture (CUDA) C programming model is used to port the velocity field reconstruction from CPU code to GPU code to improve efficiency. For similar reconstruction tasks, a particular thread grid hierarchy is designed to construct the corresponding computational kernel functions, and each task is launched in a single thread. A modified strategy of pixel batch processing is then used to manage the GPU memory access. Subsequently, the asynchronous stream concurrency is used to generate the velocity field with the GPU cuFFT library. A synthetic 3D experiment with a ring vortex is carried out to verify the accuracy and efficiency of the developed method. The parallel results agree well with the generated data and other research conclusions reported in the literature. The speed-up ratio by multi-core CPU (Intel (R) Xeon (R) Platinum 8168) parallel implementation with OpenMP converges to 2.5 x in MFG-MART and 3.0 x in cross-correlation. In contrast to a 24-core CPU implementation, a GPU (NVIDIA Tesla V100S, 32 GB) under maximum memory usage achieves an impressive speed-up ratio of over 20 x in parallel MFG-MART and 4 x in concurrent cross-correlation. The measurement of turbulent flow in a circular jet flow at Reynolds 3,000 is used to examine the efficiency promotion of the parallelized framework in real experimental settings. For the synthetic volume reconstruction of 700 x 700 x 140 voxels and cross-correlation with 41(3) voxels window in a 75% overlap, and the experimental volume reconstruction of 550 x 1100 x 550 voxels and cross-correlation with 32 3 voxels window in a 50% overlap, a frame of velocity field can be completed within 2 min in each domain. [GRAPHICS] .
引用
收藏
页数:18
相关论文
共 50 条
  • [1] GPU-accelerated MART and concurrent cross-correlation for tomographic PIV
    Xin Zeng
    Chuangxin He
    Yingzheng Liu
    Experiments in Fluids, 2022, 63
  • [2] Efficient GPU-accelerated parallel cross-correlation
    Madera, Karel
    Smelko, Adam
    Krulis, Martin
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2025, 199
  • [3] THEORY OF CROSS-CORRELATION ANALYSIS OF PIV IMAGES
    KEANE, RD
    ADRIAN, RJ
    APPLIED SCIENTIFIC RESEARCH, 1992, 49 (03): : 191 - 215
  • [4] Microsecond-scale Preemption for Concurrent GPU-accelerated DNN Inferences
    Han, Mingcong
    Zhang, Hanze
    Chen, Rong
    Chen, Haibo
    PROCEEDINGS OF THE 16TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, OSDI 2022, 2022, : 539 - 558
  • [5] High level GPU-accelerated 2D PIV framework in Python']Python
    Nazarov, N. A.
    Terekhov, V. V.
    COMPUTER PHYSICS COMMUNICATIONS, 2024, 295
  • [6] High resolution Cross-Correlation PIV on photographic film
    Cater, J
    Kostas, J
    Fouras, A
    Soria, J
    INTERNATIONAL CONFERENCE ON APPLIED OPTICAL METROLOGY, 1998, 3407 : 539 - 543
  • [7] GPU-Accelerated Incremental Correlation Clustering of Large Data with Visual Feedback
    Papenhausen, Eric
    Wang, Bing
    Ha, Sungsoo
    Zelenyuk, Alla
    Imre, Dan
    Mueller, Klaus
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [8] Cross-correlation and cross-tracking of compressed digital PIV images
    Gray, C
    OPTICAL TECHNOLOGY IN FLUID, THERMAL, AND COMBUSTION FLOW III, 1997, 3172 : 153 - 164
  • [9] Calculation of Cross-correlation Function Accelerated by Tensor Cores with TensorFloat-32 Precision on Ampere GPU
    Kikuchi, Yuma
    Fujita, Kohei
    Ichimura, Tsuyoshi
    Hori, Muneo
    Maddegedara, Lalith
    COMPUTATIONAL SCIENCE, ICCS 2022, PT II, 2022, : 277 - 290
  • [10] ACCELERATION OF RECURSIVE CROSS-CORRELATION PIV USING MULTIPLE GPUS
    Tarashima, Shuhei
    Someya, Satoshi
    Okamoto, Koji
    PROCEEDINGS OF THE ASME/JSME 8TH THERMAL ENGINEERING JOINT CONFERENCE 2011, VOL 1 PTS A AND B, 2011, : 1221 - 1227