Optimization and acceleration of flow simulations for CFD on CPU/GPU architecture

被引:11
|
作者
Lei, Jiang [1 ]
Li, Da-li [1 ]
Zhou, Yun-long [1 ]
Liu, Wei [1 ]
机构
[1] Natl Univ Def Technol, Coll Aerosp Sci & Engn, Changsha 410073, Hunan, Peoples R China
关键词
Euler equation; GPU; CUDA; CFD; DIRECT NUMERICAL-SIMULATION; INCOMPRESSIBLE FLOWS; GPU; SOLVER;
D O I
10.1007/s40430-019-1793-9
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
With the increasing requirement of high computational power in computational fluid dynamics (CFD) field, the graphic processing units (GPUs) with great floating-point computing capability play more important roles. This work explores the porting of an Euler solver from central processing units (CPUs) to three different CPU/GPU heterogeneous hardware platforms using MUSCL and NND schemes, and then the computational acceleration of one-dimensional (1D) Riemann problem and two-dimensional (2D) flow past a forward-facing step is investigated. Based on hardware structures, memory models and programming methods, the working manner of heterogeneous systems was firstly introduced in this paper. Subsequently, three different heterogeneous methods employed in the current study were presented in detail, while porting all parts of the solver loop to GPU possessed the best performance among them. Several optimization strategies suitable for the solver were adopted to achieve substantial execution speedups, while using shared memory on GPU was relatively rarely reported in CFD literature. Finally, the simulation of 1D Riemann verified the reliability of the modified codes on GPU, demonstrating strong ability in capturing discontinuities of both schemes. The two cases with their 1D computational domains discretized into 10,000 cells both realized a speedup exceeding 25, compared to that executed on a single-core CPU. In simulation of the 2D step flow, we came to the highest speedups of 260 for MUSCL scheme with 800x400 mesh size and 144 for NND scheme with 400x200 computational domain, respectively.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] GPU acceleration of Eulerian-Lagrangian particle-laden turbulent flow simulations
    Sweet, James
    Richter, David H.
    Thain, Douglas
    INTERNATIONAL JOURNAL OF MULTIPHASE FLOW, 2018, 99 : 437 - 445
  • [42] Jaya optimization algorithm with GPU acceleration
    A. Jimeno-Morenilla
    J. L. Sánchez-Romero
    H. Migallón
    H. Mora-Mora
    The Journal of Supercomputing, 2019, 75 : 1094 - 1106
  • [43] Jaya optimization algorithm with GPU acceleration
    Jimeno-Morenilla, A.
    Sanchez-Romero, J. L.
    Migallon, H.
    Mora-Mora, H.
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (03): : 1094 - 1106
  • [44] Wind Turbine Simulations Using CPU/GPU Heterogeneous Computing
    Yong Su Jung
    James Baeder
    International Journal of Aeronautical and Space Sciences, 2024, 25 : 331 - 344
  • [45] CPU and GPU Performance of Large Scale Numerical Simulations in Geophysics
    Dorostkar, Ali
    Lukarski, Dimitar
    Lund, Bjorn
    Neytcheva, Maya
    Notay, Yvan
    Schmidt, Peter
    EURO-PAR 2014: PARALLEL PROCESSING WORKSHOPS, PT I, 2014, 8805 : 12 - 23
  • [46] Acceleration of Stereo-Matching on Multi-core CPU and GPU
    Xu, Tian
    Cockshott, Paul
    Oehler, Susanne
    2014 IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2014 IEEE 6TH INTL SYMP ON CYBERSPACE SAFETY AND SECURITY, 2014 IEEE 11TH INTL CONF ON EMBEDDED SOFTWARE AND SYST (HPCC,CSS,ICESS), 2014, : 108 - 115
  • [47] Acceleration of Differential Power Analysis through the Parallel Use of GPU and CPU
    Lee, Sung Jae
    Seo, Seog Chung
    Han, Dong-Guk
    Hong, Seokhie
    Lee, Sangjin
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2010, E93A (09) : 1688 - 1692
  • [48] CPU-GPU hybrid parallel strategy for cosmological simulations
    Wang, Yueqing
    Dou, Yong
    Guo, Song
    Lei, Yuanwu
    Zou, Dan
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2014, 26 (03): : 748 - 765
  • [49] Wind Turbine Simulations Using CPU/GPU Heterogeneous Computing
    Jung, Yong Su
    Baeder, James
    INTERNATIONAL JOURNAL OF AERONAUTICAL AND SPACE SCIENCES, 2024, 25 (02) : 331 - 344
  • [50] Optimization of the HEFT algorithm for a CPU-GPU environment
    Shetti, Karan R.
    Fahmy, Suhaib A.
    Bretschneider, Timo
    2013 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT), 2013, : 212 - 218