A Multi-GPU Parallel Algorithm in Hypersonic Flow Computations

被引:9
|
作者
Lai, Jianqi [1 ]
Li, Hua [1 ]
Tian, Zhengyu [1 ]
Zhang, Ye [1 ]
机构
[1] Natl Univ Def Technol, Coll Aerosp Sci & Engn, Changsha 410073, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1155/2019/2053156
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Computational fluid dynamics (CFD) plays an important role in the optimal design of aircraft and the analysis of complex flow mechanisms in the aerospace domain. The graphics processing unit (GPU) has a strong floating-point operation capability and a high memory bandwidth in data parallelism, which brings great opportunities for CFD. A cell-centred finite volume method is applied to solve three-dimensional compressible Navier-Stokes equations on structured meshes with an upwind AUSM+UP numerical scheme for space discretization, and four-stage Runge-Kutta method is used for time discretization. Compute unified device architecture (CUDA) is used as a parallel computing platform and programming model for GPUs, which reduces the complexity of programming. The main purpose of this paper is to design an extremely efficient multi-GPU parallel algorithm based on MPI+CUDA to study the hypersonic flow characteristics. Solutions of hypersonic flow over an aerospace plane model are provided at different Mach numbers. The agreement between numerical computations and experimental measurements is favourable. Acceleration performance of the parallel platform is studied with single GPU, two GPUs, and four GPUs. For single GPU implementation, the speedup reaches 63 for the coarser mesh and 78 for the finest mesh. GPUs are better suited for compute-intensive tasks than traditional CPUs. For multi-GPU parallelization, the speedup of four GPUs reaches 77 for the coarser mesh and 147 for the finest mesh; this is far greater than the acceleration achieved by single GPU and two GPUs. It is prospective to apply the multi-GPU parallel algorithm to hypersonic flow computations.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] An efficient parallel collaborative filtering algorithm on multi-GPU platform
    Wang, Zhongya
    Liu, Ying
    Chiu, Steve
    [J]. JOURNAL OF SUPERCOMPUTING, 2016, 72 (06): : 2080 - 2094
  • [2] An efficient parallel collaborative filtering algorithm on multi-GPU platform
    Zhongya Wang
    Ying Liu
    Steve Chiu
    [J]. The Journal of Supercomputing, 2016, 72 : 2080 - 2094
  • [3] Multi-GPU Accelerated Parallel Algorithm of Wallis Transformation for Image Enhancement
    Xiao, Han
    Song, Yu-Pu
    Zhou, Qing-Lei
    [J]. INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2014, 7 (02): : 99 - 114
  • [4] Multi-GPU Parallel Memetic Algorithm for Capacitated Vehicle Routing Problem
    Wodecki, Mieczyslaw
    Bozejko, Wojciech
    Karpinski, Michaffl
    Pacut, Maciej
    [J]. PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2013), PT II, 2014, 8385 : 207 - 214
  • [5] A multi-GPU parallel optimization model for the preconditioned conjugate gradient algorithm
    Gao, Jiaquan
    Zhou, Yuanshen
    He, Guixia
    Xia, Yifei
    [J]. PARALLEL COMPUTING, 2017, 63 : 1 - 16
  • [6] Parallel Algorithm for Landform Attributes Representation on Multicore and Multi-GPU Systems
    Boratto, Murilo
    Alonso, Pedro
    Ramiro, Carla
    Barreto, Marcos
    Coelho, Leandro
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2012, PT I, 2012, 7333 : 29 - 43
  • [7] Data Parallel Skeletons for GPU Clusters and Multi-GPU Systems
    Ernsting, Steffen
    Kuchen, Herbert
    [J]. APPLICATIONS, TOOLS AND TECHNIQUES ON THE ROAD TO EXASCALE COMPUTING, 2012, 22 : 509 - 518
  • [8] Efficient parallel A* search on multi-GPU system
    He, Xin
    Yao, Yapeng
    Chen, Zhiwen
    Sun, Jianhua
    Chen, Hao
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 123 : 35 - 47
  • [9] A Parallel Multi-GPU Clonal Selection Algorithm for Optimization Using OpenCL and OpenMP
    Russo, Igor L. S.
    Bernardino, Heder S.
    Barbosa, Helio J. C.
    [J]. 2016 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2016,
  • [10] Groute: An Asynchronous Multi-GPU Programming Model for Irregular Computations
    Ben-Nun, Tal
    Sutton, Michael
    Pai, Sreepathi
    Pingali, Keshav
    [J]. ACM SIGPLAN NOTICES, 2017, 52 (08) : 235 - 248