A Multi-GPU Parallel Algorithm in Hypersonic Flow Computations

被引：9

作者：

Lai, Jianqi ^{[1
]}

Li, Hua ^{[1
]}

Tian, Zhengyu ^{[1
]}

Zhang, Ye ^{[1
]}

机构：

[1] Natl Univ Def Technol, Coll Aerosp Sci & Engn, Changsha 410073, Hunan, Peoples R China

来源：

MATHEMATICAL PROBLEMS IN ENGINEERING | 2019年 / 2019卷

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1155/2019/2053156

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Computational fluid dynamics (CFD) plays an important role in the optimal design of aircraft and the analysis of complex flow mechanisms in the aerospace domain. The graphics processing unit (GPU) has a strong floating-point operation capability and a high memory bandwidth in data parallelism, which brings great opportunities for CFD. A cell-centred finite volume method is applied to solve three-dimensional compressible Navier-Stokes equations on structured meshes with an upwind AUSM+UP numerical scheme for space discretization, and four-stage Runge-Kutta method is used for time discretization. Compute unified device architecture (CUDA) is used as a parallel computing platform and programming model for GPUs, which reduces the complexity of programming. The main purpose of this paper is to design an extremely efficient multi-GPU parallel algorithm based on MPI+CUDA to study the hypersonic flow characteristics. Solutions of hypersonic flow over an aerospace plane model are provided at different Mach numbers. The agreement between numerical computations and experimental measurements is favourable. Acceleration performance of the parallel platform is studied with single GPU, two GPUs, and four GPUs. For single GPU implementation, the speedup reaches 63 for the coarser mesh and 78 for the finest mesh. GPUs are better suited for compute-intensive tasks than traditional CPUs. For multi-GPU parallelization, the speedup of four GPUs reaches 77 for the coarser mesh and 147 for the finest mesh; this is far greater than the acceleration achieved by single GPU and two GPUs. It is prospective to apply the multi-GPU parallel algorithm to hypersonic flow computations.

引用

页数：15

共 50 条

[1] An efficient parallel collaborative filtering algorithm on multi-GPU platform
Wang, Zhongya
Liu, Ying
Chiu, Steve
[J]. JOURNAL OF SUPERCOMPUTING, 2016, 72 (06): : 2080 - 2094
[2] An efficient parallel collaborative filtering algorithm on multi-GPU platform
Zhongya Wang
Ying Liu
Steve Chiu
[J]. The Journal of Supercomputing, 2016, 72 : 2080 - 2094
[3] Multi-GPU Accelerated Parallel Algorithm of Wallis Transformation for Image Enhancement
Xiao, Han
Song, Yu-Pu
Zhou, Qing-Lei
[J]. INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2014, 7 (02): : 99 - 114
[4] Multi-GPU Parallel Memetic Algorithm for Capacitated Vehicle Routing Problem
Wodecki, Mieczyslaw
Bozejko, Wojciech
Karpinski, Michaffl
Pacut, Maciej
[J]. PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2013), PT II, 2014, 8385 : 207 - 214
[5] A multi-GPU parallel optimization model for the preconditioned conjugate gradient algorithm
Gao, Jiaquan
Zhou, Yuanshen
He, Guixia
Xia, Yifei
[J]. PARALLEL COMPUTING, 2017, 63 : 1 - 16
[6] Parallel Algorithm for Landform Attributes Representation on Multicore and Multi-GPU Systems
Boratto, Murilo
Alonso, Pedro
Ramiro, Carla
Barreto, Marcos
Coelho, Leandro
[J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2012, PT I, 2012, 7333 : 29 - 43
[7] Data Parallel Skeletons for GPU Clusters and Multi-GPU Systems
Ernsting, Steffen
Kuchen, Herbert
[J]. APPLICATIONS, TOOLS AND TECHNIQUES ON THE ROAD TO EXASCALE COMPUTING, 2012, 22 : 509 - 518
[8] Efficient parallel A* search on multi-GPU system
He, Xin
Yao, Yapeng
Chen, Zhiwen
Sun, Jianhua
Chen, Hao
[J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 123 : 35 - 47
[9] A Parallel Multi-GPU Clonal Selection Algorithm for Optimization Using OpenCL and OpenMP
Russo, Igor L. S.
Bernardino, Heder S.
Barbosa, Helio J. C.
[J]. 2016 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2016,
[10] Groute: An Asynchronous Multi-GPU Programming Model for Irregular Computations
Ben-Nun, Tal
Sutton, Michael
Pai, Sreepathi
Pingali, Keshav
[J]. ACM SIGPLAN NOTICES, 2017, 52 (08) : 235 - 248

← 1 2 3 4 5 →