Accelerating Lattice-Boltzmann method with multi-GPUs

被引:0
|
作者
Wu, Liang [1 ]
Zhong, Chengwen [1 ,2 ]
Zheng, Yankui [1 ]
Liu, Sha [2 ]
Zhuo, Congshan [2 ]
Chen, Xiaopeng [3 ]
机构
[1] Center for High Performance Computing, Northwestern Polytechnical University, Xi'an 710072, China
[2] State Key Laboratory of Science and Technology on Aerodynamics Design and Research, Northwestern Polytechnical University, Xi'an 710072, China
[3] School of Mechanics, Civil Engineering and Architecture, Northwestern Polytechnical University, Xi'an 710072, China
关键词
Program processors - Multitasking - Computational fluid dynamics - Iterative methods - Graphics processing unit;
D O I
暂无
中图分类号
学科分类号
摘要
To improve the efficiency and accuracy of simulating complex flow phenomena, an accelerated algorithm of Lattice-Boltzmann method based on CUDA on multi-GPUs desktop platform is proposed. The computational space is evenly divided into cells, which compose multiple sub-domains in accordance with the number of available GPUs. Each GPU stores only the data of its respective sub-domain in global memory and texture memory. To facilitate the particle interaction at the partitioned boundary between adjacent sub-domains, an additional layer of cells is attached to the boundary of each sub-domain to store the data computed by the neighboring GPU hence reducing the communication overhead between GPUs. Then each CPU thread controls an available GPU using multithreading technology. And semaphore is used to synchronize between threads to exchange data in the iteration process for solving the LBM equations. Experimental results show that, the dual-GPU performs 1.77 times faster than that of the single GPU approximately and the resolution of the computational space can be increased to 6144×6144 instead of that of single GPU which is 4160×4160.
引用
收藏
页码:1932 / 1939
相关论文
共 50 条
  • [1] Benchmarking GPUs with a parallel Lattice-Boltzmann code
    Kraus, Jiri
    Pivanti, Marcello
    Schifano, Sebastiano Fabio
    Tripiccione, Raffaele
    Zanella, Marco
    [J]. 2013 25TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 2013, : 160 - 167
  • [2] Particle-resolved thermal lattice Boltzmann simulation using OpenACC on multi-GPUs
    Xu, Ao
    Li, Bo-Tao
    [J]. INTERNATIONAL JOURNAL OF HEAT AND MASS TRANSFER, 2024, 218
  • [3] Scalable Parallel Algorithm of Multiple-Relaxation-Time Lattice Boltzmann Method with Large Eddy Simulation on Multi-GPUs
    Xu, Lei
    Song, Anping
    Zhang, Wu
    [J]. SCIENTIFIC PROGRAMMING, 2018, 2018
  • [4] Verification of multi-component lattice-Boltzmann method
    Niimura, H
    [J]. INTERNATIONAL JOURNAL OF MODERN PHYSICS B, 2003, 17 (1-2): : 157 - 160
  • [5] Accuracy of the lattice-Boltzmann method
    Maier, RS
    Bernard, RS
    [J]. INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 1997, 8 (04): : 747 - 752
  • [6] Comparison of implementations of the lattice-Boltzmann method
    Mattila, Keijo
    Hyvaeluoma, Jari
    Timonen, Jussi
    Rossi, Tuomo
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2008, 55 (07) : 1514 - 1524
  • [7] Lattice-Boltzmann Method for Complex Flows
    Aidun, Cyrus K.
    Clausen, Jonathan R.
    [J]. ANNUAL REVIEW OF FLUID MECHANICS, 2010, 42 : 439 - 472
  • [8] Accelerating the Lattice Boltzmann Method
    Altoyan, Wesson
    Alonso, Juan J.
    [J]. 2023 IEEE AEROSPACE CONFERENCE, 2023,
  • [9] On development of lattice-Boltzmann method based multi phase flow solver
    Raval, Tejas
    Patel, Dhruv
    Lakdawala, Absar
    [J]. NUMERICAL HEAT TRANSFER PART B-FUNDAMENTALS, 2024,
  • [10] Multi-GPUs parallel computation of dendrite growth in forced convection using the phase-field-lattice Boltzmann model
    Sakane, Shinji
    Takaki, Tomohiro
    Rojas, Roberto
    Ohno, Munekazu
    Shibuta, Yasushi
    Shimokawabe, Takashi
    Aoki, Takayuki
    [J]. JOURNAL OF CRYSTAL GROWTH, 2017, 474 : 154 - 159