Particle-resolved thermal lattice Boltzmann simulation using OpenACC on multi-GPUs
Cited by: 6
Authors:
Xu, Ao [1,2,3]
Li, Bo-Tao [1]
Affiliations:
[1] Northwestern Polytech Univ, Sch Aeronaut, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Inst Extreme Mech, Xian 710072, Peoples R China
[3] China Aerodynam Res & Dev Ctr, Key Lab Icing & Anti Deicing, Mianyang 621000, Peoples R China
We utilize the Open Accelerator (OpenACC) approach for graphics processing unit (GPU) accelerated particle-resolved thermal lattice Boltzmann (LB) simulation. We adopt the momentum-exchange method to calculate fluid-particle interactions to preserve the simplicity of the LB method. To address load imbalance issues, we extend the indirect addressing method to collect fluid-particle link information at each timestep and store indices of fluid-particle link in a fixed index array. We simulate the sedimentation of 4,800 hot particles in cold fluids with a domain size of 4000², and the simulation achieves 1750 million lattice updates per second (MLUPS) on a single GPU. Furthermore, we implement a hybrid OpenACC and message passing interface (MPI) approach for multi-GPU accelerated simulation. This approach incorporates four optimization strategies, including building domain lists, utilizing request-answer communication, overlapping communications with computations, and executing computation tasks concurrently. By reducing data communication between GPUs, hiding communication latency through overlapping computation, and increasing the utilization of GPU resources, we achieve improved performance, reaching 10846 MLUPS using 8 GPUs. Our results demonstrate that the OpenACC-based GPU acceleration is promising for particle-resolved thermal lattice Boltzmann simulation.
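The idea of collecting fluid-particle links into a fixed index array can be illustrated with a small sketch. This is not the authors' implementation: it assumes a D2Q9 lattice, a single circular particle, and a hypothetical integer encoding of each link; the names `collect_links`, `decode_link`, and `max_links` are illustrative only. The point it tries to show is that a later force kernel could loop over a flat list of links, where every entry costs the same, instead of looping over particles whose link counts differ.

```python
import math

# The eight moving D2Q9 directions (the rest direction never crosses a boundary).
DIRECTIONS = [(1, 0), (0, 1), (-1, 0), (0, -1),
              (1, 1), (-1, 1), (-1, -1), (1, -1)]
N_DIR = len(DIRECTIONS)


def collect_links(nx, ny, cx, cy, radius, max_links):
    """Fill a fixed-size index array with fluid-particle links.

    A link is a (fluid node, direction) pair whose neighbour along that
    direction lies inside the circular particle. Each link is stored as one
    integer, (y * nx + x) * N_DIR + direction_index (an assumed encoding).
    """
    def is_solid(x, y):
        return (x - cx) ** 2 + (y - cy) ** 2 <= radius ** 2

    link_index = [-1] * max_links  # fixed capacity; -1 marks an unused slot
    n_links = 0

    # Only nodes within one lattice spacing of the particle can own a link.
    x_lo = max(0, math.floor(cx - radius) - 1)
    x_hi = min(nx - 1, math.ceil(cx + radius) + 1)
    y_lo = max(0, math.floor(cy - radius) - 1)
    y_hi = min(ny - 1, math.ceil(cy + radius) + 1)

    for y in range(y_lo, y_hi + 1):
        for x in range(x_lo, x_hi + 1):
            if is_solid(x, y):
                continue  # links start from fluid nodes only
            for d, (ex, ey) in enumerate(DIRECTIONS):
                xn, yn = x + ex, y + ey
                if not (0 <= xn < nx and 0 <= yn < ny):
                    continue
                if is_solid(xn, yn):
                    if n_links >= max_links:
                        raise OverflowError("fixed link array is too small")
                    link_index[n_links] = (y * nx + x) * N_DIR + d
                    n_links += 1
    return link_index, n_links


def decode_link(code, nx):
    """Recover (x, y, direction_index) from an encoded link."""
    node, d = divmod(code, N_DIR)
    y, x = divmod(node, nx)
    return x, y, d


# Example: one particle of radius 5 centred in a 32 x 32 domain.
links, n_links = collect_links(nx=32, ny=32, cx=16.0, cy=16.0,
                               radius=5.0, max_links=512)
```

A momentum-exchange step would then iterate over `links[:n_links]`, decode each entry, and accumulate the force contribution of that link. The array is rebuilt every timestep as particles move, while its capacity stays fixed.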