Two-level parallel load balancing strategy for accelerating DSMC simulations in near-continuum gases

被引:0
|
作者
Xiao, Chenxiang [1 ]
Zhang, Chenchen [2 ]
Zhang, Bin [1 ,3 ]
Xu, Hui [1 ]
Liu, Hong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Aeronaut & Astronaut, 800 Dong Chuan Rd, Shanghai 200240, Peoples R China
[2] Peking Univ, Sch Math Sci, Beijing 100871, Peoples R China
[3] Shanghai Jiao Tong Univ, Sichuan Res Inst, Chengdu 610213, Peoples R China
来源
关键词
DSMC; MPI/OpenMP; load balance; nonblock communication; MONTE-CARLO METHOD; HYPERSONIC FLOW; NUMERICAL-SIMULATION; CIRCULAR-CYLINDER; IMPLEMENTATION;
D O I
10.1142/S0129183124501985
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The Direct Simulation Monte Carlo (DSMC) algorithm is widely employed for simulating rarefied gas flows and is increasingly applied in near-continuum regimes for research and engineering purposes. However, its computational demands, notably load imbalance and extended simulation time, hinder widespread adoption. Addressing these challenges, this paper introduces the Two-Level parallel load balancing strategy. This novel approach combines thread-level and multi-process parallelism to enhance load balancing and reduce simulation time. Key features include a thread-level load-decoupling strategy implemented via OpenMP and a multi-process load balancing mechanism employing distributed memory via MPI. Building upon our previous PartPlusColl [L. Li, W. Ren and B. Zhang, J. Aeronaut. Astronaut. Aviat. Ser. A 46, 88 (2014)] approach, the load balancing mechanism utilizes Stop At Risk (SAR) criteria for repartitioning with METIS. Additionally, a specialized data transmission mechanism utilizing MPI nonblocking communication minimizes global communication between processes. Validation and evaluation are performed using four hypersonic flow cases around a cylinder and sphere, demonstrating significant improvements. Notably, the proposed strategy achieves 30% enhancement over the PartPlusColl strategy under 512 CPU cores compared to 16 CPU cores, and reduces between-process communication time with 33.57%. These advancements contribute to enhancing the effectiveness of the DSMC algorithm in near-continuum aerodynamic simulations.
引用
收藏
页数:17
相关论文
共 34 条
  • [31] Fault-tolerant control strategy for open-circuit fault of two-parallel-connected three-phase AC–DC two-level PWM converter
    Hyun-Keun Ku
    Jun-Hyung Jung
    Jin-Woo Park
    Jang-Mok Kim
    Yung-Dueg Son
    Journal of Power Electronics, 2020, 20 : 731 - 742
  • [32] Fault-tolerant control strategy for open-circuit fault of two-parallel-connected three-phase AC-DC two-level PWM converter
    Ku, Hyun-Keun
    Jung, Jun-Hyung
    Park, Jin-Woo
    Kim, Jang-Mok
    Son, Yung-Dueg
    JOURNAL OF POWER ELECTRONICS, 2020, 20 (03) : 731 - 742
  • [33] A Coordinated Strategy Providing Zero-Sequence Circulating Current Suppression and Neutral-Point Potential Balancing in Two Parallel Three-Level Converters
    Liang, Zhigang
    Lin, Xinchun
    Qiao, Xuesong
    Kang, Yong
    Gao, Bofeng
    IEEE JOURNAL OF EMERGING AND SELECTED TOPICS IN POWER ELECTRONICS, 2018, 6 (01) : 363 - 376
  • [34] Two-Level Optimal Bidding Strategy for Load Aggregator Based on a Data-Driven Approach Combined With LSTM-Based Forecasting and Agent-Based Models
    Ryu, Han Seok
    Kim, Hyung Joon
    Kim, Mun Kyeom
    IEEE ACCESS, 2023, 11 : 89249 - 89264