A GPU-Accelerated automated multilevel substructuring method for modal analysis of structures

被引:0
|
作者
Wang, Guidong [1 ]
Wang, Yujie [1 ]
Chen, Zeyu [1 ]
Wang, Feiqi [1 ]
Li, She [1 ]
Cui, Xiangyang [1 ,2 ]
机构
[1] Hunan Univ, State Key Lab Adv Design & Mfg Technol Vehicle, Changsha 410082, Peoples R China
[2] Hunan Maixi Software Co Ltd, Changsha 410082, Peoples R China
基金
美国国家科学基金会;
关键词
AMLS method; Heterogeneous parallel computing; Finite element analysis; Eigenvalue problem; ALGORITHM; SYSTEMS;
D O I
10.1016/j.compstruc.2024.107516
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this work, a novel GPU-accelerated heterogeneous method for the automated multilevel substructuring method(HAMLS) is presented for dealing large finite element models in structural dynamics. Different parallel modes based on node, subtree, and eigenpair have been developed in the solution steps of AMLS to achieve a heterogeneous strategy. First, a new data management method is designed during the model transformation phase to eliminate the determinacy race in the parallel strategy of the separator tree. Considering the distribution characteristics of the nodes in the separator tree and the dependence of node tasks, a load balancing heterogeneous parallel strategy is designed to take full advantage of hosts and devices. By developing an adaptive batch processing program for solving eigenvectors during the back transformation phase, the overheads of launching kernels, as well as the GPU memory requirements, can be reduced by several orders of magnitude. Several numerical examples have been employed to validate the efficiency and practicality of the novel GPU-accelerated heterogeneous strategy. The results demonstrate that the computational efficiency of the novel strategy using one GPU can increase to 3.0x that of the original parallel AMLS method when 16 CPU threads are used.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] GPU-Accelerated Finite Element Method
    Dziekonski, Adam
    Lamecki, Adam
    Mrozowski, Michal
    2016 IEEE MTT-S INTERNATIONAL CONFERENCE ON NUMERICAL ELECTROMAGNETIC AND MULTIPHYSICS MODELING AND OPTIMIZATION (NEMO), 2016,
  • [2] An Automated Tool for Analysis and Tuning of GPU-Accelerated Code in HPC Applications
    Zhou, Keren
    Meng, Xiaozhu
    Sai, Ryuichi
    Grubisic, Dejan
    Mellor-Crummey, John
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (04) : 854 - 865
  • [3] An Effective Matrix Compression Method for GPU-Accelerated Thermal Analysis
    Chiou, Lih-Yih
    Lu, Liang-Ying
    Lin, Chieh-Yu
    2015 INTERNATIONAL SYMPOSIUM ON VLSI DESIGN, AUTOMATION AND TEST (VLSI-DAT), 2015,
  • [4] GPU-Accelerated Static Timing Analysis
    Guo, Zizheng
    Huang, Tsung-Wei
    Lin, Yibo
    2020 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED-DESIGN (ICCAD), 2020,
  • [5] GPU-accelerated ADI-PE method for analysis of EM scatterings
    He, Zi
    Chen, Ru Shan
    ELECTRONICS LETTERS, 2015, 51 (21) : 1652 - 1653
  • [6] TORCWA: GPU-accelerated Fourier modal method and gradient-based optimization for metasurface design
    Kim, Changhyun
    Lee, Byoungho
    COMPUTER PHYSICS COMMUNICATIONS, 2023, 282
  • [7] Vehicle vibration analysis with Automated Multilevel Substructuring
    Chargin, MK
    Bennighof, JK
    NUMERICAL ANALYSIS AND SIMULATION IN VEHICLE ENGINEERING, 2002, 1701 : 153 - 163
  • [8] Measurement and analysis of GPU-accelerated applications with HPCToolkit
    Zhou, Keren
    Adhianto, Laksono
    Anderson, Jonathon
    Cherian, Aaron
    Grubisic, Dejan
    Krentel, Mark
    Liu, Yumeng
    Meng, Xiaozhu
    Mellor-Crummey, John
    PARALLEL COMPUTING, 2021, 108
  • [9] GPU-accelerated differential dependency network analysis
    Speyer, Gil
    Rodriguez, Juan J.
    Bencomo, Tomas
    Kim, Seungchan
    2018 26TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2018), 2018, : 410 - 414
  • [10] A Tool for Performance Analysis of GPU-Accelerated Applications
    Zhou, Keren
    Mellor-Crummey, John
    PROCEEDINGS OF THE 2019 IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO '19), 2019, : 282 - 282