A GPU-Accelerated automated multilevel substructuring method for modal analysis of structures

被引：0

作者：

Wang, Guidong ^{[1
]}

Wang, Yujie ^{[1
]}

Chen, Zeyu ^{[1
]}

Wang, Feiqi ^{[1
]}

Li, She ^{[1
]}

Cui, Xiangyang ^{[1
,2
]}

机构：

[1] Hunan Univ, State Key Lab Adv Design & Mfg Technol Vehicle, Changsha 410082, Peoples R China

[2] Hunan Maixi Software Co Ltd, Changsha 410082, Peoples R China

来源：

COMPUTERS & STRUCTURES | 2024年 / 305卷

基金：

美国国家科学基金会;

关键词：

AMLS method; Heterogeneous parallel computing; Finite element analysis; Eigenvalue problem; ALGORITHM; SYSTEMS;

D O I：

10.1016/j.compstruc.2024.107516

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

In this work, a novel GPU-accelerated heterogeneous method for the automated multilevel substructuring method(HAMLS) is presented for dealing large finite element models in structural dynamics. Different parallel modes based on node, subtree, and eigenpair have been developed in the solution steps of AMLS to achieve a heterogeneous strategy. First, a new data management method is designed during the model transformation phase to eliminate the determinacy race in the parallel strategy of the separator tree. Considering the distribution characteristics of the nodes in the separator tree and the dependence of node tasks, a load balancing heterogeneous parallel strategy is designed to take full advantage of hosts and devices. By developing an adaptive batch processing program for solving eigenvectors during the back transformation phase, the overheads of launching kernels, as well as the GPU memory requirements, can be reduced by several orders of magnitude. Several numerical examples have been employed to validate the efficiency and practicality of the novel GPU-accelerated heterogeneous strategy. The results demonstrate that the computational efficiency of the novel strategy using one GPU can increase to 3.0x that of the original parallel AMLS method when 16 CPU threads are used.

引用

页数：14

共 50 条

[1] GPU-Accelerated Finite Element Method
Dziekonski, Adam
Lamecki, Adam
Mrozowski, Michal
2016 IEEE MTT-S INTERNATIONAL CONFERENCE ON NUMERICAL ELECTROMAGNETIC AND MULTIPHYSICS MODELING AND OPTIMIZATION (NEMO), 2016,
[2] An Automated Tool for Analysis and Tuning of GPU-Accelerated Code in HPC Applications
Zhou, Keren
Meng, Xiaozhu
Sai, Ryuichi
Grubisic, Dejan
Mellor-Crummey, John
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (04) : 854 - 865
[3] An Effective Matrix Compression Method for GPU-Accelerated Thermal Analysis
Chiou, Lih-Yih
Lu, Liang-Ying
Lin, Chieh-Yu
2015 INTERNATIONAL SYMPOSIUM ON VLSI DESIGN, AUTOMATION AND TEST (VLSI-DAT), 2015,
[4] GPU-Accelerated Static Timing Analysis
Guo, Zizheng
Huang, Tsung-Wei
Lin, Yibo
2020 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED-DESIGN (ICCAD), 2020,
[5] GPU-accelerated ADI-PE method for analysis of EM scatterings
He, Zi
Chen, Ru Shan
ELECTRONICS LETTERS, 2015, 51 (21) : 1652 - 1653
[6] TORCWA: GPU-accelerated Fourier modal method and gradient-based optimization for metasurface design
Kim, Changhyun
Lee, Byoungho
COMPUTER PHYSICS COMMUNICATIONS, 2023, 282
[7] Vehicle vibration analysis with Automated Multilevel Substructuring
Chargin, MK
Bennighof, JK
NUMERICAL ANALYSIS AND SIMULATION IN VEHICLE ENGINEERING, 2002, 1701 : 153 - 163
[8] Measurement and analysis of GPU-accelerated applications with HPCToolkit
Zhou, Keren
Adhianto, Laksono
Anderson, Jonathon
Cherian, Aaron
Grubisic, Dejan
Krentel, Mark
Liu, Yumeng
Meng, Xiaozhu
Mellor-Crummey, John
PARALLEL COMPUTING, 2021, 108
[9] GPU-accelerated differential dependency network analysis
Speyer, Gil
Rodriguez, Juan J.
Bencomo, Tomas
Kim, Seungchan
2018 26TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2018), 2018, : 410 - 414
[10] A Tool for Performance Analysis of GPU-Accelerated Applications
Zhou, Keren
Mellor-Crummey, John
PROCEEDINGS OF THE 2019 IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO '19), 2019, : 282 - 282

← 1 2 3 4 5 →