Multi-GPU acceleration of large-scale density-based topology optimization

Cited by: 18
Authors
Herrero-Perez, David [1, 2]
Martinez Castejon, Pedro J. [2 ]
Affiliations
[1] Tech Univ Cartagena, Computat Mech & Sci Comp Grp, Murcia 30202, Spain
[2] Tech Univ Cartagena, Campus Muralla del Mar, Murcia 30202, Spain
Keywords
Topology optimization; GPU computing; Multi-GPU systems; Finite element analysis; Aggregation AMG; EFFICIENT; DESIGN; SOLVER;
DOI
10.1016/j.advengsoft.2021.103006
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification codes
081203; 0835;
Abstract
This work presents a parallel implementation of density-based topology optimization using distributed GPU computing systems. Using multiple GPU devices accelerates the computation and increases the device memory available for GPU computing. The additional device memory makes it possible to address large models that commonly do not fit into a single GPU device. Most modern scientific computers incorporate these devices to build energy-efficient, low-cost, high-performance systems. However, taking advantage of the computational resources of such many-core systems requires adopting the proper techniques. It is well known that the bottleneck of density-based topology optimization is solving the linear elasticity problem with Finite Element Analysis (FEA) at each topology optimization iteration. We solve the linear system of equations obtained from FEA using a distributed conjugate gradient solver preconditioned by a smooth aggregation-based algebraic multigrid (AMG) method, running on multiple GPU devices. Aggregation-based AMG reduces memory requirements and improves the efficiency of the interpolation operation, both of which benefit GPU computing. We evaluate the performance and scalability of the distributed GPU system using structured and unstructured meshes. We also test the performance using different 3D finite elements and relaxation operators. In addition, we evaluate numerical approaches for increasing the topology optimization performance. Finally, we compare the many-core computing instance with an efficient multi-core implementation to highlight the advantages of GPU computing in large-scale density-based topology optimization problems.
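The solver structure named in the abstract, a conjugate gradient iteration with a preconditioner applied to the residual at each step, can be illustrated with a minimal single-process sketch. This is not the authors' implementation: it runs on the CPU with NumPy, it is not distributed, and a simple Jacobi (diagonal) preconditioner stands in for the aggregation-based AMG preconditioner. The function names `preconditioned_cg`, `make_jacobi`, and `laplacian_1d` are hypothetical, and the 1D Laplacian is only a small symmetric positive-definite stand-in for an FEA stiffness matrix.

```python
import numpy as np


def preconditioned_cg(A, b, apply_precond, tol=1e-8, max_iter=1000):
    """Solve A x = b for a symmetric positive-definite A.

    apply_precond(r) returns an approximation of A^{-1} r. In the paper
    this role is played by an aggregation-based AMG cycle; here any
    callable can be supplied (assumption for illustration).
    """
    x = np.zeros(b.shape, dtype=float)
    b_norm = np.linalg.norm(b)
    if b_norm == 0.0:
        return x, 0
    r = b - A @ x                  # initial residual
    z = apply_precond(r)           # preconditioned residual
    p = z.copy()                   # first search direction
    rz = r @ z
    for it in range(1, max_iter + 1):
        Ap = A @ p
        alpha = rz / (p @ Ap)      # step length along p
        x += alpha * p
        r -= alpha * Ap
        if np.linalg.norm(r) <= tol * b_norm:
            return x, it
        z = apply_precond(r)
        rz_new = r @ z
        p = z + (rz_new / rz) * p  # new direction, A-conjugate to the old ones
        rz = rz_new
    return x, max_iter


def make_jacobi(A):
    """Diagonal preconditioner: a deliberately simple stand-in for AMG."""
    inv_diag = 1.0 / np.diag(A)
    return lambda r: inv_diag * r


def laplacian_1d(n):
    """Small SPD test matrix standing in for an FEA stiffness matrix."""
    return 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)


if __name__ == "__main__":
    A = laplacian_1d(50)
    b = np.ones(50)
    x, iterations = preconditioned_cg(A, b, make_jacobi(A))
    print("iterations:", iterations)
    print("residual norm:", np.linalg.norm(b - A @ x))
```

Because the preconditioner is passed in as a callable, the iteration itself does not change when a stronger preconditioner is substituted; in a multi-GPU setting the matrix-vector product and the dot products are the operations that would need to be distributed.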
Pages: 21
    [J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2022, 19 (02) : 331 - 344