Divide and Conquer Parallelization of Finite Element Method Assembly Divide and Conquer Parallelization of Finite Element Method Assembly

被引:3
|
作者
Thebault, Loic [1 ]
Petit, Eric [1 ]
Tchiboukdjian, Marc [3 ]
Quang Dinh [2 ]
Jalby, William [1 ,3 ]
机构
[1] Univ Versailles, PRISM, Versailles, France
[2] Dassault Aviat, St Cloud, France
[3] Exascale Comp Res, Versailles, France
关键词
Divide and Conquer; Task; Cilk; Mesh Partitioning; CFD; FEM Assembly;
D O I
10.3233/978-1-61499-381-0-753
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Relying solely on domain decomposition and distributed memory parallelism can limit the performance on current supercomputers. At scale, a larger number of smaller domains can lead to an increased communication volume and to load balancing issues. Moreover, the decreasing memory per core is not compatible with the memory overhead of a finer domain decomposition. A popular alternative is to use shared memory parallelism in addition to the domain decomposition. In the context of Finite Element Method, FEM, one of the challenging steps to parallelize in shared memory is the matrix assembly. In this paper, we propose and evaluate a Divide and Conquer, D&C, algorithm to efficiently parallelize the FEM assembly. We compare this hybrid approach using D&C to the pure domain decomposition and to a state-of-the-art hybrid approach using mesh coloring. Our target application is an industrial fluid dynamics code, developed by Dassault Aviation and parallelized with MPI domain decomposition. The original Fortran code has been modified with minimum intrusion. Our D&C approach uses task parallelism with Intel Cilk+. Preliminary results show a good data locality and a 14% performance improvement on a 12 cores 2 sockets Westmere-EP node.
引用
收藏
页码:753 / 762
页数:10
相关论文
共 50 条
  • [1] PARALLELIZATION OF ASSEMBLY OPERATION IN FINITE ELEMENT METHOD
    Bosansky, Michal
    Patzak, Borek
    [J]. ACTA POLYTECHNICA, 2020, 60 (01) : 25 - 37
  • [2] Automatic parallelization of divide and conquer algorithms
    Rugina, R
    Rinard, M
    [J]. ACM SIGPLAN NOTICES, 1999, 34 (08) : 72 - 83
  • [3] Parallelization of divide-and-conquer eigenvector accumulation
    Gansterer, WN
    Zottl, J
    [J]. EURO-PAR 2005 PARALLEL PROCESSING, PROCEEDINGS, 2005, 3648 : 847 - 856
  • [4] A divide and conquer real space finite-element Hartree-Fock method
    Alizadegan, R.
    Hsia, K. J.
    Martinez, T. J.
    [J]. JOURNAL OF CHEMICAL PHYSICS, 2010, 132 (03):
  • [5] AUTOMATIC PARALLELIZATION OF DIVIDE-AND-CONQUER ALGORITHMS
    FREISLEBEN, B
    KIELMANN, T
    [J]. LECTURE NOTES IN COMPUTER SCIENCE, 1992, 634 : 849 - 850
  • [6] ON PARALLELIZATION OF ASSEMBLY OPERATIONS IN FINITE ELEMENT SOFTWARE
    Bosansky, M.
    Patzak, B.
    [J]. ENGINEERING MECHANICS 2016, 2016, : 82 - 85
  • [7] Experiences on parallelization of Divide and Conquer algorithms with Parallel Paradigms
    Aversa, R
    Di Martino, B
    [J]. PARALLEL COMPUTING TECHNOLOGIES, 1999, 1662 : 473 - 476
  • [8] Modular Divide-and-Conquer Parallelization of Nested Loops
    Farzan, Azadeh
    Nicolet, Victor
    [J]. PROCEEDINGS OF THE 40TH ACM SIGPLAN CONFERENCE ON PROGRAMMING LANGUAGE DESIGN AND IMPLEMENTATION (PLDI '19), 2019, : 610 - 624
  • [9] Massive parallelization of divide-and-conquer algorithms over powerlists
    Achatz, K
    Schulte, W
    [J]. SCIENCE OF COMPUTER PROGRAMMING, 1996, 26 (1-3) : 59 - 78
  • [10] Unified framework for the parallelization of divide and conquer based tridiagonal systems
    Lopez, J
    Plata, O
    Arguello, F
    Zapata, EL
    [J]. PARALLEL COMPUTING, 1997, 23 (06) : 667 - 686