A Multi-GPU Implementation of a D2Q37 Lattice Boltzmann Code

被引:0
|
作者
Biferale, Luca [1 ]
Mantovani, Filippo [2 ]
Pivanti, Marcello [3 ,4 ]
Pozzati, Fabio [5 ]
Sbragaglia, Mauro [1 ]
Scagliarini, Andrea [6 ]
Schifano, Sebastiano Fabio [3 ,4 ]
Toschi, Federico [7 ,8 ]
Tripiccione, Raffaele [3 ,4 ]
机构
[1] Univ Roma Tor Vergata, Rome, Italy
[2] Deutsch Elekt Synchrotron DESY, Zeuthen, Germany
[3] Univ Ferrara, Ferrara, Italy
[4] INFN, Ferrara, Italy
[5] Fondaz Bruno Kessler Trento, Trento, Italy
[6] Univ Barcelona, Barcelona, Spain
[7] Eindhoven Univ Technol, Eindhoven, Netherlands
[8] CNR, IAC, Rome, Italy
关键词
Computational fluid-dynamics; Lattice Boltzmann methods; GP-GPUs computing; PERFORMANCE;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We describe a parallel implementation of a compressible Lattice Boltzmann code on a multi-GPU cluster based on Nvidia Fermi processors. We analyze how to optimize the algorithm for GP-GPU architectures, describe the implementation choices that we have adopted and compare our performance results with an implementation optimized for latest generation multi-core CPUs. Our program runs at approximate to 30% of the double-precision peak performance of one GPU and shows almost linear scaling when run on the multi-GPU cluster.
引用
收藏
页码:640 / 650
页数:11
相关论文
共 50 条
  • [31] An Optimized Lattice Boltzmann Code for BlueGene/Q
    Pivanti, Marcello
    Mantovani, Filippo
    Schifano, Sebastiano Fabio
    Tripiccione, Raffaele
    Zenesini, Luca
    [J]. PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2013), PT II, 2014, 8385 : 385 - 394
  • [32] Accelerating physical simulations from a multicomponent Lattice Boltzmann method on a single-node multi-GPU architecture
    Duchateau, Julien
    Rousselle, Francois
    Maquignon, Nicolas
    Roussel, Gilles
    Renaud, Christophe
    [J]. 2015 10TH INTERNATIONAL CONFERENCE ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING (3PGCIC), 2015, : 315 - 322
  • [33] Multi-GPU Acceleration of the iPIC3D Implicit Particle-in-Cell Code
    Sishtla, Chaitanya Prasad
    Chien, Steven W. D.
    Olshevsky, Vyacheslav
    Laure, Erwin
    Markidis, Stefano
    [J]. COMPUTATIONAL SCIENCE - ICCS 2019, PT V, 2019, 11540 : 612 - 618
  • [34] Simulation of droplet dynamic with high density ratio two-phase lattice Boltzmann model on multi-GPU cluster
    Huang, Tzu-Chun
    Chang, Chien-Yi
    Lin, Chao-An
    [J]. COMPUTERS & FLUIDS, 2018, 173 : 80 - 87
  • [35] Parallelization of lattice Boltzmann software for execution on multi-GPU clusters with application to the simulation of blood flow through human arteries
    Djukic, Tijana
    Filipovic, Nenad
    [J]. 2021 IEEE 21ST INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (IEEE BIBE 2021), 2021,
  • [36] Multi-GPU Implementation of k-Nearest Neighbor Algorithm
    Masek, Jan
    Burget, Kadim
    Karasek, Jan
    Uher, Vaclav
    Dutta, Malay Kishore
    [J]. 2015 38TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2015, : 764 - 767
  • [37] Lattice Boltzmann Simulations at Petascale on Multi-GPU Systems with Asynchronous Data Transfer and Strictly Enforced Memory Read Alignment
    Robertsen, Fredrik
    Westerholm, Jan
    Mattila, Keijo
    [J]. 23RD EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2015), 2015, : 604 - 609
  • [38] Multi-GPU Implementation of the Uniformization Method for Solving Markov Models
    Karwacki, Marek
    Bylina, Beata
    Bylina, Jaroslaw
    [J]. 2012 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2012, : 533 - 537
  • [39] Multi-GPU implementation of a VMAT treatment plan optimization algorithm
    Tian, Zhen
    Peng, Fei
    Folkerts, Michael
    Tan, Jun
    Jia, Xun
    Jiang, Steve B.
    [J]. MEDICAL PHYSICS, 2015, 42 (06) : 2841 - 2852
  • [40] Efficient implementation of data flow graphs on multi-gpu clusters
    Vincent Boulos
    Sylvain Huet
    Vincent Fristot
    Luc Salvo
    Dominique Houzet
    [J]. Journal of Real-Time Image Processing, 2014, 9 : 217 - 232