Global Memory Access Modelling for Efficient Implementation of the Lattice Boltzmann Method on Graphics Processing Units

被引:0
|
作者
Obrecht, Christian [1 ]
Kuznik, Frederic [1 ]
Tourancheau, Bernard [2 ]
Roux, Jean-Jacques [1 ]
机构
[1] Univ Lyon, INSA Lyon, CNRS,Ctr Thermique Lyon, UMR 5008, Bat Sadi Carnot,9 Rue Phys, F-69621 Villeurbanne, France
[2] UCB Lyon 1, UMR 5668,ENS Lyon, INRIA, Lab Informat Parallelisme,CNRS, Lyon 63964, France
关键词
GPU computing; CUDA; lattice Boltzmann method; CFD;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this work, we investigate the global memory access mechanism on recent GPUs. For the purpose of this study, we created specific benchmark programs, which allowed us to explore the scheduling of global memory transactions. Thus, we formulate a model capable of estimating the execution time for a large class of applications. Our main goal is to facilitate optimisation of regular data-parallel applications on GPUs. As an example, we finally describe our CUDA implementations of LBM flow solvers on which our model was able to estimate performance with less than 5% relative error.
引用
收藏
页码:151 / +
页数:2
相关论文
共 50 条
  • [31] Memory-efficient Lattice Boltzmann Method for low Reynolds number flows
    Matyka, Maciej
    Dzikowski, Michal
    COMPUTER PHYSICS COMMUNICATIONS, 2021, 267
  • [32] Efficient Simulation of Reaction Systems on Graphics Processing Units
    Nobile, Marco S.
    Porreca, Antonio E.
    Spolaor, Simone
    Manzoni, Luca
    Cazzaniga, Paolo
    Mauri, Giancarlo
    Besozzi, Daniela
    FUNDAMENTA INFORMATICAE, 2017, 154 (1-4) : 307 - 321
  • [33] Tool Support for Efficient Programming of Graphics Processing Units
    Damevski, Kostadin
    BRIDGING MATHEMATICS, STATISTICS, ENGINEERING AND TECHNOLOGY, 2012, 24 : 97 - 103
  • [34] GHEVC: An Efficient HEVC Decoder for Graphics Processing Units
    de Souza, Diego F.
    Ilic, Aleksandar
    Roma, Nuno
    Sousa, Leonel
    IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (03) : 459 - 474
  • [35] Efficient magnetohydrodynamic simulations on graphics processing units with CUDA
    Wong, Hon-Cheng
    Wong, Un-Hong
    Feng, Xueshang
    Tang, Zesheng
    COMPUTER PHYSICS COMMUNICATIONS, 2011, 182 (10) : 2132 - 2160
  • [36] Grex: An efficient MapReduce framework for graphics processing units
    Basaran, Can
    Kang, Kyoung-Don
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2013, 73 (04) : 522 - 533
  • [37] Energy Efficient Iris Recognition With Graphics Processing Units
    Rakvic, Ryan
    Broussard, Randy
    Ngo, Hau
    IEEE ACCESS, 2016, 4 : 2831 - 2839
  • [38] A single-step and simplified graphics processing unit lattice Boltzmann method for high turbulent flows
    Delgado-Gutierrez, Arturo
    Marzocca, Pier
    Cardenas, Diego
    Probst, Oliver
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 2021, 93 (07) : 2339 - 2361
  • [39] Demand look-ahead memory access scheduling for 3D graphics processing units
    Hsiao, Chih-Chieh
    Lo, Min-Jen
    Chu, Slo-Li
    MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 73 (03) : 1391 - 1416
  • [40] Lattice Boltzmann Method for Modelling of Biological Phenomena
    Noel, Romain
    Ge, Fei
    Zhang, Yue
    Navarro, Laurent
    Courbebaisse, Guy
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 2654 - 2658