Global Memory Access Modelling for Efficient Implementation of the Lattice Boltzmann Method on Graphics Processing Units

Cited by: 0
Authors
Obrecht, Christian [1]
Kuznik, Frederic [1]
Tourancheau, Bernard [2]
Roux, Jean-Jacques [1]
Affiliations
[1] Univ Lyon, INSA Lyon, CNRS, Ctr Thermique Lyon, UMR 5008, Bat Sadi Carnot, 9 Rue Phys, F-69621 Villeurbanne, France
[2] UCB Lyon 1, UMR 5668, ENS Lyon, INRIA, Lab Informat Parallelisme, CNRS, Lyon 63964, France
Keywords
GPU computing; CUDA; lattice Boltzmann method; CFD
DOI
Not available
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
In this work, we investigate the global memory access mechanism on recent GPUs. For this study, we created dedicated benchmark programs that allowed us to explore the scheduling of global memory transactions. From these observations, we formulate a model capable of estimating the execution time for a large class of applications. Our main goal is to facilitate the optimisation of regular data-parallel applications on GPUs. As an example, we describe our CUDA implementations of LBM flow solvers, for which our model estimated performance with less than 5% relative error.
Pages: 151 / +
Page count: 2