Efficient Heuristic Adaptive Quadrature on GPUs: Design and Evaluation

Cited by: 6
Authors
Thuerck, Daniel [1]
Widmer, Sven [2]
Kuijper, Arjan [1, 3]
Goesele, Michael [2]
Affiliations
[1] Tech Univ Darmstadt, Darmstadt, Germany
[2] Tech Univ Darmstadt, Grad Sch Computat Engn, Darmstadt, Germany
[3] Fraunhofer IGD, Darmstadt, Germany
Keywords
Numerical integration; GPGPU; Numerical algorithms; Heuristics; Interval estimation
DOI
10.1007/978-3-642-55224-3_61
Chinese Library Classification (CLC) number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Numerical integration is a common sub-problem in many applications. It can be solved easily in CPU-based applications using adaptive quadrature such as the adaptive Simpson's rule. These algorithms rely, however, on error estimation yielding a significant computational overhead. In addition, they require recursive function evaluations, which are not well suited for parallel computation on graphics processing units (GPUs) due to warp divergence issues. In this paper, we introduce heuristic forward quadrature as an alternative that is not only more efficient than traditional methods, but also better suited for accelerated massively-parallel calculation on GPUs. Additionally, we will give an error estimate for our method and demonstrate performance results for 1D and 2D integral applications which show that the algorithm leverages quadrature for the efficient implementation on GPUs.
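For orientation, the CPU baseline that the abstract names, the adaptive Simpson's rule, can be sketched as follows. This is a minimal textbook version, not the heuristic forward quadrature introduced in the paper; the function names, the default tolerance, and the depth limit are assumptions chosen for illustration. It shows the two properties the abstract criticizes: each step spends extra work on an error estimate, and the recursion depth depends on the integrand, which is what causes divergent control flow when many threads run it in lockstep.

```python
import math


def simpson(f, a, b):
    """Basic (non-adaptive) Simpson's rule on the interval [a, b]."""
    m = 0.5 * (a + b)
    return (b - a) / 6.0 * (f(a) + 4.0 * f(m) + f(b))


def adaptive_simpson(f, a, b, tol=1e-8, max_depth=50):
    """Classical recursive adaptive Simpson quadrature (illustrative sketch).

    tol and max_depth are hypothetical defaults, not values from the paper.
    For brevity, function values are recomputed instead of being cached.
    """

    def recurse(lo, hi, whole, local_tol, depth):
        mid = 0.5 * (lo + hi)
        left = simpson(f, lo, mid)
        right = simpson(f, mid, hi)
        # Error estimate: difference between the two-panel and one-panel results.
        err = left + right - whole
        if depth <= 0 or abs(err) <= 15.0 * local_tol:
            # Accept the interval, applying the usual Richardson correction.
            return left + right + err / 15.0
        # Otherwise split; how deep this goes depends on the integrand,
        # so different subintervals terminate at different depths.
        return (recurse(lo, mid, left, 0.5 * local_tol, depth - 1)
                + recurse(mid, hi, right, 0.5 * local_tol, depth - 1))

    return recurse(a, b, simpson(f, a, b), tol, max_depth)
```

As a usage example, `adaptive_simpson(math.sin, 0.0, math.pi)` approximates the integral of sin over [0, π], whose exact value is 2.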
Pages: 652-662
Number of pages: 11