Bayesian Optimization for auto-tuning GPU kernels

被引:8
|
作者
Willemsen, Floris-Jan [1 ]
van Nieuwpoort, Rob [1 ]
van Werkhoven, Ben [2 ]
机构
[1] Univ Amsterdam, Netherlands eSci Ctr, Amsterdam, Netherlands
[2] Netherlands eSci Ctr, Amsterdam, Netherlands
基金
荷兰研究理事会;
关键词
Optimization; Bayesian Optimization; autotuning; GPU Computing; machine learning;
D O I
10.1109/PMBS54543.2021.00017
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Finding optimal parameter configurations for tunable GPU kernels is a non-trivial exercise for large search spaces, even when automated. This poses an optimization task on a nonconvex search space, using an expensive to evaluate function with unknown derivative. These characteristics make a good candidate for Bayesian Optimization, which has not been applied to this problem before. However, the application of Bayesian Optimization to this problem is challenging. We demonstrate how to deal with the rough, discrete, constrained search spaces, containing invalid configurations. We introduce a novel contextual variance exploration factor, as well as new acquisition functions with improved scalability, combined with an informed acquisition function selection mechanism. By comparing the performance of our Bayesian Optimization implementation on various test cases to the existing search strategies in Kernel Tuner, as well as other Bayesian Optimization implementations, we demonstrate that our search strategies generalize well and consistently outperform other search strategies by a wide margin.
引用
收藏
页码:106 / 117
页数:12
相关论文
共 50 条
  • [1] Benchmarking Optimization Algorithms for Auto-Tuning GPU Kernels
    Schoonhoven, Richard Arnoud
    van Werkhoven, Ben
    Batenburg, Kees Joost
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2023, 27 (03) : 550 - 564
  • [2] Bayesian Optimization for Auto-tuning Convolution Neural Network on GPU
    Zhu, Huming
    Liu, Chendi
    Zhang, Lingyun
    Dong, Ximiao
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT VI, 2024, 14492 : 478 - 489
  • [3] Accelerated Auto-Tuning of GPU Kernels for Tensor Computations
    Li, Chendi
    Xu, Yufan
    Saravani, Sina Mahdipour
    Sadayappan, P.
    PROCEEDINGS OF THE 38TH ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ACM ICS 2024, 2024, : 549 - 561
  • [4] A Fine-grained Prefetching Scheme for DGEMM Kernels on GPU with Auto-tuning Compatibility
    Li, Jialin
    Ye, Huang
    Tian, Shaobo
    Li, Xinyuan
    Zhang, Jian
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 863 - 874
  • [5] Weak in the NEES?: Auto-tuning Kalman Filters with Bayesian Optimization
    Chen, Zhaozhong
    Heckman, Christoffer
    Julier, Simon
    Ahmed, Nisar
    2018 21ST INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2018, : 1072 - 1079
  • [6] Kalman Filter Auto-Tuning With Consistent and Robust Bayesian Optimization
    Chen, Zhaozhong
    Biggie, Harel
    Ahmed, Nisar
    Julier, Simon
    Heckman, Christoffer
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (02) : 2236 - 2250
  • [7] Optimizing and Auto-tuning Belief Propagation on the GPU
    Grauer-Gray, Scott
    Cavazos, John
    LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, 2011, 6548 : 121 - 135
  • [8] Toward Techniques for Auto-tuning GPU Algorithms
    Davidson, Andrew
    Owens, John
    APPLIED PARALLEL AND SCIENTIFIC COMPUTING, PT II, 2012, 7134 : 110 - 119
  • [9] Adaptive GPU Array Layout Auto-Tuning
    Weber, Nicolas
    Goesele, Michael
    PROCEEDINGS OF THE ACM WORKSHOP ON SOFTWARE ENGINEERING METHODS FOR PARALLEL AND HIGH PERFORMANCE APPLICATIONS (SEM4HPC'16), 2016, : 21 - 28
  • [10] Auto-tuning Parameter Choices in HPC Applications using Bayesian Optimization
    Menon, Harshitha
    Bhatele, Abhinav
    Gamblin, Todd
    2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM IPDPS 2020, 2020, : 831 - 840