Bayesian Optimization for auto-tuning GPU kernels

Cited by: 8
Authors
Willemsen, Floris-Jan [1]
van Nieuwpoort, Rob [1]
van Werkhoven, Ben [2]
Affiliations
[1] Univ Amsterdam, Netherlands eSci Ctr, Amsterdam, Netherlands
[2] Netherlands eSci Ctr, Amsterdam, Netherlands
Funding
Dutch Research Council;
Keywords
Optimization; Bayesian Optimization; autotuning; GPU Computing; machine learning;
DOI
10.1109/PMBS54543.2021.00017
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Finding optimal parameter configurations for tunable GPU kernels is a non-trivial exercise for large search spaces, even when automated. This poses an optimization task on a nonconvex search space, using an expensive-to-evaluate function with unknown derivative. These characteristics make the problem a good candidate for Bayesian Optimization, which has not been applied to this problem before. However, the application of Bayesian Optimization to this problem is challenging. We demonstrate how to deal with rough, discrete, constrained search spaces that contain invalid configurations. We introduce a novel contextual variance exploration factor, as well as new acquisition functions with improved scalability, combined with an informed acquisition function selection mechanism. By comparing the performance of our Bayesian Optimization implementation on various test cases to the existing search strategies in Kernel Tuner, as well as other Bayesian Optimization implementations, we demonstrate that our search strategies generalize well and consistently outperform other search strategies by a wide margin.
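As an illustration of the general idea in the abstract, the following is a minimal sketch of Bayesian Optimization over a small discrete, constrained search space. It is not the authors' implementation and does not use the Kernel Tuner API: it uses a plain Gaussian process with an RBF kernel and the textbook Expected Improvement acquisition function rather than the contextual variance factor or the new acquisition functions the paper introduces. The parameter names (`block_size_x`, `block_size_y`), the thread-count constraint, and the `toy_runtime` objective are all hypothetical stand-ins for a real kernel benchmark.

```python
import itertools
import math

import numpy as np

# Hypothetical tunable parameters; names and values are illustrative only.
SEARCH_SPACE = {
    "block_size_x": [16, 32, 64, 128, 256],
    "block_size_y": [1, 2, 4, 8, 16],
}


def is_valid(cfg):
    # Assumed constraint: at most 1024 threads per block.
    return cfg[0] * cfg[1] <= 1024


def toy_runtime(cfg):
    # Stand-in for compiling and benchmarking a kernel (lower is better).
    x, y = math.log2(cfg[0]), math.log2(cfg[1])
    return (x - 6) ** 2 + (y - 2) ** 2 + 1.0


def rbf(a, b, length=0.3):
    # Squared-exponential kernel between two sets of points.
    d2 = ((a[:, None, :] - b[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / length**2)


def gp_posterior(x_obs, y_obs, x_new, noise=1e-6):
    # Posterior mean and standard deviation of a zero-mean, unit-variance GP.
    k = rbf(x_obs, x_obs) + noise * np.eye(len(x_obs))
    k_s = rbf(x_obs, x_new)
    mean = k_s.T @ np.linalg.solve(k, y_obs)
    var = 1.0 - np.einsum("ij,ij->j", k_s, np.linalg.solve(k, k_s))
    return mean, np.sqrt(np.clip(var, 1e-12, None))


def expected_improvement(mean, std, best):
    # Expected Improvement for minimization.
    z = (best - mean) / std
    cdf = 0.5 * (1.0 + np.vectorize(math.erf)(z / math.sqrt(2.0)))
    pdf = np.exp(-0.5 * z**2) / math.sqrt(2.0 * math.pi)
    return (best - mean) * cdf + std * pdf


def tune(budget=12, n_init=4, seed=0):
    rng = np.random.default_rng(seed)
    # Simplification: invalid configurations are filtered out up front.
    configs = [c for c in itertools.product(*SEARCH_SPACE.values()) if is_valid(c)]
    feats = np.array([[math.log2(v) for v in c] for c in configs])
    feats = (feats - feats.min(0)) / (feats.max(0) - feats.min(0))

    # Random initial sample, then model-guided selection.
    tried = [int(i) for i in rng.choice(len(configs), n_init, replace=False)]
    times = [toy_runtime(configs[i]) for i in tried]
    while len(tried) < budget:
        rest = [i for i in range(len(configs)) if i not in tried]
        y = np.array(times)
        y_std = (y - y.mean()) / (y.std() or 1.0)  # standardize observations
        mean, std = gp_posterior(feats[tried], y_std, feats[rest])
        ei = expected_improvement(mean, std, y_std.min())
        nxt = rest[int(np.argmax(ei))]
        tried.append(nxt)
        times.append(toy_runtime(configs[nxt]))

    best = int(np.argmin(times))
    return configs[tried[best]], times[best], len(tried)


if __name__ == "__main__":
    best_cfg, best_time, n_evals = tune()
    print(best_cfg, best_time, n_evals)
```

The sketch only ever scores configurations that exist in the discrete space, so no rounding of continuous proposals is needed. In a real tuning run some configurations fail only at compile time or run time, which this example does not model.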
Pages: 106-117
Page count: 12
Related papers
50 in total
  • [21] Auto-Tuning GEMM Kernels for a Decoupled Access/Execute Architecture Processor
    Zhao, Zeng
    Gu, Naijie
    Yang, Yangzhao
    2013 FIRST INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING (CANDAR), 2013, : 233 - 239
  • [22] Auto-tuning of Computation Kernels from an FDM Code with ppOpen-AT
    Katagiri, Takahiro
    Ohshima, Satoshi
    Matsumoto, Masaharu
    2014 IEEE 8TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANYCORE SOCS (MCSOC), 2014, : 91 - 98
  • [23] Monte Carlo Optimisation Auto-Tuning on a Multi-GPU Cluster
    Paukste, Andrius
    2012 2ND IEEE INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2012, : 894 - 898
  • [24] A Bayesian Network Approach for Compiler Auto-tuning for Embedded Processors
    Ashouri, Amir Hossein
    Mariani, Giovanni
    Palermo, Gianluca
    Silvano, Cristina
    2014 IEEE 12TH SYMPOSIUM ON EMBEDDED SYSTEMS FOR REAL-TIME MULTIMEDIA (ESTIMEDIA), 2014, : 90 - 97
  • [25] Tensile: Auto-tuning GEMM GPU Assembly for All Problem Sizes
    Tanner, David E.
    2018 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2018), 2018, : 1066 - 1075
  • [26] Data-driven auto-tuning strategy for RTO-MPC based on Bayesian optimization
    Zhang, Lingzhi
    Xie, Lei
    Su, Hongye
    Liu, Jinfeng
    COMPUTERS & CHEMICAL ENGINEERING, 2024, 187
  • [27] Adversarially robust Bayesian optimization for efficient auto-tuning of generic control structures under uncertainty
    Paulson, Joel A.
    Makrygiorgos, Georgios
    Mesbah, Ali
    AICHE JOURNAL, 2022, 68 (06)
  • [28] Vector optimization of combined systems of phase auto-tuning
    Okhrushchak, DV
    Skoblilova, NM
    Stasyuk, VI
    Mukhin, AM
    IZVESTIYA VYSSHIKH UCHEBNYKH ZAVEDENII RADIOELEKTRONIKA, 2003, 46 (7-8): : A30 - A34
  • [29] PERI Auto-Tuning
    Bailey, David H.
    Chame, Jacqueline
    Chen, Chun
    Dongarra, Jack
    Hall, Mary
    Hollingsworth, Jeffrey K.
    Hovland, Paul
    Moore, Shirley
    Seymour, Keith
    Shin, Jaewook
    Tiwari, Ananta
    Williams, Sam
    You, Haihang
    SCIDAC 2008: SCIENTIFIC DISCOVERY THROUGH ADVANCED COMPUTING, 2008, 125
  • [30] Input-Aware Auto-Tuning of Compute-Bound HPC Kernels
    Tillet, Philippe
    Cox, David
    SC'17: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2017,