Fast hyperparameter tuning using Bayesian optimization with directional derivatives

被引:42
|
作者
Joy, Tinu Theckel [1 ]
Rana, Santu [1 ]
Gupta, Sunil [1 ]
Venkatesh, Svetha [1 ]
机构
[1] Deakin Univ, Appl Artificial Intelligence Inst A2I2, Geelong, Vic, Australia
基金
澳大利亚研究理事会;
关键词
Bayesian optimization; Gaussian process; Hyperparameter tuning;
D O I
10.1016/j.knosys.2020.106247
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we develop a Bayesian optimization based hyperparameter tuning framework inspired by statistical learning theory for classifiers. We utilize two key facts from PAC learning theory; the generalization bound will be higher for a small subset of data compared to the whole, and the highest accuracy for a small subset of data can be achieved with a simple model. We initially tune the hyperparameters on a small subset of training data using Bayesian optimization. While tuning the hyperparameters on the whole training data, we leverage the insights from the learning theory to seek more complex models. We realize this by using directional derivative signs strategically placed in the hyperparameter search space to seek a more complex model than the one obtained with small data. We demonstrate the performance of our method on the tasks of tuning the hyperparameters of several machine learning algorithms. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Efficient Deep Learning Hyperparameter Tuning using Cloud Infrastructure Intelligent Distributed Hyperparameter tuning with Bayesian Optimization in the Cloud
    Ranjit, Mercy Prasanna
    Ganapathy, Gopinath
    Sridhar, Kalaivani
    Arumugham, Vikram
    2019 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2019), 2019, : 520 - 522
  • [2] Fast Bayesian hyperparameter optimization on large datasets
    Klein, Aaron
    Falkner, Stefan
    Bartels, Simon
    Hennig, Philipp
    Hutter, Frank
    ELECTRONIC JOURNAL OF STATISTICS, 2017, 11 (02): : 4945 - 4968
  • [3] Deep Learning on Active Sonar Data Using Bayesian Optimization for Hyperparameter Tuning
    Berg, Henrik
    Hjelmervik, Karl Thomas
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 6546 - 6553
  • [4] Hyperparameter Tuning of the Shunt-murmur Discrimination Algorithm Using Bayesian Optimization
    Noda, Fumiya
    Nishijima, Keisuke
    Furuya, Ken'ichi
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 929 - 933
  • [6] Efficient hyperparameter tuning for predicting student performance with Bayesian optimization
    Saleh Albahli
    Multimedia Tools and Applications, 2024, 83 : 52711 - 52735
  • [7] Practical Multi-fidelity Bayesian Optimization for Hyperparameter Tuning
    Wu, Jian
    Toscano-Palmerin, Saul
    Frazier, Peter, I
    Wilson, Andrew Gordon
    35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 788 - 798
  • [8] Efficient hyperparameter tuning for kernel ridge regression with Bayesian optimization
    Stuke, Annika
    Rinke, Patrick
    Todorovic, Milica
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2021, 2 (03):
  • [9] Hyperparameter Tuning for Big Data using Bayesian Optimisation
    Joy, Tinu Theckel
    Rana, Santu
    Gupta, Sunil
    Venkatesh, Svetha
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2574 - 2579
  • [10] Bayesian Hyperparameter Estimation using Gaussian Process and Bayesian Optimization
    Katakami, Shun
    Sakamoto, Hirotaka
    Okada, Masato
    JOURNAL OF THE PHYSICAL SOCIETY OF JAPAN, 2019, 88 (07)