Autotune: A Derivative-free Optimization Framework for Hyperparameter Tuning

被引:60
|
作者
Koch, Patrick [1 ]
Golovidov, Oleg [1 ]
Gardner, Steven [1 ]
Wujek, Brett [1 ]
Griffin, Joshua [1 ]
Xu, Yan [1 ]
机构
[1] SAS Inst Inc, Cary, NC 27513 USA
关键词
Derivative-free Optimization; Stochastic Optimization; Bayesian Optimization; Hyperparameters; Distributed Computing System; SEARCH;
D O I
10.1145/3219819.3219837
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning applications often require hyperparameter tuning. The hyperparameters usually drive both the efficiency of the model training process and the resulting model quality. For hyperparameter tuning, machine learning algorithms are complex black-boxes. This creates a class of challenging optimization problems, whose objective functions tend to be nonsmooth, discontinuous, unpredictably varying in computational expense, and include continuous, categorical, and/or integer variables. Further, function evaluations can fail for a variety of reasons including numerical difficulties or hardware failures. Additionally, not all hyperparameter value combinations are compatible, which creates so called hidden constraints. Robust and efficient optimization algorithms are needed for hyper-parameter tuning. In this paper we present an automated parallel derivative-free optimization framework called Autotune, which combines a number of specialized sampling and search methods that are very effective in tuning machine learning models despite these challenges. Autotune provides significantly improved models over using default hyperparameter settings with minimal user interaction on real-world applications. Given the inherent expense of training numerous candidate models, we demonstrate the effectiveness of Autotune's search methods and the efficient distributed and parallel paradigms for training and tuning models, and also discuss the resource trade-offs associated with the ability to both distribute the training process and parallelize the tuning process.
引用
收藏
页码:443 / 452
页数:10
相关论文
共 50 条
  • [21] Openly Revisiting Derivative-Free Optimization
    Rapin, Jeremy
    Dorval, Pauline
    Pondard, Jules
    Vasilache, Nicolas
    Cauwet, Marie-Liesse
    Couprie, Camille
    Teytaud, Olivier
    PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCCO'19 COMPANION), 2019, : 267 - 268
  • [22] BENCHMARKING DERIVATIVE-FREE OPTIMIZATION ALGORITHMS
    More, Jorge J.
    Wild, Stefan M.
    SIAM JOURNAL ON OPTIMIZATION, 2009, 20 (01) : 172 - 191
  • [23] Derivative-free Methods for Structural Optimization
    Ilunga, Guilherme
    Leitao, Antonio
    ECAADE 2018: COMPUTING FOR A BETTER TOMORROW, VO 1, 2018, : 179 - 186
  • [24] Derivative-Free Local Tuning and Local Improvement Techniques Embedded in the Univariate Global Optimization
    Yaroslav D. Sergeyev
    Marat S. Mukhametzhanov
    Dmitri E. Kvasov
    Daniela Lera
    Journal of Optimization Theory and Applications, 2016, 171 : 186 - 208
  • [25] An Accelerated Multistart Derivative-Free Framework for the Beam Angle Optimization Problem in IMRT
    Rocha, Humberto
    Dias, Joana M.
    Ventura, Tiago
    Ferreira, Brigida C.
    Lopes, Maria do Carmo
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2016, PT I, 2016, 9786 : 232 - 245
  • [26] Derivative-Free Local Tuning and Local Improvement Techniques Embedded in the Univariate Global Optimization
    Sergeyev, Yaroslav D.
    Mukhametzhanov, Marat S.
    Kvasov, Dmitri E.
    Lera, Daniela
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2016, 171 (01) : 186 - 208
  • [27] A trust-region framework for derivative-free mixed-integer optimization
    Torres, Juan J.
    Nannicini, Giacomo
    Traversi, Emiliano
    Wolfler Calvo, Roberto
    MATHEMATICAL PROGRAMMING COMPUTATION, 2024, 16 (03) : 369 - 422
  • [28] A discontinuous derivative-free optimization framework for multi-enterprise supply chain
    Atharv Bhosekar
    Marianthi Ierapetritou
    Optimization Letters, 2020, 14 : 959 - 988
  • [29] A derivative-free multistart framework for an automated noncoplanar beam angle optimization in IMRT
    Rocha, Humberto
    Dias, Joana
    Ventura, Tiago
    Ferreira, Brigida
    Lopes, Maria do Carmo
    MEDICAL PHYSICS, 2016, 43 (10) : 5514 - 5526
  • [30] A discontinuous derivative-free optimization framework for multi-enterprise supply chain
    Bhosekar, Athary
    Lerapetritou, Marianthi
    OPTIMIZATION LETTERS, 2020, 14 (04) : 959 - 988