Practical Multi-fidelity Bayesian Optimization for Hyperparameter Tuning

被引:0
|
作者
Wu, Jian [1 ]
Toscano-Palmerin, Saul [1 ]
Frazier, Peter, I [1 ]
Wilson, Andrew Gordon [2 ]
机构
[1] Cornell Univ, Operat Res & Informat Engn, Ithaca, NY 14850 USA
[2] NYU, Courant Inst Math Sci, New York, NY 10003 USA
关键词
SEARCH;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bayesian optimization is popular for optimizing time-consuming black-box objectives. Nonetheless, for hyperparameter tuning in deep neural networks, the time required to evaluate the validation error for even a few hyperparameter settings remains a bottleneck. Multi-fidelity optimization promises relief using cheaper proxies to such objectives - for example, validation error for a network trained using a subset of the training points or fewer iterations than required for convergence. We propose a highly flexible and practical approach to multi-fidelity Bayesian optimization, focused on efficiently optimizing hyperparameters for iteratively trained supervised learning models. We introduce a new acquisition function, the trace-aware knowledge-gradient, which efficiently leverages both multiple continuous fidelity controls and trace observations - values of the objective at a sequence of fidelities, available when varying fidelity using training iterations. We provide a provably convergent method for optimizing our acquisition function and show it outperforms state-of-the-art alternatives for hyperparameter tuning of deep neural networks and large-scale kernel learning.
引用
收藏
页码:788 / 798
页数:11
相关论文
共 50 条
  • [1] Improving Multi-fidelity Optimization with a Recurring Learning Rate for Hyperparameter Tuning
    Lee, HyunJae
    Lee, Gihyeon
    Kim, Junhwan
    Cho, Sungjun
    Kim, Dohyun
    Yoo, Donggeun
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2308 - 2317
  • [2] Towards Efficient Multiobjective Hyperparameter Optimization: A Multiobjective Multi-fidelity Bayesian Optimization and Hyperband Algorithm
    Chen, Zefeng
    Zhou, Yuren
    Huang, Zhengxin
    Xia, Xiaoyun
    [J]. PARALLEL PROBLEM SOLVING FROM NATURE - PPSN XVII, PPSN 2022, PT I, 2022, 13398 : 160 - 174
  • [3] Multi-fidelity Bayesian algorithm for antenna optimization
    Li, Jianxing
    Yang, An
    Tian, Chunming
    Ye, Le
    Chen, Badong
    [J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2022, 33 (06) : 1119 - 1126
  • [4] Multi-fidelity Bayesian algorithm for antenna optimization
    LI Jianxing
    YANG An
    TIAN Chunming
    YE Le
    CHEN Badong
    [J]. Journal of Systems Engineering and Electronics, 2022, 33 (06) : 1119 - 1126
  • [5] Multi-fidelity Bayesian Optimization of SWATH Hull Forms
    Bonfiglio, Luca
    Perdikaris, Paris
    Brizzolara, Stefano
    [J]. JOURNAL OF SHIP RESEARCH, 2020, 64 (02): : 154 - 170
  • [6] Multi-fidelity cost-aware Bayesian optimization
    Foumani, Zahra Zanjani
    Shishehbor, Mehdi
    Yousefpour, Amin
    Bostanabad, Ramin
    [J]. COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2023, 407
  • [7] YAHPO Gym - An Efficient Multi-Objective Multi-Fidelity Benchmark for Hyperparameter Optimization
    Pfisterer, Florian
    Schneider, Lennart
    Moosbauer, Julia
    Binder, Martin
    Bischl, Bernd
    [J]. INTERNATIONAL CONFERENCE ON AUTOMATED MACHINE LEARNING, VOL 188, 2022, 188
  • [8] A General Framework for Multi-fidelity Bayesian Optimization with Gaussian Processes
    Song, Jialin
    Chen, Yuxin
    Yue, Yisong
    [J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [9] Combining multi-fidelity modelling and asynchronous batch Bayesian Optimization
    Folch, Jose Pablo
    Lee, Robert M.
    Shafei, Behrang
    Walz, David
    Tsay, Calvin
    van der Wilk, Mark
    Misener, Ruth
    [J]. COMPUTERS & CHEMICAL ENGINEERING, 2023, 172
  • [10] Multi-Fidelity Bayesian Optimization via Deep Neural Networks
    Li, Shibo
    Xing, Wei
    Kirby, Robert M.
    Zhe, Shandian
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33