Efficient Hyper-parameter Optimization with Cubic Regularization

Authors
Shen, Zhenqian [1]
Yang, Hansi [2]
Li, Yong [1]
Kwok, James [2]
Yao, Quanming [1]
Affiliations
[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Because hyper-parameters are ubiquitous and can significantly affect model performance, hyper-parameter optimization is extremely important in machine learning. In this paper, we consider a sub-class of hyper-parameter optimization problems in which hyper-gradients are not available. Such problems frequently arise when the performance metric is non-differentiable or the hyper-parameter is not continuous. However, existing algorithms, such as Bayesian optimization and reinforcement learning, often get trapped in local optima with poor performance. To address these limitations, we propose using cubic regularization to accelerate convergence and avoid saddle points. First, we adopt stochastic relaxation, which makes it possible to obtain gradient and Hessian information without hyper-gradients. Then, we exploit the rich curvature information through cubic regularization. Theoretically, we prove that the proposed method converges to approximate second-order stationary points, and that convergence is also guaranteed when the lower-level problem is solved inexactly. Experiments on synthetic and real-world data demonstrate the effectiveness of the proposed method.
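The two steps named in the abstract (stochastic relaxation, then a cubic-regularized update) can be illustrated with a minimal sketch. It is not the authors' implementation. Everything specific in it is an assumption made for illustration: the toy metric `validation_error`, the `TARGET` configuration, the independent-Bernoulli relaxation with logits `theta`, the value of `rho`, and the plain gradient-descent solver for the cubic sub-problem. The expectation is enumerated exactly over all binary configurations so that the sketch is deterministic; in practice it would be replaced by Monte Carlo samples.

```python
import itertools
import numpy as np

TARGET = np.array([1.0, 0.0, 1.0])  # hypothetical best configuration (assumption)

def validation_error(z):
    """Toy non-differentiable metric: Hamming distance to TARGET (assumption)."""
    return float(np.sum(z != TARGET))

def sigmoid(t):
    return 1.0 / (1.0 + np.exp(-t))

def configs_with_probs(theta):
    """Yield every binary configuration z and its probability under p_theta."""
    p = sigmoid(theta)
    for bits in itertools.product([0.0, 1.0], repeat=len(theta)):
        z = np.array(bits)
        yield z, float(np.prod(np.where(z == 1.0, p, 1.0 - p)))

def relaxed_objective(theta, f):
    """Stochastic relaxation: J(theta) = E_{z ~ p_theta}[f(z)]."""
    return sum(prob * f(z) for z, prob in configs_with_probs(theta))

def grad_and_hessian(theta, f):
    """Score-function gradient and Hessian of J; f itself is never differentiated."""
    p = sigmoid(theta)
    score_hess = -np.diag(p * (1.0 - p))  # Hessian of log p_theta(z) w.r.t. logits
    g = np.zeros(len(theta))
    H = np.zeros((len(theta), len(theta)))
    for z, prob in configs_with_probs(theta):
        score = z - p  # gradient of log p_theta(z) w.r.t. logits
        g += prob * f(z) * score
        H += prob * f(z) * (np.outer(score, score) + score_hess)
    return g, H

def cubic_model(s, g, H, rho):
    """Cubic-regularized local model: g.s + 0.5 s.H.s + (rho/6) ||s||^3."""
    return g @ s + 0.5 * s @ H @ s + rho / 6.0 * np.linalg.norm(s) ** 3

def cubic_step(g, H, rho, lr=0.1, iters=500):
    """Approximately minimize the cubic model by gradient descent from s = 0."""
    s = np.zeros_like(g)
    for _ in range(iters):
        model_grad = g + H @ s + 0.5 * rho * np.linalg.norm(s) * s
        s = s - lr * model_grad
    return s

def optimize(f, dim, rho=1.0, outer_steps=20):
    """Repeat: estimate gradient and Hessian of J, then take a cubic step."""
    theta = np.zeros(dim)
    for _ in range(outer_steps):
        g, H = grad_and_hessian(theta, f)
        theta = theta + cubic_step(g, H, rho)
    return theta

theta = optimize(validation_error, dim=3)
print(np.round(sigmoid(theta), 3))  # per-coordinate probability of choosing 1
```

The gradient and Hessian here use only evaluations of the metric weighted by derivatives of the log-probability, which is why no hyper-gradient is needed. The gradient-descent inner solver does not handle the degenerate case where the gradient is exactly zero at a saddle point; a more careful sub-problem solver would be needed there.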
Pages: 12
Related papers
50 in total
  • [31] A Hyper-Parameter Optimization Approach to Automated Radiotherapy Treatment Planning
    Haaf, S.
    Kearney, V.
    Interian, Y.
    Valdes, G.
    Solberg, T.
    Perez-Andujar, A.
    MEDICAL PHYSICS, 2017, 44 (06) : 2901 - 2901
  • [32] Hyper-parameter optimization of gradient boosters for flood susceptibility analysis
    Lai, Tuan Anh
    Nguyen, Ngoc-Thach
    Bui, Quang-Thanh
    TRANSACTIONS IN GIS, 2023, 27 (01) : 224 - 238
  • [33] Experienced Optimization with Reusable Directional Model for Hyper-Parameter Search
    Hu, Yi-Qi
    Yu, Yang
    Zhou, Zhi-Hua
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2276 - 2282
  • [34] Quadratic optimization for the hyper-parameter based on maximum entropy search
    Li, Yuqi
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (03) : 4991 - 5006
  • [35] Hyper-parameter optimization in classification: To-do or not-to-do
    Ngoc Tran
    Schneider, Jean-Guy
    Weber, Ingo
    Qin, A. K.
    PATTERN RECOGNITION, 2020, 103
  • [36] Continuous Hyper-parameter OPtimization (CHOP) in an ensemble Kalman filter
    Luo, Xiaodong
    Xia, Chuan-An
    FRONTIERS IN APPLIED MATHEMATICS AND STATISTICS, 2022, 8
  • [37] Weighted Voting Based Ensemble Classification with Hyper-parameter Optimization
    Gokalp, Osman
    Tasci, Erdal
    2019 INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS CONFERENCE (ASYU), 2019, : 550 - 553
  • [38] Hyper-Parameter Optimization for Emotion Detection using Physiological Signals
    Albraikan, Amani
    Tobon, Diana P.
    El Saddik, Abdulmotaleb
    2018 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS (PERCOM WORKSHOPS), 2018,
  • [39] Deep Learning Hyper-Parameter Optimization for Video Analytics in Clouds
    Yaseen, Muhammad Usman
    Anjum, Ashiq
    Rana, Omer
    Antonopoulos, Nikolaos
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2019, 49 (01) : 253 - 264
  • [40] An Efficient Botnet Detection Methodology using Hyper-parameter Optimization Trough Grid-Search Techniques
    Gonzalez-Cuautle, David
    Yair Corral-Salinas, Uriel
    Sanchez-Perez, Gabriel
    Perez-Meana, Hector
    Toscano-Medina, Karina
    Hernandez-Suarez, Aldo
    2019 7TH INTERNATIONAL WORKSHOP ON BIOMETRICS AND FORENSICS (IWBF), 2019,