Efficient Hyper-parameter Optimization with Cubic Regularization

被引:0
|
作者
Shen, Zhenqian [1 ]
Yang, Hansi [2 ]
Li, Yong [1 ]
Kwok, James [2 ]
Yao, Quanming [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As hyper-parameters are ubiquitous and can significantly affect the model performance, hyper-parameter optimization is extremely important in machine learning. In this paper, we consider a sub-class of hyper-parameter optimization problems, where the hyper-gradients are not available. Such problems frequently appear when the performance metric is non-differentiable or the hyper-parameter is not continuous. However, existing algorithms, like Bayesian optimization and reinforcement learning, often get trapped in local optimals with poor performance. To address the above limitations, we propose to use cubic regularization to accelerate convergence and avoid saddle points. First, we adopt stochastic relaxation, which allows obtaining gradient and Hessian information without hyper-gradients. Then, we exploit the rich curvature information by cubic regularization. Theoretically, we prove that the proposed method can converge to approximate second-order stationary points, and the convergence is also guaranteed when the lower-level problem is inexactly solved. Experiments on synthetic and real-world data demonstrate the effectiveness of our proposed method.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Generating Pool of Classifiers with Hyper-Parameter Optimization for Ensemble
    Wang, Qiushi
    Chan, Hian-Leng
    IECON 2021 - 47TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2021,
  • [22] RHOASo: An Early Stop Hyper-Parameter Optimization Algorithm
    Munoz Castaneda, Angel Luis
    DeCastro-Garcia, Noemi
    Escudero Garcia, David
    MATHEMATICS, 2021, 9 (18)
  • [23] AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient Pre-trained Language Models
    Yin, Yichun
    Chen, Cheng
    Shang, Lifeng
    Jiang, Xin
    Chen, Xiao
    Liu, Qun
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 5146 - 5157
  • [24] Efficient hyper-parameter determination for regularised linear BRDF parameter retrieval
    Zobitz, J. M.
    Quaife, T.
    Nichols, N. K.
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2020, 41 (04) : 1437 - 1457
  • [25] Hyper-Parameter Optimization for Improving the Performance of Grammatical Evolution
    Wang, Hao
    Lou, Yitan
    Back, Thomas
    2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 2649 - 2656
  • [26] KGTuner: Efficient Hyper-parameter Search for Knowledge Graph Learning
    Zhang, Yongqi
    Zhou, Zhanke
    Yao, Quanming
    Li, Yong
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 2715 - 2735
  • [27] USING METAHEURISTICS FOR HYPER-PARAMETER OPTIMIZATION OF CONVOLUTIONAL NEURAL NETWORKS
    Bibaeva, Victoria
    2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,
  • [28] Hyper-Parameter Optimization for Privacy-Preserving Record Linkage
    Yu, Joyce
    Nabaglo, Jakub
    Vatsalan, Dinusha
    Henecka, Wilko
    Thorne, Brian
    ECML PKDD 2020 WORKSHOPS, 2020, 1323 : 281 - 296
  • [29] HYPER-PARAMETER OPTIMIZATION OF DEEP CONVOLUTIONAL NETWORKS FOR OBJECT RECOGNITION
    Talathi, Sachin S.
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 3982 - 3986
  • [30] Rethinking density ratio estimation based hyper-parameter optimization
    Fan, Zi-En
    Lian, Feng
    Li, Xin-Ran
    NEURAL NETWORKS, 2025, 182