On `p-hyperparameter Learning via Bilevel Nonsmooth Optimization

被引:0
|
作者
T., Okuno
A., Takeda
A., Kawana
M., Watanabe
机构
[1] Center for Advanced Intelligence Project, RIKEN, Tokyo,103-0027, Japan
[2] Graduate School of Information Science and Technology, The University of Tokyo, Tokyo,113-8656, Japan
[3] Center for Advanced Intelligence Project, RIKEN, Tokyo,103-0027, Japan
[4] Department of Industrial Engineering and Economics, Tokyo Institute of Technology, Tokyo,152-8550, Japan
[5] Department of Mathematical Informatics, The University of Tokyo, Tokyo,113-8656, Japan
基金
日本学术振兴会;
关键词
Computational methods;
D O I
暂无
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We propose a bilevel optimization strategy for selecting the best hyperparameter value for the nonsmooth `p regularizer with 0 p-regularized problem as the lower-level problem. Despite the recent popularity of nonconvex `p-regularizer and the usefulness of bilevel optimization for selecting hyperparameters, algorithms for such bilevel problems have not been studied because of the difficulty of `p-regularizer. Our contribution is the proposal of the first algorithm equipped with a theoretical guarantee for finding the best hyperparameter of `p-regularized supervised learning problems. Specifically, we propose a smoothing-type algorithm for the above mentioned bilevel optimization problems and provide a theoretical convergence guarantee for the algorithm. Indeed, since optimality conditions are not known for such bilevel optimization problems so far, new necessary optimality conditions, which are called the SB-KKT conditions, are derived and it is shown that a sequence generated by the proposed algorithm actually accumulates at a point satisfying the SB-KKT conditions under some mild assumptions. The proposed algorithm is simple and scalable as our numerical comparison to Bayesian optimization and grid search indicates. ©2021 Takayuki Okuno, Akiko Takeda, Akihiro Kawana, and Motokazu Watanabe.
引用
收藏
页码:1 / 47
相关论文
共 50 条
  • [1] On lp-hyperparameter Learning via Bilevel Nonsmooth Optimization
    Okuno, Takayuki
    Takeda, Akiko
    Kawana, Akihiro
    Watanabe, Motokazu
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [2] Nonsmooth Bilevel Programming for Hyperparameter Selection
    Moore, Gregory M.
    Bergeron, Charles
    Bennett, Kristin P.
    2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 374 - 381
  • [3] Bilevel Programming for Hyperparameter Optimization and Meta-Learning
    Franceschi, Luca
    Frasconi, Paolo
    Salzo, Saverio
    Grazzi, Riccardo
    Pontil, Massimilano
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [4] Hyperparameter Learning Under Data Poisoning: Analysis of the Influence of Regularization via Multiobjective Bilevel Optimization
    Carnerero-Cano, Javier
    Munoz-Gonzalez, Luis
    Spencer, Phillippa
    Lupu, Emil C.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 16008 - 16022
  • [5] Hyperparameter Learning Under Data Poisoning: Analysis of the Influence of Regularization via Multiobjective Bilevel Optimization
    Carnerero-Cano, Javier
    Munoz-Gonzalez, Luis
    Spencer, Phillippa
    Lupu, Emil C.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 16008 - 16022
  • [6] Improved Penalty Method via Doubly Stochastic Gradients for Bilevel Hyperparameter Optimization
    Shi, Wanli
    Gu, Bin
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9621 - 9629
  • [7] Stability and Generalization of Bilevel Programming in Hyperparameter Optimization
    Bao, Fan
    Wu, Guoqiang
    Li, Chongxuan
    Zhu, Jun
    Zhang, Bo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [8] A primal nonsmooth reformulation for bilevel optimization problems
    Helou, Elias S.
    Santos, Sandra A.
    Simoes, Lucas E. A.
    MATHEMATICAL PROGRAMMING, 2023, 198 (02) : 1381 - 1409
  • [9] A primal nonsmooth reformulation for bilevel optimization problems
    Elias S. Helou
    Sandra A. Santos
    Lucas E. A. Simões
    Mathematical Programming, 2023, 198 : 1381 - 1409
  • [10] Optimality conditions for nonsmooth multiobjective bilevel optimization problems
    Chuong, Thai Doan
    ANNALS OF OPERATIONS RESEARCH, 2020, 287 (02) : 617 - 642