On `p-hyperparameter Learning via Bilevel Nonsmooth Optimization

被引:0
|
作者
T., Okuno
A., Takeda
A., Kawana
M., Watanabe
机构
[1] Center for Advanced Intelligence Project, RIKEN, Tokyo,103-0027, Japan
[2] Graduate School of Information Science and Technology, The University of Tokyo, Tokyo,113-8656, Japan
[3] Center for Advanced Intelligence Project, RIKEN, Tokyo,103-0027, Japan
[4] Department of Industrial Engineering and Economics, Tokyo Institute of Technology, Tokyo,152-8550, Japan
[5] Department of Mathematical Informatics, The University of Tokyo, Tokyo,113-8656, Japan
基金
日本学术振兴会;
关键词
Computational methods;
D O I
暂无
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We propose a bilevel optimization strategy for selecting the best hyperparameter value for the nonsmooth `p regularizer with 0 p-regularized problem as the lower-level problem. Despite the recent popularity of nonconvex `p-regularizer and the usefulness of bilevel optimization for selecting hyperparameters, algorithms for such bilevel problems have not been studied because of the difficulty of `p-regularizer. Our contribution is the proposal of the first algorithm equipped with a theoretical guarantee for finding the best hyperparameter of `p-regularized supervised learning problems. Specifically, we propose a smoothing-type algorithm for the above mentioned bilevel optimization problems and provide a theoretical convergence guarantee for the algorithm. Indeed, since optimality conditions are not known for such bilevel optimization problems so far, new necessary optimality conditions, which are called the SB-KKT conditions, are derived and it is shown that a sequence generated by the proposed algorithm actually accumulates at a point satisfying the SB-KKT conditions under some mild assumptions. The proposed algorithm is simple and scalable as our numerical comparison to Bayesian optimization and grid search indicates. ©2021 Takayuki Okuno, Akiko Takeda, Akihiro Kawana, and Motokazu Watanabe.
引用
收藏
页码:1 / 47
相关论文
共 50 条
  • [21] TransBO: Hyperparameter Optimization via Two-Phase Transfer Learning
    Li, Yang
    Shen, Yu
    Jiang, Huaijun
    Zhang, Wentao
    Yang, Zhi
    Zhang, Ce
    Cui, Bin
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 956 - 966
  • [22] LEARNING SMOOTH MODELS OF NONSMOOTH FUNCTIONS VIA CONVEX OPTIMIZATION
    Lauer, F.
    Le, V. L.
    Bloch, G.
    2012 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2012,
  • [23] Bilevel optimization and machine learning
    Bennett, Kristin P.
    Kunapuli, Gautam
    Hu, Jing
    Pang, Jong-Shi
    COMPUTATIONAL INTELLIGENCE: RESEARCH FRONTIERS, 2008, 5050 : 25 - +
  • [24] Bilevel hyperparameter optimization for support vector classification: theoretical analysis and a solution method
    Qingna Li
    Zhen Li
    Alain Zemkoho
    Mathematical Methods of Operations Research, 2022, 96 : 315 - 350
  • [25] Fast-Convergence Federated Edge Learning via Bilevel Optimization
    Wang, Sai
    Gong, Yi
    2023 28TH ASIA PACIFIC CONFERENCE ON COMMUNICATIONS, APCC 2023, 2023, : 87 - 92
  • [26] SEMI-SUPERVISED BATCH ACTIVE LEARNING VIA BILEVEL OPTIMIZATION
    Borsos, Zalan
    Tagliasacchi, Marco
    Krause, Andreas
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3495 - 3499
  • [27] A fast smoothing newton method for bilevel hyperparameter optimization for SVC with logistic loss
    Wang, Yixin
    Li, Qingna
    OPTIMIZATION, 2024,
  • [28] AN ALTERNATING LINEARIZATION METHOD WITH INEXACT DATA FOR BILEVEL NONSMOOTH CONVEX OPTIMIZATION
    Li, Dan
    Pang, Li-Ping
    Guo, Fang-Fang
    Xia, Zun-Quan
    JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2014, 10 (03) : 859 - 869
  • [29] Optimality conditions for nonsmooth multiobjective bilevel optimization using tangential subdifferentials
    Jennane, Mohsine
    Kalmoun, El Mostafa
    El Fadil, Lhoussain
    RAIRO-OPERATIONS RESEARCH, 2021, 55 (05) : 3041 - 3048
  • [30] Bilevel hyperparameter optimization for support vector classification: theoretical analysis and a solution method
    Li, Qingna
    Li, Zhen
    Zemkoho, Alain
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2022, 96 (03) : 315 - 350