On `p-hyperparameter Learning via Bilevel Nonsmooth Optimization

被引:0
|
作者
T., Okuno
A., Takeda
A., Kawana
M., Watanabe
机构
[1] Center for Advanced Intelligence Project, RIKEN, Tokyo,103-0027, Japan
[2] Graduate School of Information Science and Technology, The University of Tokyo, Tokyo,113-8656, Japan
[3] Center for Advanced Intelligence Project, RIKEN, Tokyo,103-0027, Japan
[4] Department of Industrial Engineering and Economics, Tokyo Institute of Technology, Tokyo,152-8550, Japan
[5] Department of Mathematical Informatics, The University of Tokyo, Tokyo,113-8656, Japan
基金
日本学术振兴会;
关键词
Computational methods;
D O I
暂无
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We propose a bilevel optimization strategy for selecting the best hyperparameter value for the nonsmooth `p regularizer with 0 p-regularized problem as the lower-level problem. Despite the recent popularity of nonconvex `p-regularizer and the usefulness of bilevel optimization for selecting hyperparameters, algorithms for such bilevel problems have not been studied because of the difficulty of `p-regularizer. Our contribution is the proposal of the first algorithm equipped with a theoretical guarantee for finding the best hyperparameter of `p-regularized supervised learning problems. Specifically, we propose a smoothing-type algorithm for the above mentioned bilevel optimization problems and provide a theoretical convergence guarantee for the algorithm. Indeed, since optimality conditions are not known for such bilevel optimization problems so far, new necessary optimality conditions, which are called the SB-KKT conditions, are derived and it is shown that a sequence generated by the proposed algorithm actually accumulates at a point satisfying the SB-KKT conditions under some mild assumptions. The proposed algorithm is simple and scalable as our numerical comparison to Bayesian optimization and grid search indicates. ©2021 Takayuki Okuno, Akiko Takeda, Akihiro Kawana, and Motokazu Watanabe.
引用
收藏
页码:1 / 47
相关论文
共 50 条
  • [31] Interactive Hyperparameter Optimization in Multi-Objective Problems via Preference Learning
    Giovanelli, Joseph
    Tornede, Alexander
    Tornede, Tanja
    Lindauer, Marius
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 11, 2024, : 12172 - 12180
  • [32] Enhanced Bilevel Optimization via Bregman
    Huang, Feihu
    Li, Junyi
    Gao, Shangqian
    Huang, Heng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [33] Hyperparameter Learning via Distributional Transfer
    Law, Ho Chung Leon
    Zhao, Peilin
    Chan, Lucian
    Huang, Junzhou
    Sejdinovic, Dino
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [34] Data Summarization via Bilevel Optimization
    Borsos, Zalan
    Mutny, Mojmir
    Tagliasacchi, Marco
    Krause, Andreas
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25 : 1 - 53
  • [35] Model selection via bilevel optimization
    Bennett, Kristin P.
    Hu, Jing
    Ji, Xiaoyun
    Kunapuli, Gautam
    Pang, Jong-Shi
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 1922 - +
  • [36] A fresh look at nonsmooth Levenberg-Marquardt methods with applications to bilevel optimization
    Jolaoso, Lateef O.
    Mehlitz, Patrick
    Zemkoho, Alain B.
    OPTIMIZATION, 2024,
  • [37] Unified smoothing approach for best hyperparameter selection problem using a bilevel optimization strategy
    Alcantara, Jan Harold
    Nguyen, Chieu Thanh
    Okuno, Takayuki
    Takeda, Akiko
    Chen, Jein-Shan
    MATHEMATICAL PROGRAMMING, 2024,
  • [38] APPROXIMATE KARUSH-KUHN-TUCKER CONDITIONS FOR NONSMOOTH BILEVEL OPTIMIZATION PROBLEMS
    El Idrissi, Rachid
    Kalmoun, El Mostafa
    Lafhim, Lahoussine
    JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2025, 21 (02) : 988 - 1006
  • [39] APPROXIMATE KARUSH-KUHN-TUCKER CONDITIONS FOR NONSMOOTH BILEVEL OPTIMIZATION PROBLEMS
    El Idrissi, Rachid
    Kalmoun, El Mostafa
    Lafhim, Lahoussine
    JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2025, 21 (02) : 988 - 1006
  • [40] Hyperparameter Optimization via Sequential Uniform Designs
    Yang, Zebin
    Zhang, Aijun
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22