Provably Faster Algorithms for Bilevel Optimization

被引:0
|
作者
Yang, Junjie [1 ]
Ji, Kaiyi [2 ]
Liang, Yingbin [1 ]
机构
[1] Ohio State Univ, Dept ECE, Columbus, OH 43210 USA
[2] Univ Michigan, Dept EECS, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bilevel optimization has been widely applied in many important machine learning applications such as hyperparameter optimization and meta-learning. Recently, several momentum-based algorithms have been proposed to solve bilevel optimization problems faster. However, those momentum-based algorithms do not achieve provably better computational complexity than e (O) over tilde(epsilon(-2)) of the SGD-based algorithm. In this paper, we propose two new algorithms for bilevel optimization, where the first algorithm adopts momentum-based recursive iterations, and the second algorithm adopts recursive gradient estimations in nested loops to decrease the variance. We show that both algorithms achieve the complexity of (O) over tilde(epsilon(-1.5)), which outperforms all existing algorithms by the order of magnitude. Our experiments validate our theoretical results and demonstrate the superior empirical performance of our algorithms in hyperparameter applications.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Multiobjective bilevel optimization
    Gabriele Eichfelder
    Mathematical Programming, 2010, 123 : 419 - 449
  • [32] Riemannian Bilevel Optimization
    Li, Jiaxiang
    Ma, Shiqian
    JOURNAL OF MACHINE LEARNING RESEARCH, 2025, 26 : 1 - 44
  • [33] Decentralized bilevel optimization
    Chen, Xuxing
    Huang, Minhui
    Ma, Shiqian
    OPTIMIZATION LETTERS, 2024,
  • [34] Faster Gradient-Free Algorithms for Nonsmooth Nonconvex Stochastic Optimization
    Chen, Lesi
    Xu, Jing
    Luo, Luo
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [35] Riemannian Bilevel Optimization
    Li, Jiaxiang
    Ma, Shiqian
    Journal of Machine Learning Research, 2025, 26
  • [36] Nested evolutionary algorithms for computationally expensive bilevel optimization problems: Variants and their systematic analysis
    Singh, Hemant Kumar
    Islam, Md Monjurul
    Ray, Tapabrata
    Ryan, Michael
    SWARM AND EVOLUTIONARY COMPUTATION, 2019, 48 : 329 - 344
  • [37] Trust-region algorithms for derivative-free optimization and nonlinear bilevel programming
    Colson B.
    Quarterly Journal of the Belgian, French and Italian Operations Research Societies, 2004, 2 (1) : 85 - 88
  • [38] Bilevel Optimization by Conditional Bayesian Optimization
    Dogan, Vedat
    Prestwich, Steven
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2023, PT I, 2024, 14505 : 243 - 258
  • [39] ALGORITHMS FOR NONLINEAR BILEVEL MATHEMATICAL PROGRAMS
    EDMUNDS, TA
    BARD, JF
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1991, 21 (01): : 83 - 89
  • [40] Algorithms and applications for a class of bilevel MILPs
    Poirion, Pierre-Louis
    Toubaline, Sonia
    D'Ambrosio, Claudia
    Liberti, Leo
    DISCRETE APPLIED MATHEMATICS, 2020, 272 (272) : 75 - 89