Provably Faster Algorithms for Bilevel Optimization

被引:0
|
作者
Yang, Junjie [1 ]
Ji, Kaiyi [2 ]
Liang, Yingbin [1 ]
机构
[1] Ohio State Univ, Dept ECE, Columbus, OH 43210 USA
[2] Univ Michigan, Dept EECS, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bilevel optimization has been widely applied in many important machine learning applications such as hyperparameter optimization and meta-learning. Recently, several momentum-based algorithms have been proposed to solve bilevel optimization problems faster. However, those momentum-based algorithms do not achieve provably better computational complexity than e (O) over tilde(epsilon(-2)) of the SGD-based algorithm. In this paper, we propose two new algorithms for bilevel optimization, where the first algorithm adopts momentum-based recursive iterations, and the second algorithm adopts recursive gradient estimations in nested loops to decrease the variance. We show that both algorithms achieve the complexity of (O) over tilde(epsilon(-1.5)), which outperforms all existing algorithms by the order of magnitude. Our experiments validate our theoretical results and demonstrate the superior empirical performance of our algorithms in hyperparameter applications.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Parallel Differential Evolution Algorithms for Stackelberg-Nash Bilevel Optimization Problems
    Magalhaes, Thiago Tavares
    Helio, Barbosa J. C.
    2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,
  • [22] Lossless Compression of Bilevel ROI Maps of Hyperspectral Images Using Optimization Algorithms
    Hooda, Reetu
    Pan, W. David
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [23] FASTER ALGORITHMS
    BEEKER, ER
    BYTE, 1982, 7 (02): : 20 - &
  • [24] PROVABLY FASTER GRADIENT DESCENT VIA LONG STEPS
    Grimmer, Benjamin
    SIAM JOURNAL ON OPTIMIZATION, 2024, 34 (03) : 2588 - 2608
  • [25] Multiobjective bilevel optimization
    Eichfelder, Gabriele
    MATHEMATICAL PROGRAMMING, 2010, 123 (02) : 419 - 449
  • [26] PESSIMISTIC BILEVEL OPTIMIZATION
    Wiesemann, Wolfram
    Tsoukalas, Angelos
    Kleniati, Polyxeni-Margarita
    Rustem, Berc
    SIAM JOURNAL ON OPTIMIZATION, 2013, 23 (01) : 353 - 380
  • [27] An overview of bilevel optimization
    Colson, Benoit
    Marcotte, Patrice
    Savard, Gilles
    ANNALS OF OPERATIONS RESEARCH, 2007, 153 (01) : 235 - 256
  • [28] PROVABLY ACCELERATED RANDOMIZED GOSSIP ALGORITHMS
    Loizou, Nicolas
    Rabbat, Michael
    Richtarik, Peter
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7505 - 7509
  • [29] Provably fast algorithms for contour tracking
    Freedman, D
    Brandstein, MS
    IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, VOL I, 2000, : 139 - 144
  • [30] An overview of bilevel optimization
    Benoît Colson
    Patrice Marcotte
    Gilles Savard
    Annals of Operations Research, 2007, 153 : 235 - 256