Provably Faster Algorithms for Bilevel Optimization

被引:0
|
作者
Yang, Junjie [1 ]
Ji, Kaiyi [2 ]
Liang, Yingbin [1 ]
机构
[1] Ohio State Univ, Dept ECE, Columbus, OH 43210 USA
[2] Univ Michigan, Dept EECS, Ann Arbor, MI 48109 USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021) | 2021年 / 34卷
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bilevel optimization has been widely applied in many important machine learning applications such as hyperparameter optimization and meta-learning. Recently, several momentum-based algorithms have been proposed to solve bilevel optimization problems faster. However, those momentum-based algorithms do not achieve provably better computational complexity than e (O) over tilde(epsilon(-2)) of the SGD-based algorithm. In this paper, we propose two new algorithms for bilevel optimization, where the first algorithm adopts momentum-based recursive iterations, and the second algorithm adopts recursive gradient estimations in nested loops to decrease the variance. We show that both algorithms achieve the complexity of (O) over tilde(epsilon(-1.5)), which outperforms all existing algorithms by the order of magnitude. Our experiments validate our theoretical results and demonstrate the superior empirical performance of our algorithms in hyperparameter applications.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Practical bilevel programming: Algorithms and applications
    Anandalingam, G
    INTERFACES, 1999, 29 (06) : 141 - 143
  • [42] PARTICLE SWARM OPTIMIZATION ALGORITHMS BASED ON KKTPM TERMINATION CONDITION FOR OPTIMISTIC SEMIVECTOR BILEVEL PROGRAMS
    Zhang, Tao
    Chen, Jiawei
    Wan, Zhongping
    Zheng, Yue
    JOURNAL OF NONLINEAR AND CONVEX ANALYSIS, 2020, 21 (02) : 519 - 532
  • [43] PROVABLY OPTIMAL-ALGORITHMS FOR SIGNAL ROUTING
    PAPADOPOULOS, CV
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 1994, 9 (04): : 211 - 219
  • [44] Faster Stochastic Algorithms for Minimax Optimization under Polyak-Lojasiewicz Conditions
    Chen, Lesi
    Yao, Boyuan
    Luo, Luo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [45] Faster Stochastic Algorithms for Minimax Optimization under Polyak-Lojasiewicz Conditions
    Chen, Lesi
    Yao, Boyuan
    Luo, Luo
    Advances in Neural Information Processing Systems, 2022, 35
  • [46] Faster Single-loop Algorithms for Minimax Optimization without Strong Concavity
    Yang, Junchi
    Orvieto, Antonio
    Lucchi, Aurelien
    He, Niao
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [47] Provably correct derivation of algorithms using FermaT
    Ward, Martin
    Zedan, Hussein
    FORMAL ASPECTS OF COMPUTING, 2014, 26 (05) : 993 - 1031
  • [48] Special issue on bilevel optimization
    Brotcorne, Luce
    Fortz, Bernard
    Labbe, Martine
    EURO JOURNAL ON COMPUTATIONAL OPTIMIZATION, 2020, 8 (01) : 1 - 2
  • [49] Multicriteria approach to bilevel optimization
    Fliege, J.
    Vicente, L. N.
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2006, 131 (02) : 209 - 225
  • [50] Pessimistic Bilevel Optimization: A Survey
    Liu, June
    Fan, Yuxin
    Chen, Zhong
    Zheng, Yue
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2018, 11 (01) : 725 - 736