Faster Stochastic Algorithms for Minimax Optimization under Polyak-Lojasiewicz Conditions

被引:0
|
作者
Chen, Lesi [1 ]
Yao, Boyuan [1 ]
Luo, Luo [1 ]
机构
[1] Fudan Univ, Sch Data Sci, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper considers stochastic first-order algorithms for minimax optimization under Polyak-Lojasiewicz (PL) conditions. We propose SPIDER-GDA for solving the finite-sum problem of the form min(x) max(y) f(x, y) (sic) 1/n Sigma(n)(i=1) f(i)(x, y), where the objective function f(x, y) is mu(x)-PL in x and mu(y)-PL in y; and each f(i)(x, y) is L-smooth. We prove SPIDER-GDA could find an.-approximate solution within O((n + root n kappa(x)kappa(2)(y)) log(1/epsilon)) stochastic first-order oracle (SFO) complexity, which is better than the state-of-the-art method whose SFO upper bound is O((n + n(2/3)kappa(x)kappa(2)(y)) log(1/epsilon)), where kappa(x) (sic) L/mu(x) and kappa(y) (sic) L/mu(y). For the ill-conditioned case, we provide an accelerated algorithm to reduce the computational cost further. It achieves (O) over tilde (n + root n kappa(x)kappa(y)) log(2) (1/epsilon)) SFO upper bound when kappa(y) greater than or similar to root n. Our ideas also can be applied to the more general setting that the objective function only satisfies PL condition for one variable. Numerical experiments validate the superiority of proposed methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Faster Stochastic Algorithms for Minimax Optimization under Polyak-Lojasiewicz Conditions
    Chen, Lesi
    Yao, Boyuan
    Luo, Luo
    Advances in Neural Information Processing Systems, 2022, 35
  • [2] Asynchronous Parallel Nonconvex Optimization Under the Polyak-Lojasiewicz Condition
    Yazdani, Kasra
    Hale, Matthew
    IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 524 - 529
  • [3] Gradient-Free Algorithms for Solving Stochastic Saddle Optimization Problems with the Polyak-Lojasiewicz Condition
    Sadykov, S. I.
    Lobanov, A. V.
    Raigorodskii, A. M.
    PROGRAMMING AND COMPUTER SOFTWARE, 2023, 49 (06) : 535 - 547
  • [4] A Generalized Alternating Method for Bilevel Optimization under the Polyak-Lojasiewicz Condition
    Rensselaer Polytechnic Institute, Troy
    NY, United States
    不详
    NY, United States
    arXiv, 1600,
  • [5] A Generalized Alternating Method for Bilevel Optimization under the Polyak-Lojasiewicz Condition
    Xiao, Quan
    Lu, Songtao
    Chen, Tianyi
    Advances in Neural Information Processing Systems, 2023, 36
  • [6] Distributed Event-Triggered Nonconvex Optimization under Polyak-Lojasiewicz Condition
    Gao, Chao
    Xu, Lei
    Zhang, Kunpeng
    Li, Yuzhe
    Liu, Zhiwei
    Yang, Tao
    2024 18TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, ICARCV, 2024, : 930 - 935
  • [7] OVER-PARAMETERIZED MODEL OPTIMIZATION WITH POLYAK-LOJASIEWICZ CONDITION
    Chen, Yixuan
    Shi, Yubin
    Dong, Mingzhi
    Yang, Xiaochen
    Li, Dongsheng
    Wang, Yujiang
    Dick, Robert P.
    Lv, Qin
    Zhao, Yingying
    Yang, Fan
    Gu, Ning
    Shang, Li
    11th International Conference on Learning Representations, ICLR 2023, 2023,
  • [8] Online Stochastic Gradient Methods Under Sub-Weibull Noise and the Polyak-Lojasiewicz Condition
    Kim, Seunghyun
    Madden, Liam
    Dall'Anese, Emiliano
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 3499 - 3506
  • [9] Online Stochastic Gradient Methods Under Sub-Weibull Noise and the Polyak-Lojasiewicz Condition
    University of Colorado, Department of Applied Mathematics, Boulder, United States
    不详
    Proc IEEE Conf Decis Control, 2022, (3499-3506):
  • [10] Dynamic Regret Bounds for Constrained Online Nonconvex Optimization Based on Polyak-Lojasiewicz Regions
    Mulvaney-Kemp, Julie
    Park, SangWoo
    Jin, Ming
    Lavaei, Javad
    IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (02): : 599 - 611