Stochastic Recursive Gradient Descent Ascent for Stochastic Nonconvex-Strongly-Concave Minimax Problems

Cited by: 0
Authors:
Luo, Luo [1]
Ye, Haishan [2]
Huang, Zhichao [1]
Zhang, Tong [1]
Affiliations:
[1] Hong Kong Univ Sci & Technol, Dept Math, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Shenzhen Res Inst Big Data, Shenzhen, Peoples R China
Keywords: none listed
DOI: not available
Chinese Library Classification: TP18 [Theory of Artificial Intelligence]
Discipline codes: 081104; 0812; 0835; 1405
Abstract
摘要
We consider nonconvex-concave minimax optimization problems of the form min_x max_{y∈Y} f(x, y), where f is strongly concave in y but possibly nonconvex in x, and Y is a convex and compact set. We focus on the stochastic setting, where we can only access an unbiased stochastic gradient estimate of f at each iteration. This formulation includes many machine learning applications as special cases, such as robust optimization and adversarial training. We are interested in finding an O(ε)-stationary point of the function Φ(·) = max_{y∈Y} f(·, y). The most popular algorithm to solve this problem is stochastic gradient descent ascent, which requires O(κ³ε⁻⁴) stochastic gradient evaluations, where κ is the condition number. In this paper, we propose a novel method called Stochastic Recursive gradiEnt Descent Ascent (SREDA), which estimates gradients more efficiently using variance reduction. This method achieves the best known stochastic gradient complexity of O(κ³ε⁻³), and its dependency on ε is optimal for this problem.
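The abstract contrasts plain stochastic gradient descent ascent (SGDA) with the variance-reduced SREDA. As a rough illustration of the SGDA baseline only (not the paper's SREDA method), the sketch below alternates a stochastic descent step in x with a projected stochastic ascent step in y on a toy nonconvex-strongly-concave objective; the objective, step sizes, and noise model are illustrative assumptions, not taken from the paper.

```python
import numpy as np

# Toy objective: f(x, y) = sin(x) * y - 0.5 * y**2 with Y = [-1, 1].
# f is 1-strongly concave in y and nonconvex in x, matching the
# problem class min_x max_{y in Y} f(x, y) described in the abstract.

def grad_x(x, y):
    return np.cos(x) * y          # ∂f/∂x

def grad_y(x, y):
    return np.sin(x) - y          # ∂f/∂y

def project_Y(y, lo=-1.0, hi=1.0):
    # Euclidean projection onto the convex, compact set Y = [lo, hi]
    return np.clip(y, lo, hi)

def sgda(x0, y0, eta_x, eta_y, iters, noise=0.01, seed=0):
    rng = np.random.default_rng(seed)
    x, y = x0, y0
    for _ in range(iters):
        # unbiased stochastic gradient estimates (additive Gaussian noise)
        gx = grad_x(x, y) + noise * rng.standard_normal()
        gy = grad_y(x, y) + noise * rng.standard_normal()
        x = x - eta_x * gx              # descent step in x
        y = project_Y(y + eta_y * gy)   # projected ascent step in y
    return x, y
```

For this toy f the inner maximizer y*(x) = sin(x) lies in Y, so Φ(x) = sin²(x)/2: the ascent steps let y track y*(x) while the descent steps drive x toward a stationary point of Φ. SREDA keeps this outer structure but replaces the fresh stochastic gradients gx, gy with recursive variance-reduced estimators, which is where the improved ε⁻³ dependency comes from.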
Pages: 12
Related papers
50 in total
  • [21] Learning Rates for Stochastic Gradient Descent With Nonconvex Objectives
    Lei, Yunwen
    Tang, Ke
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (12) : 4505 - 4511
  • [22] Accelerated Proximal Alternating Gradient-Descent-Ascent for Nonconvex Minimax Machine Learning
    Chen, Ziyi
    Ma, Shaocong
    Zhou, Yi
    2022 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, ISIT, 2022, : 672 - 677
  • [23] Stochastic Gradient Descent for Nonconvex Learning Without Bounded Gradient Assumptions
    Lei, Yunwen
    Hu, Ting
    Li, Guiying
    Tang, Ke
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (10) : 4394 - 4400
  • [24] Stability and Generalization of the Decentralized Stochastic Gradient Descent Ascent Algorithm
    Zhu, Miaoxi
    Shen, Li
    Du, Bo
    Tao, Dacheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
  • [25] ON DISTRIBUTED STOCHASTIC GRADIENT DESCENT FOR NONCONVEX FUNCTIONS IN THE PRESENCE OF BYZANTINES
    Bulusu, Saikiran
    Khanduri, Prashant
    Sharma, Pranay
    Varshney, Pramod K.
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3137 - 3141
  • [26] An Alternating Gradient Projection Algorithm with Momentum for Nonconvex-Concave Minimax Problems
    Li, Jue-You
    Xie, Tao
    JOURNAL OF THE OPERATIONS RESEARCH SOCIETY OF CHINA, 2024
  • [27] Efficient Decentralized Stochastic Gradient Descent Method for Nonconvex Finite-Sum Optimization Problems
    Zhan, Wenkang
    Wu, Gang
    Gao, Hongchang
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9006 - 9013
  • [28] GRAND: A Gradient-Related Ascent and Descent Algorithmic Framework for Minimax Problems
    Niu, Xiaochun
    Wei, Ermin
    2022 58TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2022
  • [29] Communication-Efficient Stochastic Gradient Descent Ascent with Momentum Algorithms
    Zhang, Yihan
    Qiu, Meikang
    Gao, Hongchang
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4602 - 4610
  • [30] Local Stochastic Gradient Descent Ascent: Convergence Analysis and Communication Efficiency
    Deng, Yuyang
    Mahdavi, Mehrdad
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130