ROBUST ACCELERATED GRADIENT METHODS FOR SMOOTH STRONGLY CONVEX FUNCTIONS

被引：29

作者：

Aybat, Necdet Serhat ^{[1
]}

Fallah, Alireza ^{[2
]}

Gurbuzbalaban, Mert ^{[3
]}

Ozdaglar, Asuman ^{[2
]}

机构：

[1] Penn State Univ, Dept Ind & Mfg Engn, University Pk, PA 16802 USA

[2] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA 02139 USA

[3] Rutgers State Univ, Dept Management Sci & Informat Syst, Piscataway, NJ 08854 USA

来源：

SIAM JOURNAL ON OPTIMIZATION | 2020年 / 30卷 / 01期

关键词：

convex optimization; stochastic approximation; robust control theory; accelerated methods; Nesterov's method; matrix inequalities; STOCHASTIC-APPROXIMATION ALGORITHMS; OPTIMIZATION ALGORITHMS; COMPOSITE OPTIMIZATION; H-2;

D O I：

10.1137/19M1244925

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

We study the trade-offs between convergence rate and robustness to gradient errors in designing a first-order algorithm. We focus on gradient descent and accelerated gradient (AG) methods for minimizing strongly convex functions when the gradient has random errors in the form of additive white noise. With gradient errors, the function values of the iterates need not converge to the optimal value; hence, we define the robustness of an algorithm to noise as the asymptotic expected suboptimality of the iterate sequence to input noise power. For this robustness measure, we provide exact expressions for the quadratic case using tools from robust control theory and tight upper bounds for the smooth strongly convex case using Lyapunov functions certified through matrix inequalities. We use these characterizations within an optimization problem which selects parameters of each algorithm to achieve a particular trade-off between rate and robustness. Our results show that AG can achieve acceleration while being more robust to random gradient errors. This behavior is quite different than previously reported in the deterministic gradient noise setting. We also establish some connections between the robustness of an algorithm and how quickly it can converge back to the optimal solution if it is perturbed from the optimal point with deterministic noise. Our framework also leads to practical algorithms that can perform better than other state-of-the-art methods in the presence of random gradient noise.

引用

页码：717 / 751

页数：35

共 50 条

[31] Improved Accelerated Gradient Algorithms with Line Search for Smooth Convex Optimization Problems
Li, Ting
Song, Yongzhong
Cai, Xingju
ASIA-PACIFIC JOURNAL OF OPERATIONAL RESEARCH, 2024, 41 (03)
[32] Near-optimal tensor methods for minimizing the gradient norm of convex functions and accelerated primal-dual tensor methods
Dvurechensky, Pavel
Ostroukhov, Petr
Gasnikov, Alexander
Uribe, Cesar A.
Ivanova, Anastasiya
OPTIMIZATION METHODS & SOFTWARE, 2024, 39 (05): : 1068 - 1103
[33] ACCELERATED REGULARIZED NEWTON METHODS FOR MINIMIZING COMPOSITE CONVEX FUNCTIONS
Grapiglia, Geovani N.
Nesterov, Yurii
SIAM JOURNAL ON OPTIMIZATION, 2019, 29 (01) : 77 - 99
[34] On the Gradient Projection Method for Weakly Convex Functions on a Proximally Smooth Set
Balashov, M., V
MATHEMATICAL NOTES, 2020, 108 (5-6) : 643 - 651
[35] On the Gradient Projection Method for Weakly Convex Functions on a Proximally Smooth Set
M. V. Balashov
Mathematical Notes, 2020, 108 : 643 - 651
[36] Smooth convex extensions of convex functions
Azagra, Daniel
Mudarra, Carlos
CALCULUS OF VARIATIONS AND PARTIAL DIFFERENTIAL EQUATIONS, 2019, 58 (03)
[37] Smooth convex extensions of convex functions
Daniel Azagra
Carlos Mudarra
Calculus of Variations and Partial Differential Equations, 2019, 58
[38] Strongly hyperbolically convex functions
Cruz, Lorena
Mejia, Diego
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2007, 335 (02) : 1403 - 1415
[39] Remarks on strongly convex functions
Merentes, Nelson
Nikodem, Kazimierz
AEQUATIONES MATHEMATICAE, 2010, 80 (1-2) : 193 - 199
[40] Remarks on strongly convex functions
Nelson Merentes
Kazimierz Nikodem
Aequationes mathematicae, 2010, 80 : 193 - 199

← 1 2 3 4 5 →