Variance-Reduced Accelerated First-Order Methods: Central Limit Theorems and Confidence Statements

Cited by: 0

Authors:

Lei, Jinlong [1, 2]

Shanbhag, Uday V. [3]

Affiliations:

[1] Tongji Univ, Dept Control Sci & Engn, Shanghai 201804, Peoples R China

[2] Tongji Univ, Shanghai Res Inst Intelligent Autonomous Syst, Shanghai 201804, Peoples R China

[3] Penn State Univ, Dept Ind & Mfg Engn, University Pk, PA 16802 USA

Source:

MATHEMATICS OF OPERATIONS RESEARCH | 2024

Funding:

National Natural Science Foundation of China;

Keywords:

stochastic optimization; variance-reduced schemes; central limit theorems; confidence intervals; STOCHASTIC-APPROXIMATION; ASYMPTOTIC NORMALITY; GRADIENT; OPTIMIZATION; PARAMETERS;

DOI:

10.1287/moor.2021.0068

Chinese Library Classification (CLC) number:

C93 [Management Science]; O22 [Operations Research];

Discipline classification code:

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

Abstract:

In this paper, we consider a strongly convex stochastic optimization problem and propose three classes of variable sample-size stochastic first-order methods: (i) the standard stochastic gradient descent method, (ii) its accelerated variant, and (iii) the stochastic heavy-ball method. In each scheme, the exact gradients are approximated by averaging across an increasing batch size of sampled gradients. We prove that when the sample size increases at a geometric rate, the generated estimates converge in mean to the optimal solution at an analogous geometric rate for schemes (i)-(iii). Based on this result, we provide central limit statements, whereby it is shown that the rescaled estimation errors converge in distribution to a normal distribution with the associated covariance matrix dependent on the Hessian matrix, the covariance of the gradient noise, and the step length. If the sample size increases at a polynomial rate, we show that the estimation errors decay at a corresponding polynomial rate and establish the associated central limit theorems (CLTs). Under certain conditions, we discuss how both the algorithms and the associated limit theorems may be extended to constrained and nonsmooth regimes. Finally, we provide an avenue to construct confidence regions for the optimal solution based on the established CLTs and test the theoretical findings on a stochastic parameter estimation problem.
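As a rough illustration of scheme (i) in the abstract, the following minimal Python sketch runs stochastic gradient descent in which each step averages a geometrically growing batch of sampled gradients. It is not the authors' implementation: the function name `vs_sgd`, the parameters `n0` and `growth`, the batch rule `ceil(n0 * growth**k)`, the constant step size, and the toy quadratic objective are all assumptions chosen for illustration.

```python
import numpy as np


def vs_sgd(grad_sample, x0, step, n_iters, n0=1, growth=1.2, seed=0):
    """Variable sample-size SGD sketch (hypothetical helper, not from the paper).

    grad_sample(x, n, rng) must return an (n, dim) array of sampled gradients at x.
    """
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float).copy()
    batch_sizes = []
    for k in range(n_iters):
        # Assumed geometric batch-size rule: N_k = ceil(n0 * growth**k).
        n_k = int(np.ceil(n0 * growth ** k))
        grads = grad_sample(x, n_k, rng)
        # Approximate the exact gradient by the batch average, then take a step.
        x = x - step * grads.mean(axis=0)
        batch_sizes.append(n_k)
    return x, batch_sizes


# Toy strongly convex problem (assumption): f(x) = 0.5 * E||x - xi||^2,
# with xi ~ N(x_star, I), so a sampled gradient is x - xi and the minimizer is x_star.
x_star = np.array([1.0, -2.0])


def noisy_grad(x, n, rng):
    xi = x_star + rng.standard_normal((n, x.size))
    return x - xi


x_hat, sizes = vs_sgd(noisy_grad, x0=np.zeros(2), step=0.5, n_iters=40)
print("estimate:", x_hat)
print("final batch size:", sizes[-1])
```

Because the batch grows geometrically, the variance of the averaged gradient shrinks at a matching rate, which is the mechanism behind the geometric convergence the abstract describes. The accelerated and heavy-ball variants would add a momentum term to the update and are not shown here.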


Pages: 35

