Variance-Reduced Accelerated First-Order Methods: Central Limit Theorems and Confidence Statements

被引:0
|
作者
Lei, Jinlong [1 ,2 ]
Shanbhag, Uday V. [3 ]
机构
[1] Tongji Univ, Dept Control Sci & Engn, Shanghai 201804, Peoples R China
[2] Tongji Univ, Shanghai Res Inst Intelligent Autonomous Syst, Shanghai 201804, Peoples R China
[3] Penn State Univ, Dept Ind & Mfg Engn, University Pk, PA 16802 USA
基金
中国国家自然科学基金;
关键词
stochastic optimization; variance-reduced schemes; central limit theorems; confidence intervals; STOCHASTIC-APPROXIMATION; ASYMPTOTIC NORMALITY; GRADIENT; OPTIMIZATION; PARAMETERS;
D O I
10.1287/moor.2021.0068
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
In this paper, we consider a strongly convex stochastic optimization problem and propose three classes of variable sample -size stochastic first -order methods: (i) the standard stochastic gradient descent method, (ii) its accelerated variant, and (iii) the stochastic heavy -ball method. In each scheme, the exact gradients are approximated by averaging across an increasing batch size of sampled gradients. We prove that when the sample size increases at a geometric rate, the generated estimates converge in mean to the optimal solution at an analogous geometric rate for schemes (i)-(iii). Based on this result, we provide central limit statements, whereby it is shown that the rescaled estimation errors converge in distribution to a normal distribution with the associated covariance matrix dependent on the Hessian matrix, the covariance of the gradient noise, and the step length. If the sample size increases at a polynomial rate, we show that the estimation errors decay at a corresponding polynomial rate and establish the associated central limit theorems (CLTs). Under certain conditions, we discuss how both the algorithms and the associated limit theorems may be extended to constrained and nonsmooth regimes. Finally, we provide an avenue to construct confidence regions for the optimal solution based on the established CLTs and test the theoretical findings on a stochastic parameter estimation problem.
引用
收藏
页数:35
相关论文
共 27 条