Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization

被引:0
|
作者
Chen Z. [1 ]
Mou S. [1 ]
Maguluri S.T. [1 ]
机构
[1] Georgia Institute of Technology, Atlanta, GA
来源
Performance Evaluation Review | 2022年 / 50卷 / 01期
基金
美国国家科学基金会;
关键词
asymptotic analysis; stationary distribution; stochastic approximation; stochastic gradient descent;
D O I
10.1145/3547353.3522659
中图分类号
学科分类号
摘要
Stochastic approximation (SA) and stochastic gradient descent (SGD) algorithms are work-horses for modern machine learning algorithms. Their constant stepsize variants are preferred in practice due to fast convergence behavior. However, constant stepsize SA algorithms do not converge to the optimal solution, but instead have a stationary distribution, which in general cannot be analytically characterized. In this work, we study the asymptotic behavior of the appropriately scaled stationary distribution, in the limit when the constant stepsize goes to zero. Specifically, we consider the following three settings: (1) SGD algorithm with a smooth and strongly convex objective, (2) linear SA algorithm involving a Hurwitz matrix, and (3) nonlinear SA algorithm involving a contractive operator. When the iterate is scaled by 1/√α, where α is the constant stepsize, we show that the limiting scaled stationary distribution is a solution of an implicit equation. Under a uniqueness assumption (which can be removed in certain settings) on this equation, we further characterize the limiting distribution as a Gaussian distribution whose covariance matrix is the unique solution of an appropriate Lyapunov equation. For SA algorithms beyond these cases, our numerical experiments suggest that unlike central limit theorem type results: (1) the scaling factor need not be 1/√α, and (2) the limiting distribution need not be Gaussian. Based on the numerical study, we come up with a heuristic formula to determine the right scaling factor, and make a connection to the Euler-Maruyama discretization scheme for approximating stochastic differential equations. © 2022 Copyright held by the owner/author(s).
引用
收藏
页码:109 / 110
页数:1
相关论文
共 50 条
  • [11] Asymptotic behavior of a stationary silo with absorbing walls
    Barros, SRM
    Ferrari, PA
    Garcia, NL
    Martínez, S
    JOURNAL OF STATISTICAL PHYSICS, 2002, 106 (3-4) : 521 - 546
  • [12] Asymptotic behavior of the prediction error for stationary sequences
    Babayan, Nikolay M.
    Ginovyan, Mamikon S.
    PROBABILITY SURVEYS, 2023, 20 : 664 - 721
  • [13] Asymptotic behavior of stationary nonadiabatic combustion wave
    Khudyaev, S.I.
    Soviet journal of chemical physics, 1992, 10 (06): : 1297 - 1312
  • [14] Asymptotic Behavior of a Stationary Silo with Absorbing Walls
    Saulo R. M. Barros
    Pablo A. Ferrari
    Nancy L. Garcia
    Servet Martínez
    Journal of Statistical Physics, 2002, 106 : 521 - 546
  • [15] Asymptotic Behavior of a Bresse System Due to Thermoelasticity of Type III with Constant Delay Feedback
    Mpungu, Kassimu
    Apalara, Tijani A.
    Nass, Aminu M.
    DIFFERENTIAL EQUATIONS AND DYNAMICAL SYSTEMS, 2025,
  • [16] Weak stationary solution of a G/G/1/∞ queue controlled by IPA-based SA with constant stepsize
    Miyoshi, N
    PROCEEDINGS OF THE 37TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 1998, : 1716 - 1721
  • [17] Asymptotic behavior of the stationary magnetohydrodynamic equations in an exterior domain
    Fan, Huiying
    Wang, Meng
    JOURNAL OF MATHEMATICAL PHYSICS, 2021, 62 (11)
  • [18] Asymptotic behavior of a stationary combustion wave in a gas mixture
    Kholopov, VM
    Khudyaev, SI
    CHEMICAL PHYSICS REPORTS, 1997, 16 (09): : 1539 - 1549
  • [19] Asymptotic behavior of the convex hull of a stationary Gaussian process
    Davydov, Youri
    Dombry, Clement
    LITHUANIAN MATHEMATICAL JOURNAL, 2012, 52 (04) : 363 - 368
  • [20] ON THE ASYMPTOTIC-BEHAVIOR OF GAUSSIAN SEQUENCES WITH STATIONARY INCREMENTS
    CHOI, YK
    KONO, N
    JOURNAL OF MATHEMATICS OF KYOTO UNIVERSITY, 1991, 31 (03): : 643 - 678