ON UNIFORM-IN-TIME DIFFUSION APPROXIMATION FOR STOCHASTIC GRADIENT DESCENT

被引:0
|
作者
Li, Lei [1 ,2 ,3 ]
Wang, Yuliang [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Math Sci, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst Nat Sci, MOE LSC, Shanghai 200240, Peoples R China
[3] Shanghai Jiao Tong Univ, Qing Yuan Res Inst, Shanghai 200240, Peoples R China
基金
国家重点研发计划;
关键词
stochastic differential equation; backward Kolmogorov equation; Stroock-Varadhan support theorem; semigroup expansion; SYSTEMS;
D O I
暂无
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
The diffusion approximation of stochastic gradient descent (SGD) in current literature is only valid on a finite time interval. In this paper, we establish the uniform-in-time diffusion approximation of SGD, by only assuming that the expected loss is strongly convex and some other mild conditions, without assuming the convexity of each random loss function. The main technique is to establish the exponential decay rates of the derivatives of the solution to the backward Kolmogorov equation. The uniform-in-time approximation allows us to study asymptotic behaviors of SGD via the continuous stochastic differential equation (SDE) even when the random objective function f(<middle dot>; xi ) is not strongly convex.
引用
收藏
页码:95 / 112
页数:18
相关论文
共 50 条
  • [1] UNIFORM-IN-TIME WEAK ERROR ANALYSIS FOR STOCHASTIC GRADIENT DESCENT ALGORITHMS VIA DIFFUSION APPROXIMATION
    Feng, Yuanyuan
    Gao, Tingran
    Li, Lei
    Liu, Jian-Guo
    Lu, Yulong
    [J]. COMMUNICATIONS IN MATHEMATICAL SCIENCES, 2020, 18 (01) : 163 - 188
  • [2] Uniform-in-Time Wasserstein Stability Bounds for (Noisy) Stochastic Gradient Descent
    Zhu, Lingjiong
    Gurbuzbalaban, Mert
    Raj, Anant
    Simsekli, Umut
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [3] On the diffusion approximation of nonconvex stochastic gradient descent
    Hu, Wenqing
    Li, Chris Junchi
    Li, Lei
    Liu, Jian-Guo
    [J]. ANNALS OF MATHEMATICAL SCIENCES AND APPLICATIONS, 2019, 4 (01) : 3 - 32
  • [4] Uniform-in-time bounds for approximate solutions of the drift–diffusion system
    M. Bessemoulin-Chatard
    C. Chainais-Hillairet
    [J]. Numerische Mathematik, 2019, 141 : 881 - 916
  • [5] Stochastic Gradient Descent in Continuous Time
    Sirignano, Justin
    Spiliopoulos, Konstantinos
    [J]. SIAM JOURNAL ON FINANCIAL MATHEMATICS, 2017, 8 (01): : 933 - 961
  • [6] Uniform-in-time bounds for approximate solutions of the drift-diffusion system
    Bessemoulin-Chatard, M.
    Chainais-Hillairet, C.
    [J]. NUMERISCHE MATHEMATIK, 2019, 141 (04) : 881 - 916
  • [7] Convergence behavior of diffusion stochastic gradient descent algorithm
    Barani, Fatemeh
    Savadi, Abdorreza
    Yazdi, Hadi Sadoghi
    [J]. SIGNAL PROCESSING, 2021, 183
  • [8] Analysis of stochastic gradient descent in continuous time
    Jonas Latz
    [J]. Statistics and Computing, 2021, 31
  • [9] Analysis of stochastic gradient descent in continuous time
    Latz, Jonas
    [J]. STATISTICS AND COMPUTING, 2021, 31 (04)
  • [10] CONTROLLING STOCHASTIC GRADIENT DESCENT USING STOCHASTIC APPROXIMATION FOR ROBUST DISTRIBUTED OPTIMIZATION
    Jain, Adit
    Krishnamurthy, Vikram
    [J]. NUMERICAL ALGEBRA CONTROL AND OPTIMIZATION, 2024,