Strong error analysis for stochastic gradient descent optimization algorithms

Citations: 13
Authors
Jentzen, Arnulf [1]
Kuckuck, Benno [1]
Neufeld, Ariel [2]
von Wurstemberger, Philippe [3]
Affiliations
[1] Univ Munster, Fac Math & Comp Sci, D-48149 Munster, Germany
[2] NTU Singapore, Div Math Sci, Singapore 637371, Singapore
[3] Swiss Fed Inst Technol, Dept Math, CH-8092 Zurich, Switzerland
Funding
Swiss National Science Foundation;
Keywords
Stochastic gradient descent; Stochastic approximation algorithms; Strong error analysis; CONVERGENCE RATE; ROBBINS-MONRO; APPROXIMATION; MOMENTS; RATES;
DOI
10.1093/imanum/drz055
Chinese Library Classification
O29 [Applied Mathematics];
Discipline classification code
070104;
Abstract
Stochastic gradient descent (SGD) optimization algorithms are key ingredients in a series of machine learning applications. In this article we perform a rigorous strong error analysis for SGD optimization algorithms. In particular, we prove for every arbitrarily small $\varepsilon \in (0,\infty)$ and every arbitrarily large $p \in (0,\infty)$ that the considered SGD optimization algorithm converges in the strong $L^p$-sense with order $1/2 - \varepsilon$ to the global minimum of the objective function of the considered stochastic optimization problem under standard convexity-type assumptions on the objective function and relaxed assumptions on the moments of the stochastic errors appearing in the employed SGD optimization algorithm. The key ideas in our convergence proof are, first, to employ techniques from the theory of Lyapunov-type functions for dynamical systems to develop a general convergence machinery for SGD optimization algorithms based on such functions, then, to apply this general machinery to concrete Lyapunov-type functions with polynomial structures and, thereafter, to perform an induction argument along the powers appearing in the Lyapunov-type functions in order to achieve for every arbitrarily large $p \in (0,\infty)$ strong $L^p$-convergence rates.
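As a rough illustration of what a strong $L^p$ error of SGD means, the sketch below runs SGD on a toy one-dimensional problem and estimates the error by Monte Carlo. Everything in it is an assumption made for illustration and is not taken from the article: the objective $f(\theta) = \tfrac{1}{2}\mathbb{E}[(\theta - X)^2]$ with $X \sim N(\mu, 1)$, the learning rates $\gamma_n = c / n^{\nu}$, and the function names `sgd_quadratic` and `strong_lp_error`.

```python
import math
import random


def sgd_quadratic(num_steps, mean, rng, c=1.0, nu=1.0, theta0=0.0):
    """Run SGD on the toy objective f(theta) = E[(theta - X)^2] / 2.

    X ~ N(mean, 1), so the global minimum is at theta = mean.
    The learning rates gamma_n = c / n**nu are an illustrative choice.
    """
    theta = theta0
    for n in range(1, num_steps + 1):
        sample = rng.gauss(mean, 1.0)      # one noisy observation X_n
        gamma = c / n ** nu                # decaying learning rate
        theta -= gamma * (theta - sample)  # stochastic gradient step
    return theta


def strong_lp_error(num_steps, p, num_runs=2000, mean=1.0, seed=0):
    """Monte Carlo estimate of (E|theta_N - mean|^p)^(1/p)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(num_runs):
        theta = sgd_quadratic(num_steps, mean, rng)
        total += abs(theta - mean) ** p
    return (total / num_runs) ** (1.0 / p)


def empirical_rate(n_small, n_large, p):
    """Observed convergence order between two step counts (log-log slope)."""
    err_small = strong_lp_error(n_small, p)
    err_large = strong_lp_error(n_large, p)
    return math.log(err_small / err_large) / math.log(n_large / n_small)


if __name__ == "__main__":
    for steps in (100, 400, 1600):
        print(steps, strong_lp_error(steps, p=4))
    print("observed order:", empirical_rate(100, 400, p=4))
```

With the default `c=1.0` and `nu=1.0` the iterate after $n$ steps is simply the sample mean of the first $n$ observations, so in this toy case the observed order should sit close to $1/2$. The example only illustrates the quantity being measured and does not reproduce the article's assumptions or proof technique.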
Pages: 455-492
Number of pages: 38