A momentum accelerated stochastic method and its application on policy search problems

被引:0
|
作者
Boou Jiang [1 ]
Ya-xiang Yuan [2 ]
机构
[1] LSEC,
[2] ICMSEC,undefined
[3] AMSS,undefined
[4] Chniese Academy of Sciences,undefined
[5] University of Chinese Academy of Sciences,undefined
关键词
Stochastic algorithm; Non-convex optimization; Reinforcement learning; 65K05; 90C15; 90C25; 90C30;
D O I
10.1007/s00521-024-10883-y
中图分类号
学科分类号
摘要
With the dramatic increase in model complexity and problem scales in the machine learning area, researches on the first-order stochastic methods and its accelerated variants for non-convex problems have attracted wide research interest. However, most works on convergence analysis of accelerated methods focus on general convex or strongly convex objective functions. In this paper, we consider an accelerated scheme coming from dynamic systems and ordinary differential equations, which has a simpler and more direct form than the traditional scheme. We construct auxiliary sequences of iteration points as analysis tools, which can be interpreted as extension of Nesterov’s estimate sequence in non-convex case. We analyze the convergence property under different cases when momentum parameters are fixed or varying over iterations. For non-smooth and general convex objective functions, we give a relaxed step-size requirement to ensure convergence. For the non-convex policy search problem in classical reinforcement learning, we propose an accelerated stochastic policy gradient method with restart technique and construct numerical experiments to verify its effectiveness.
引用
收藏
页码:5957 / 5973
页数:16
相关论文
共 50 条
  • [21] THE EXPERIENTIAL METHOD: ITS APPLICATION IN ACCELERATED AGING ON SEED GERMINATION
    Lopez, Jose Homero Vargas
    REVISTA UNIVERSIDAD Y SOCIEDAD, 2023, 15 (02): : 326 - 335
  • [22] A Stochastic Approximation Method and Its Application to Confidence Intervals
    Garthwaite, Paul H.
    Jones, M. C.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2009, 18 (01) : 184 - 200
  • [23] An application of Stochastic heuristic search method to edge extraction in noisy images
    Han, JW
    Guo, L
    IMAGE EXTRACTION, SEGMENTATION, AND RECOGNITION, 2001, 4550 : 57 - 62
  • [24] Evolutionary optimality in stochastic search problems
    Preston, Mark D.
    Pitchford, Jonathan W.
    Wood, A. Jamie
    JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2010, 7 (50) : 1301 - 1310
  • [25] Decomposition method and its application to the extremal problems
    Gorecki, Henryk
    Zaczyk, Mieczyslaw
    ARCHIVES OF CONTROL SCIENCES, 2016, 26 (01): : 49 - 67
  • [26] FORCE METHOD AND ITS APPLICATION IN PLASTICITY PROBLEMS
    BESSELING, JF
    COMPUTERS & STRUCTURES, 1978, 8 (3-4) : 323 - 330
  • [27] Quasilinearization method and its application to physical problems
    Mandelzweig, VB
    FEW BODY PROBLEMS IN PHYSICS '02, 2003, 14 : 185 - 190
  • [28] ON TREFFTZS METHOD AND ITS APPLICATION TO EIGENVALUE PROBLEMS
    GOERISCH, F
    ZIMMERMANN, S
    ZEITSCHRIFT FUR ANGEWANDTE MATHEMATIK UND MECHANIK, 1986, 66 (05): : T304 - T306
  • [29] An accelerated first-order regularized momentum descent ascent algorithm for stochastic nonconvex-concave minimax problems
    Zhang, Huiling
    Xu, Zi
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2025, 90 (02) : 557 - 582
  • [30] An Accelerated Stochastic Mirror Descent Method
    Jiang, Bo-Ou
    Yuan, Ya-Xiang
    JOURNAL OF THE OPERATIONS RESEARCH SOCIETY OF CHINA, 2024, 12 (03) : 549 - 571