A momentum accelerated stochastic method and its application on policy search problems

Cited by: 0

Authors:

Boou Jiang ^{[1]}

Ya-xiang Yuan ^{[2]}

Affiliations:

[1] LSEC

[2] ICMSEC

[3] AMSS

[4] Chinese Academy of Sciences

[5] University of Chinese Academy of Sciences

Source:

Neural Computing and Applications | 2025 / Vol. 37 / Issue 8

Keywords:

Stochastic algorithm; Non-convex optimization; Reinforcement learning; 65K05; 90C15; 90C25; 90C30

DOI:

10.1007/s00521-024-10883-y

Chinese Library Classification number:

Subject classification number:

Abstract:

With the dramatic increase in model complexity and problem scale in machine learning, research on first-order stochastic methods and their accelerated variants for non-convex problems has attracted wide interest. However, most convergence analyses of accelerated methods focus on general convex or strongly convex objective functions. In this paper, we consider an accelerated scheme derived from dynamical systems and ordinary differential equations, which has a simpler and more direct form than the traditional scheme. We construct auxiliary sequences of iteration points as analysis tools, which can be interpreted as an extension of Nesterov's estimate sequence to the non-convex case. We analyze the convergence properties both when the momentum parameters are fixed and when they vary over iterations. For non-smooth and general convex objective functions, we give a relaxed step-size requirement to ensure convergence. For the non-convex policy search problem in classical reinforcement learning, we propose an accelerated stochastic policy gradient method with a restart technique and conduct numerical experiments to verify its effectiveness.
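For readers unfamiliar with the ingredients named in the abstract, the following is a minimal sketch of a generic heavy-ball (momentum) stochastic gradient iteration with a function-value restart. It is not the scheme analyzed in the paper, which this record does not specify. The function name `momentum_sgd_with_restart`, the parameters `step` and `beta`, the restart rule, and the toy noisy quadratic are all assumptions chosen for illustration.

```python
import numpy as np


def momentum_sgd_with_restart(grad, loss, x0, step=0.05, beta=0.9,
                              iters=200, seed=0):
    """Generic heavy-ball SGD with a function-value restart.

    Illustration only (hypothetical names and defaults); this is not
    the accelerated scheme or restart rule from the paper.
    """
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float)
    v = np.zeros_like(x)              # momentum (velocity) buffer
    prev = loss(x)
    for _ in range(iters):
        g = grad(x, rng)              # stochastic gradient estimate
        v = beta * v - step * g       # heavy-ball velocity update
        x = x + v                     # move along the velocity
        cur = loss(x)
        if cur > prev:                # objective went up: restart,
            v = np.zeros_like(x)      # i.e. discard accumulated momentum
        prev = cur
    return x


# Toy problem (assumed): f(x) = 0.5 * ||x||^2 with noisy gradients.
def toy_loss(x):
    return 0.5 * float(x @ x)


def toy_grad(x, rng):
    return x + 0.01 * rng.standard_normal(x.shape)


x_final = momentum_sgd_with_restart(toy_grad, toy_loss, [5.0, -3.0])
print(toy_loss(x_final))
```

The restart here simply zeroes the velocity whenever the objective increases between consecutive iterates, which is one common heuristic for damping momentum-induced oscillation. In a policy search setting, `grad` would be replaced by a policy gradient estimator and `loss` by a negative return estimate, both of which are outside the scope of this sketch.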


Pages: 5957 - 5973

Number of pages: 16

50 records in total

[21] THE EXPERIENTIAL METHOD: ITS APPLICATION IN ACCELERATED AGING ON SEED GERMINATION
Lopez, Jose Homero Vargas
REVISTA UNIVERSIDAD Y SOCIEDAD, 2023, 15 (02) : 326 - 335
[22] A Stochastic Approximation Method and Its Application to Confidence Intervals
Garthwaite, Paul H.
Jones, M. C.
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2009, 18 (01) : 184 - 200
[23] An application of Stochastic heuristic search method to edge extraction in noisy images
Han, JW
Guo, L
IMAGE EXTRACTION, SEGMENTATION, AND RECOGNITION, 2001, 4550 : 57 - 62
[24] Evolutionary optimality in stochastic search problems
Preston, Mark D.
Pitchford, Jonathan W.
Wood, A. Jamie
JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2010, 7 (50) : 1301 - 1310
[25] Decomposition method and its application to the extremal problems
Gorecki, Henryk
Zaczyk, Mieczyslaw
ARCHIVES OF CONTROL SCIENCES, 2016, 26 (01) : 49 - 67
[26] FORCE METHOD AND ITS APPLICATION IN PLASTICITY PROBLEMS
BESSELING, JF
COMPUTERS & STRUCTURES, 1978, 8 (3-4) : 323 - 330
[27] Quasilinearization method and its application to physical problems
Mandelzweig, VB
FEW BODY PROBLEMS IN PHYSICS '02, 2003, 14 : 185 - 190
[28] ON TREFFTZS METHOD AND ITS APPLICATION TO EIGENVALUE PROBLEMS
GOERISCH, F
ZIMMERMANN, S
ZEITSCHRIFT FUR ANGEWANDTE MATHEMATIK UND MECHANIK, 1986, 66 (05) : T304 - T306
[29] An accelerated first-order regularized momentum descent ascent algorithm for stochastic nonconvex-concave minimax problems
Zhang, Huiling
Xu, Zi
COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2025, 90 (02) : 557 - 582
[30] An Accelerated Stochastic Mirror Descent Method
Jiang, Bo-Ou
Yuan, Ya-Xiang
JOURNAL OF THE OPERATIONS RESEARCH SOCIETY OF CHINA, 2024, 12 (03) : 549 - 571
