On Convergence Rate of MRetrace

Cited by: 0

Authors:

Chen, Xingguo [1]

Qin, Wangrong [1]

Gong, Yu [1]

Yang, Shangdong [1]

Wang, Wenhao [2,3]

Affiliations:

[1] Nanjing Univ Posts & Telecommun, Jiangsu Key Lab Big Data Secur & Intelligent Proc, Nanjing 210023, Peoples R China

[2] Natl Univ Def Technol, Coll Elect Engn, Changsha 410073, Peoples R China

[3] Natl Univ Def Technol, Sci & Technol Informat Syst Engn Lab, Changsha 410073, Peoples R China

Source:

MATHEMATICS | 2024 / Vol. 12 / Issue 18

Keywords:

finite sample analysis; off-policy learning; minimum eigenvalues; MRetrace;

DOI:

10.3390/math12182930

Chinese Library Classification:

O1 [Mathematics];

Subject classification codes:

0701 ; 070101 ;

Abstract:

Off-policy is a key setting for reinforcement learning algorithms. In recent years, the stability of off-policy learning for value-based reinforcement learning has been guaranteed even when combined with linear function approximation and bootstrapping. Convergence rate analysis is currently a hot topic. However, the convergence rates of learning algorithms vary, and analyzing the reasons behind this remains an open problem. In this paper, we propose an essentially simplified version of a convergence rate to generate general off-policy temporal difference learning algorithms. We emphasize that the primary determinant influencing convergence rate is the minimum eigenvalue of the key matrix. Furthermore, we conduct a comparative analysis of the influencing factor across various off-policy learning algorithms in diverse numerical scenarios. The experimental findings validate the proposed determinant, which serves as a benchmark for the design of more efficient learning algorithms.
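The role of the minimum eigenvalue can be illustrated with a small numerical sketch. The abstract does not define the key matrix, so the code below assumes the standard key matrix of linear off-policy TD(0), A = Phi^T D (I - gamma * P_pi) Phi, and uses the smallest eigenvalue of its symmetric part as the quantity of interest. The two-state example (features 1 and 2, target policy always moving to the second state) and the names `key_matrix` and `min_eigenvalue` are illustrative assumptions, not taken from the paper, and this is not the MRetrace key matrix.

```python
import numpy as np


def key_matrix(phi, d, p_pi, gamma):
    """Assumed TD(0) key matrix A = Phi^T D (I - gamma * P_pi) Phi."""
    n = p_pi.shape[0]
    return phi.T @ np.diag(d) @ (np.eye(n) - gamma * p_pi) @ phi


def min_eigenvalue(a):
    """Smallest eigenvalue of the symmetric part (A + A^T) / 2."""
    sym = 0.5 * (a + a.T)
    # eigvalsh returns eigenvalues in ascending order.
    return float(np.linalg.eigvalsh(sym)[0])


# Hypothetical two-state example: one feature per state.
phi = np.array([[1.0], [2.0]])
# Target policy: always transition to the second state.
p_pi = np.array([[0.0, 1.0], [0.0, 1.0]])
gamma = 0.9

# Off-policy: states weighted by a uniform behavior distribution.
d_off = np.array([0.5, 0.5])
# On-policy: states weighted by the stationary distribution of p_pi.
d_on = np.array([0.0, 1.0])

lam_off = min_eigenvalue(key_matrix(phi, d_off, p_pi, gamma))
lam_on = min_eigenvalue(key_matrix(phi, d_on, p_pi, gamma))

print(f"off-policy minimum eigenvalue: {lam_off:.3f}")
print(f"on-policy minimum eigenvalue:  {lam_on:.3f}")
```

Under these assumptions the off-policy weighting gives a negative minimum eigenvalue, so plain TD(0) can diverge, while the on-policy weighting gives a positive one. Comparing this quantity across algorithms is the kind of analysis the abstract describes.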

Pages: 19
