On Convergence Rate of MRetrace

Cited by: 0
Authors
Chen, Xingguo [1 ]
Qin, Wangrong [1 ]
Gong, Yu [1 ]
Yang, Shangdong [1 ]
Wang, Wenhao [2 ,3 ]
Affiliations
[1] Nanjing Univ Posts & Telecommun, Jiangsu Key Lab Big Data Secur & Intelligent Proc, Nanjing 210023, Peoples R China
[2] Natl Univ Def Technol, Coll Elect Engn, Changsha 410073, Peoples R China
[3] Natl Univ Def Technol, Sci & Technol Informat Syst Engn Lab, Changsha 410073, Peoples R China
Keywords
finite sample analysis; off-policy learning; minimum eigenvalues; MRetrace;
DOI
10.3390/math12182930
Chinese Library Classification
O1 [Mathematics];
Discipline Code
0701; 070101;
Abstract
Off-policy learning is a key setting for reinforcement learning algorithms. In recent years, the stability of off-policy value-based reinforcement learning has been guaranteed even when combined with linear function approximation and bootstrapping. Convergence rate analysis is currently an active research topic; however, convergence rates differ across learning algorithms, and explaining why remains an open problem. In this paper, we propose an essentially simplified form of the convergence rate for general off-policy temporal difference learning algorithms. We show that the primary determinant of the convergence rate is the minimum eigenvalue of the key matrix. Furthermore, we conduct a comparative analysis of this influencing factor across various off-policy learning algorithms in diverse numerical scenarios. The experimental findings validate the proposed determinant, which serves as a benchmark for the design of more efficient learning algorithms.
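To make the abstract's central quantity concrete, the following is a minimal sketch (not taken from the paper) of how the minimum eigenvalue of a key matrix can be computed for linear off-policy TD methods, assuming the standard form A = Phi^T D_mu (I - gamma * P_pi) Phi; the toy MDP, features, and distributions below are illustrative assumptions only.

# Sketch: minimum eigenvalue of the key matrix for linear off-policy TD.
# Assumes the standard TD key matrix A = Phi^T D (I - gamma * P_pi) Phi;
# all numerical values here are illustrative, not from the paper.
import numpy as np

gamma = 0.9
n_states, n_features = 4, 2
rng = np.random.default_rng(0)

# Target-policy transition matrix P_pi (rows sum to 1) -- assumed toy values.
P_pi = rng.dirichlet(np.ones(n_states), size=n_states)

# Behavior-policy state distribution d_mu -- assumed toy values.
d_mu = rng.dirichlet(np.ones(n_states))
D = np.diag(d_mu)

# Linear feature matrix Phi (one row of features per state) -- assumed toy values.
Phi = rng.standard_normal((n_states, n_features))

# Key matrix A = Phi^T D (I - gamma * P_pi) Phi.
A = Phi.T @ D @ (np.eye(n_states) - gamma * P_pi) @ Phi

# The abstract identifies the minimum eigenvalue (here, of the symmetric part
# of A) as the primary determinant of the convergence rate.
sym = 0.5 * (A + A.T)
lambda_min = np.linalg.eigvalsh(sym).min()
print("min eigenvalue of symmetric part of A:", lambda_min)

Comparing this eigenvalue across the key matrices induced by different off-policy algorithms (e.g., MRetrace versus other TD variants) is, in spirit, the kind of comparative analysis the abstract describes.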
Pages: 19