Noise tolerance in reinforcement learning algorithms

Cited by: 3

Authors:

Ribeiro, Richardson [1]

Koerich, Alessandro L. [1]

Enembreck, Fabricio [1]

Affiliations:

[1] Pontif Cathol Univ Parana, Grad Program Comp Sci PPGIa, BR-80215901 Curitiba, Parana, Brazil

Source:

PROCEEDINGS OF THE IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY (IAT 2007) | 2007

Keywords:

adaptive autonomous agents; reinforcement learning and noise tolerant learning

DOI:

10.1109/IAT.2007.94

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology]

Discipline classification code:

0812

Abstract:

This paper proposes a noise tolerance mechanism for reinforcement learning algorithms. An adaptive agent that employs reinforcement learning may receive and accumulate many rewards for its actions. However, the amount of reward received by the agent does not guarantee convergence to an optimal action policy, because of the noise produced by the environment. Therefore, we propose a noise tolerance mechanism that is able to estimate convergent policies without causing delays or an unexpected speedup in the agent's learning. Experimental results show that the proposed mechanism speeds up the agent's convergence, achieving good action policies quickly even in dynamic and noisy environments.

Citation

Pages: 265-268

Number of pages: 4
