Noise tolerance in reinforcement learning algorithms

Cited by: 3
Authors
Ribeiro, Richardson [1]
Koerich, Alessandro L. [1]
Enembreck, Fabricio [1]
Affiliations
[1] Pontif Cathol Univ Parana, Grad Program Comp Sci PPGIa, BR-80215901 Curitiba, Parana, Brazil
Keywords
adaptive autonomous agents; reinforcement learning and noise tolerant learning
DOI
10.1109/IAT.2007.94
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
This paper proposes a noise tolerance mechanism for reinforcement learning algorithms. An adaptive agent that employs reinforcement learning algorithms may receive and accumulate many rewards for its actions. However, the amount of reward received by the agent does not guarantee convergence to an optimal action policy, because of the noise produced by the environment. Therefore, we propose a noise tolerance mechanism that is able to estimate convergent policies without causing delays or an unexpected speedup in the agent's learning. Experimental results show that the proposed mechanism speeds up the agent's convergence, reaching good action policies quickly even in dynamic and noisy environments.
Pages: 265-268
Page count: 4