Noise tolerance in reinforcement learning algorithms

被引:3
|
作者
Ribeiro, Richardson [1 ]
Koerich, Alessandro L. [1 ]
Enembreck, Fabricio [1 ]
机构
[1] Pontif Cathol Univ Parana, Grad Program Comp Sci PPGIa, BR-80215901 Curitiba, Parana, Brazil
关键词
adaptive autonomous agents; reinforcement learning and noise tolerant learning;
D O I
10.1109/IAT.2007.94
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a mechanism of noise tolerance for reinforcement learning algorithms. An adaptive agent that employs reinforcement learning algorithms may receive and accumulate many rewards for its actions. However, the amount of rewards received by the agent is not a guarantee Of convergence to an optimal policy of action due to the noises produced by the environment. Therefore, we propose a noise tolerance mechanism which is able to estimate convergent policies without causing delays or an unexpected speedup in the agent's learning. Experimental results have shown that the proposed mechanism is able to speed up the convergence of the agent achieving good action policies very fast even in dynamic and noisy environments.
引用
收藏
页码:265 / 268
页数:4
相关论文
共 50 条
  • [21] Formalizing the ant algorithms in terms of reinforcement learning
    Nowé, A
    Verbeeck, K
    ADVANCES IN ARTIFICIAL LIFE, PROCEEDINGS, 1999, 1674 : 616 - 620
  • [22] Integrating reinforcement learning, bidding and genetic algorithms
    Qi, DH
    Sun, R
    IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003, : 53 - 59
  • [23] EPOCH-INCREMENTAL REINFORCEMENT LEARNING ALGORITHMS
    Zajdel, Roman
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2013, 23 (03) : 623 - 635
  • [24] Parallelization of Reinforcement Learning Algorithms for Video Games
    Kopel, Marek
    Szczurek, Witold
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2021, 2021, 12672 : 195 - 207
  • [25] Universal Reinforcement Learning Algorithms: Survey and Experiments
    Aslanides, John
    Leike, Jan
    Hutter, Marcus
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1403 - 1410
  • [26] Application of Reinforcement Learning in Dynamic Pricing Algorithms
    Wang Jintian
    Zhou Lei
    2009 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS ( ICAL 2009), VOLS 1-3, 2009, : 419 - 423
  • [27] Offline Evaluation of Online Reinforcement Learning Algorithms
    Mandel, Travis
    Liu, Yun-En
    Brunskill, Emma
    Popovic, Zoran
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1926 - 1933
  • [28] Reinforcement Learning Models and Algorithms for Diabetes Management
    Yau, Kok-Lim Alvin
    Chong, Yung-Wey
    Fan, Xiumei
    Wu, Celimuge
    Saleem, Yasir
    Lim, Phei-Ching
    IEEE ACCESS, 2023, 11 : 28391 - 28415
  • [29] Reinforcement Learning Algorithms with Selector, Tuner, or Estimator
    Ala’eddin Masadeh
    Zhengdao Wang
    Ahmed E. Kamal
    Arabian Journal for Science and Engineering, 2024, 49 : 4081 - 4095
  • [30] Reinforcement learning for online control of evolutionary algorithms
    Eiben, A. E.
    Horvath, Mark
    Kowalczyk, Wojtek
    Schut, Martijn C.
    ENGINEERING SELF-ORGANISING SYSTEMS, 2007, 4335 : 151 - +