Noise tolerance in reinforcement learning algorithms

被引：3

作者：

Ribeiro, Richardson ^{[1
]}

Koerich, Alessandro L. ^{[1
]}

Enembreck, Fabricio ^{[1
]}

机构：

[1] Pontif Cathol Univ Parana, Grad Program Comp Sci PPGIa, BR-80215901 Curitiba, Parana, Brazil

来源：

PROCEEDINGS OF THE IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY (IAT 2007) | 2007年

关键词：

adaptive autonomous agents; reinforcement learning and noise tolerant learning;

D O I：

10.1109/IAT.2007.94

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper proposes a mechanism of noise tolerance for reinforcement learning algorithms. An adaptive agent that employs reinforcement learning algorithms may receive and accumulate many rewards for its actions. However, the amount of rewards received by the agent is not a guarantee Of convergence to an optimal policy of action due to the noises produced by the environment. Therefore, we propose a noise tolerance mechanism which is able to estimate convergent policies without causing delays or an unexpected speedup in the agent's learning. Experimental results have shown that the proposed mechanism is able to speed up the convergence of the agent achieving good action policies very fast even in dynamic and noisy environments.

引用

页码：265 / 268

页数：4

共 50 条

[21] Formalizing the ant algorithms in terms of reinforcement learning
Nowé, A
Verbeeck, K
ADVANCES IN ARTIFICIAL LIFE, PROCEEDINGS, 1999, 1674 : 616 - 620
[22] Integrating reinforcement learning, bidding and genetic algorithms
Qi, DH
Sun, R
IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003, : 53 - 59
[23] EPOCH-INCREMENTAL REINFORCEMENT LEARNING ALGORITHMS
Zajdel, Roman
INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2013, 23 (03) : 623 - 635
[24] Parallelization of Reinforcement Learning Algorithms for Video Games
Kopel, Marek
Szczurek, Witold
INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2021, 2021, 12672 : 195 - 207
[25] Universal Reinforcement Learning Algorithms: Survey and Experiments
Aslanides, John
Leike, Jan
Hutter, Marcus
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1403 - 1410
[26] Application of Reinforcement Learning in Dynamic Pricing Algorithms
Wang Jintian
Zhou Lei
2009 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS ( ICAL 2009), VOLS 1-3, 2009, : 419 - 423
[27] Offline Evaluation of Online Reinforcement Learning Algorithms
Mandel, Travis
Liu, Yun-En
Brunskill, Emma
Popovic, Zoran
THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1926 - 1933
[28] Reinforcement Learning Models and Algorithms for Diabetes Management
Yau, Kok-Lim Alvin
Chong, Yung-Wey
Fan, Xiumei
Wu, Celimuge
Saleem, Yasir
Lim, Phei-Ching
IEEE ACCESS, 2023, 11 : 28391 - 28415
[29] Reinforcement Learning Algorithms with Selector, Tuner, or Estimator
Ala’eddin Masadeh
Zhengdao Wang
Ahmed E. Kamal
Arabian Journal for Science and Engineering, 2024, 49 : 4081 - 4095
[30] Reinforcement learning for online control of evolutionary algorithms
Eiben, A. E.
Horvath, Mark
Kowalczyk, Wojtek
Schut, Martijn C.
ENGINEERING SELF-ORGANISING SYSTEMS, 2007, 4335 : 151 - +

← 1 2 3 4 5 →