Action Selection Methods in a Robotic Reinforcement Learning Scenario

被引：0

作者：

Cruz, Francisco ^{[1
,2
]}

Wuppen, Peter ^{[2
]}

Fazrie, Alvin ^{[2
]}

Weber, Cornelius ^{[2
]}

Wermter, Stefan ^{[2
]}

机构：

[1] Univ Cent Chile, Fac Ingn, Escuela Comp & Informat, Santiago, Chile

[2] Univ Hamburg, Dept Informat, Knowledge Technol, Hamburg, Germany

来源：

2018 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI) | 2018年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Reinforcement learning allows an agent to learn a new task while autonomously exploring its environment. For this aim, the agent chooses an action to perform among the available ones for a certain state. Nonetheless, a common problem for a reinforcement learning agent is to find a proper balance between exploration and exploitation of actions in order to achieve an optimal behavior. This paper compares multiple approaches to the exploration/exploitation dilemma in reinforcement learning and, moreover, it implements an exemplary reinforcement learning task within the domain of domestic robotics to show the performance of different exploration policies on it. We perform the domestic task using epsilon-greedy, softmax, VDBE, and VDBE-Softmax with online and offline temporal-difference learning. The obtained results show that the agent is able to collect larger and faster reward by using the VDBE-Softmax exploration strategy with both Q-learning and SARSA.

引用

页数：6

共 50 条

[1] Empirical studies in action selection with reinforcement learning
Whiteson, Shimon
Taylor, Matthew E.
Stone, Peter
[J]. ADAPTIVE BEHAVIOR, 2007, 15 (01) : 33 - 50
[2] A Comparison of Action Selection Methods for Implicit Policy Method Reinforcement Learning in Continuous Action-Space
Nichols, Barry D.
[J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3785 - 3792
[3] Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning
Luo, Jianlan
Dong, Perry
Wu, Jeffrey
Kumar, Aviral
Geng, Xinyang
Levine, Sergey
[J]. CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
[4] Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives
Dalal, Murtaza
Pathak, Deepak
Salakhutdinov, Ruslan
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[5] Hierarchical Action Selection for Reinforcement Learning in Infinite Mario
Joshi, Mandar
Khobragade, Rakesh
Sarda, Saurabh
Deshpande, Umesh
Mohan, Shiwali
[J]. PROCEEDINGS OF THE SIXTH STARTING AI RESEARCHERS' SYMPOSIUM (STAIRS 2012), 2012, 241 : 162 - +
[6] Using suitable action selection rule in reinforcement learning
Ohta, M
Kumada, Y
Noda, I
[J]. 2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 4358 - 4363
[7] Quantum computation for action selection using reinforcement learning
C. L. Chen
D. Y. Dong
Z. H. Chen
[J]. INTERNATIONAL JOURNAL OF QUANTUM INFORMATION, 2006, 4 (06) : 1071 - 1083
[8] Causal Based Action Selection Policy for Reinforcement Learning
Feliciano-Avelino, Ivan
Mendez-Molina, Arquimides
Morales, Eduardo F.
Enrique Sucar, L.
[J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE (MICAI 2021), PT I, 2021, 13067 : 213 - 227
[9] A formal methods approach to interpretable reinforcement learning for robotic planning
Li, Xiao
Serlin, Zachary
Yang, Guang
Belta, Calin
[J]. SCIENCE ROBOTICS, 2019, 4 (37)
[10] Explainable robotic systems: understanding goal-driven actions in a reinforcement learning scenario
Francisco Cruz
Richard Dazeley
Peter Vamplew
Ithan Moreira
[J]. Neural Computing and Applications, 2023, 35 : 18113 - 18130

← 1 2 3 4 5 →