Action Selection Methods in a Robotic Reinforcement Learning Scenario

被引:0
|
作者
Cruz, Francisco [1 ,2 ]
Wuppen, Peter [2 ]
Fazrie, Alvin [2 ]
Weber, Cornelius [2 ]
Wermter, Stefan [2 ]
机构
[1] Univ Cent Chile, Fac Ingn, Escuela Comp & Informat, Santiago, Chile
[2] Univ Hamburg, Dept Informat, Knowledge Technol, Hamburg, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning allows an agent to learn a new task while autonomously exploring its environment. For this aim, the agent chooses an action to perform among the available ones for a certain state. Nonetheless, a common problem for a reinforcement learning agent is to find a proper balance between exploration and exploitation of actions in order to achieve an optimal behavior. This paper compares multiple approaches to the exploration/exploitation dilemma in reinforcement learning and, moreover, it implements an exemplary reinforcement learning task within the domain of domestic robotics to show the performance of different exploration policies on it. We perform the domestic task using epsilon-greedy, softmax, VDBE, and VDBE-Softmax with online and offline temporal-difference learning. The obtained results show that the agent is able to collect larger and faster reward by using the VDBE-Softmax exploration strategy with both Q-learning and SARSA.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Empirical studies in action selection with reinforcement learning
    Whiteson, Shimon
    Taylor, Matthew E.
    Stone, Peter
    [J]. ADAPTIVE BEHAVIOR, 2007, 15 (01) : 33 - 50
  • [2] A Comparison of Action Selection Methods for Implicit Policy Method Reinforcement Learning in Continuous Action-Space
    Nichols, Barry D.
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3785 - 3792
  • [3] Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning
    Luo, Jianlan
    Dong, Perry
    Wu, Jeffrey
    Kumar, Aviral
    Geng, Xinyang
    Levine, Sergey
    [J]. CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [4] Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives
    Dalal, Murtaza
    Pathak, Deepak
    Salakhutdinov, Ruslan
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [5] Hierarchical Action Selection for Reinforcement Learning in Infinite Mario
    Joshi, Mandar
    Khobragade, Rakesh
    Sarda, Saurabh
    Deshpande, Umesh
    Mohan, Shiwali
    [J]. PROCEEDINGS OF THE SIXTH STARTING AI RESEARCHERS' SYMPOSIUM (STAIRS 2012), 2012, 241 : 162 - +
  • [6] Using suitable action selection rule in reinforcement learning
    Ohta, M
    Kumada, Y
    Noda, I
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 4358 - 4363
  • [7] Quantum computation for action selection using reinforcement learning
    C. L. Chen
    D. Y. Dong
    Z. H. Chen
    [J]. INTERNATIONAL JOURNAL OF QUANTUM INFORMATION, 2006, 4 (06) : 1071 - 1083
  • [8] Causal Based Action Selection Policy for Reinforcement Learning
    Feliciano-Avelino, Ivan
    Mendez-Molina, Arquimides
    Morales, Eduardo F.
    Enrique Sucar, L.
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE (MICAI 2021), PT I, 2021, 13067 : 213 - 227
  • [9] A formal methods approach to interpretable reinforcement learning for robotic planning
    Li, Xiao
    Serlin, Zachary
    Yang, Guang
    Belta, Calin
    [J]. SCIENCE ROBOTICS, 2019, 4 (37)
  • [10] Explainable robotic systems: understanding goal-driven actions in a reinforcement learning scenario
    Francisco Cruz
    Richard Dazeley
    Peter Vamplew
    Ithan Moreira
    [J]. Neural Computing and Applications, 2023, 35 : 18113 - 18130