Smoothed Sarsa: Reinforcement Learning for Robot Delivery Tasks

被引:0
|
作者
Ramachandran, Deepak [1 ]
Gupta, Rakesh [2 ]
机构
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[2] Honda Res Inst USA Inc, Mountain View, CA 94041 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Our goal in this work is to make high level decisions for mobile robots. In particular, given a queue of prioritized object delivery tasks, we wish to find a sequence of actions in real time to accomplish these tasks efficiently. We introduce a novel reinforcement learning algorithm called Smoothed Sarsa that learns a good policy for these delivery tasks by delaying the backup reinforcement step until the uncertainty in the state estimate improves. The state space is modeled by a Dynamic Bayesian Network and updated using a Region-based Particle Filter. We take advantage of the fact that only discrete (topological) representations of entity locations are needed for decision-making, to make the tracking and decision making more efficient. Our experiments show that policy search leads to faster task completion times as well as higher total reward compared to a manually crafted policy. Smoothed Sarsa learns a policy orders of magnitude faster than previous policy search algorithms. We demonstrate our results on the Player/Stage simulator and on the Pioneer robot.
引用
收藏
页码:3327 / +
页数:3
相关论文
共 50 条
  • [1] Factored SARSA(λ) algorithm of reinforcement learning
    Chen, H.W.
    Xie, J.P.
    Xie, L.J.
    2001, Science Press (38):
  • [2] Quantum Deep Reinforcement Learning for Robot Navigation Tasks
    Hohenfeld, Hans
    Heimann, Dirk
    Wiebe, Felix
    Kirchner, Frank
    IEEE ACCESS, 2024, 12 : 87217 - 87236
  • [3] Reinforcement Learning and Robust Control for Robot Compliance Tasks
    Cheng-Peng Kuan
    Kuu-young Young
    Journal of Intelligent and Robotic Systems, 1998, 23 : 165 - 182
  • [4] Reinforcement learning and robust control for robot compliance tasks
    Natl Chiao-Tung Univ, Hsinchu, Taiwan
    J Intell Rob Syst Theor Appl, 2-4 (165-182):
  • [5] Reinforcement learning and robust control for robot compliance tasks
    Kuan, CP
    Young, KY
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 1998, 23 (2-4) : 165 - 182
  • [6] Improved SARSA and DQN algorithms for reinforcement learning
    Yao, Guangyu
    Zhang, Nan
    Duan, Zhenhua
    Tian, Cong
    THEORETICAL COMPUTER SCIENCE, 2025, 1027
  • [7] Experiments in sequenced reinforcement learning of reactive tasks by a mobile robot
    Buratti, D
    Caselli, S
    Zanichelli, F
    Doty, KL
    INTELLIGENT AUTONOMOUS SYSTEMS 6, 2000, : 535 - 542
  • [8] Least-Squares SARSA(λ) Algorithms for Reinforcement Learning
    Chen, Sheng-Lei
    Wei, Yan-Mei
    ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2008, : 632 - +
  • [9] Swarm Reinforcement Learning Algorithms Based on Sarsa Method
    Iima, Hitoshi
    Kuroe, Yasuaki
    2008 PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-7, 2008, : 1963 - 1967
  • [10] Deep Reinforcement Learning with Experience Replay Based on SARSA
    Zhao, Dongbin
    Wang, Haitao
    Shao, Kun
    Zhu, Yuanheng
    PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,