Smoothed Sarsa: Reinforcement Learning for Robot Delivery Tasks

被引：0

作者：

Ramachandran, Deepak ^{[1
]}

Gupta, Rakesh ^{[2
]}

机构：

[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA

[2] Honda Res Inst USA Inc, Mountain View, CA 94041 USA

来源：

ICRA: 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-7 | 2009年

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Our goal in this work is to make high level decisions for mobile robots. In particular, given a queue of prioritized object delivery tasks, we wish to find a sequence of actions in real time to accomplish these tasks efficiently. We introduce a novel reinforcement learning algorithm called Smoothed Sarsa that learns a good policy for these delivery tasks by delaying the backup reinforcement step until the uncertainty in the state estimate improves. The state space is modeled by a Dynamic Bayesian Network and updated using a Region-based Particle Filter. We take advantage of the fact that only discrete (topological) representations of entity locations are needed for decision-making, to make the tracking and decision making more efficient. Our experiments show that policy search leads to faster task completion times as well as higher total reward compared to a manually crafted policy. Smoothed Sarsa learns a policy orders of magnitude faster than previous policy search algorithms. We demonstrate our results on the Player/Stage simulator and on the Pioneer robot.

引用

页码：3327 / +

页数：3

共 50 条

[1] Factored SARSA(λ) algorithm of reinforcement learning
Chen, H.W.
Xie, J.P.
Xie, L.J.
2001, Science Press (38):
[2] Quantum Deep Reinforcement Learning for Robot Navigation Tasks
Hohenfeld, Hans
Heimann, Dirk
Wiebe, Felix
Kirchner, Frank
IEEE ACCESS, 2024, 12 : 87217 - 87236
[3] Reinforcement Learning and Robust Control for Robot Compliance Tasks
Cheng-Peng Kuan
Kuu-young Young
Journal of Intelligent and Robotic Systems, 1998, 23 : 165 - 182
[4] Reinforcement learning and robust control for robot compliance tasks
Natl Chiao-Tung Univ, Hsinchu, Taiwan
J Intell Rob Syst Theor Appl, 2-4 (165-182):
[5] Reinforcement learning and robust control for robot compliance tasks
Kuan, CP
Young, KY
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 1998, 23 (2-4) : 165 - 182
[6] Improved SARSA and DQN algorithms for reinforcement learning
Yao, Guangyu
Zhang, Nan
Duan, Zhenhua
Tian, Cong
THEORETICAL COMPUTER SCIENCE, 2025, 1027
[7] Experiments in sequenced reinforcement learning of reactive tasks by a mobile robot
Buratti, D
Caselli, S
Zanichelli, F
Doty, KL
INTELLIGENT AUTONOMOUS SYSTEMS 6, 2000, : 535 - 542
[8] Least-Squares SARSA(λ) Algorithms for Reinforcement Learning
Chen, Sheng-Lei
Wei, Yan-Mei
ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2008, : 632 - +
[9] Swarm Reinforcement Learning Algorithms Based on Sarsa Method
Iima, Hitoshi
Kuroe, Yasuaki
2008 PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-7, 2008, : 1963 - 1967
[10] Deep Reinforcement Learning with Experience Replay Based on SARSA
Zhao, Dongbin
Wang, Haitao
Shao, Kun
Zhu, Yuanheng
PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,

← 1 2 3 4 5 →