Vision-Based Reinforcement Learning using Approximate Policy Iteration

Cited by: 0

Authors:

Shaker, Marwan R. [1]

Yue, Shigang [1]

Duckett, Tom [1]

Affiliations:

[1] Lincoln Univ, Dept Comp & Informat, Lincoln LN6 7TS, England

Source:

ICAR: 2009 14TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS, VOLS 1 AND 2 | 2009

Keywords:

DOI:

Not available

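Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code: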

0808 ; 0809 ;

Abstract:

A major issue for reinforcement learning (RL) applied to robotics is the time required to learn a new skill. While RL has been used to learn mobile robot control in many simulated domains, applications involving learning on real robots are still relatively rare. In this paper, the Least-Squares Policy Iteration (LSPI) reinforcement learning algorithm and a new model-based algorithm, Least-Squares Policy Iteration with Prioritized Sweeping (LSPI+), are implemented on a mobile robot to acquire new skills quickly and efficiently. LSPI+ combines the benefits of LSPI and prioritized sweeping, which uses all previous experience to focus the computational effort on the most "interesting" or dynamic parts of the state space. The proposed algorithms are tested on a household vacuum cleaner robot learning a docking task with vision as the only sensor modality. In experiments, these algorithms are compared to other model-based and model-free RL algorithms. The results show that LSPI significantly reduces the number of trials required to learn the docking task compared to the other RL algorithms investigated, and that LSPI+ further improves on the performance of LSPI.
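
To make the LSPI step mentioned in the abstract more concrete, the following is a minimal sketch of generic least-squares policy iteration, not the authors' implementation. Everything specific in it is an assumption made for illustration: a toy one-dimensional "docking" chain with five positions stands in for the vision-based task, features are one-hot over (state, action) pairs, and the helper names (step, phi, lstdq, lspi) are hypothetical. The prioritized-sweeping component of LSPI+ is not shown.

```python
# Illustrative LSPI sketch on an assumed toy chain; not the paper's setup.
GAMMA = 0.9
N_STATES = 5                      # positions 0..4; position 4 is the terminal "dock"
ACTIONS = (0, 1)                  # 0 = move left, 1 = move right
K = (N_STATES - 1) * len(ACTIONS)  # one feature per non-terminal (state, action)

def step(s, a):
    """Deterministic toy dynamics: reward 1.0 only when the dock is reached."""
    s2 = max(s - 1, 0) if a == 0 else s + 1
    done = s2 == N_STATES - 1
    return (1.0 if done else 0.0), s2, done

def phi(s, a):
    """One-hot feature vector for a non-terminal (state, action) pair."""
    f = [0.0] * K
    f[s * len(ACTIONS) + a] = 1.0
    return f

def q(w, s, a):
    return sum(wi * fi for wi, fi in zip(w, phi(s, a)))

def greedy(w, s):
    # Ties resolve to the first action (move left).
    return max(ACTIONS, key=lambda a: q(w, s, a))

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for j in range(c, n + 1):
                M[r][j] -= f * M[c][j]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))) / M[i][i]
    return x

def lstdq(samples, w):
    """Evaluate the policy that is greedy w.r.t. w from stored samples (LSTD-Q)."""
    A = [[0.0] * K for _ in range(K)]
    b = [0.0] * K
    for s, a, r, s2, done in samples:
        f = phi(s, a)
        # Terminal successors contribute a zero feature vector.
        f2 = [0.0] * K if done else phi(s2, greedy(w, s2))
        for i in range(K):
            if f[i] == 0.0:
                continue
            b[i] += f[i] * r
            for j in range(K):
                A[i][j] += f[i] * (f[j] - GAMMA * f2[j])
    return solve(A, b)

def lspi(samples, max_iter=20, tol=1e-9):
    """Alternate LSTD-Q evaluation and greedy improvement until w stops changing."""
    w = [0.0] * K
    for _ in range(max_iter):
        w_new = lstdq(samples, w)
        if max(abs(x - y) for x, y in zip(w, w_new)) < tol:
            return w_new
        w = w_new
    return w

def collect_samples():
    """One sample (s, a, r, s', done) per non-terminal state-action pair."""
    return [(s, a, *step(s, a)) for s in range(N_STATES - 1) for a in ACTIONS]

if __name__ == "__main__":
    w = lspi(collect_samples())
    print([greedy(w, s) for s in range(N_STATES - 1)])
```

The same batch of samples is reused in every iteration, which is the property that makes LSPI sample-efficient: no new trials are needed between policy updates. On this toy chain the learned greedy policy moves right, toward the dock, from every position.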


Pages: 594-599

Number of pages: 6

