Vision-Based Reinforcement Learning using Approximate Policy Iteration

被引:0
|
作者
Shaker, Marwan R. [1 ]
Yue, Shigang [1 ]
Duckett, Tom [1 ]
机构
[1] Lincoln Univ, Dept Comp & Informat, Lincoln LN6 7TS, England
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A major issue for reinforcement learning (RL) applied to robotics is the time required to learn a new skill. While RL has been used to learn mobile robot control in many simulated domains, applications involving learning on real robots are still relatively rare. In this paper, the Least-Squares Policy Iteration (LSPI) reinforcement learning algorithm and a new model-based algorithm Least-Squares Policy Iteration with Prioritized Sweeping (LSPI+), are implemented on a mobile robot to acquire new skills quickly and efficiently. LSPI+ combines the benefits of LSPI and prioritized sweeping, which uses all previous experience to focus the computational effort on the most "interesting" or dynamic parts of the state space. The proposed algorithms are tested on a household vacuum cleaner robot for learning a docking task using vision as the only sensor modality. In experiments these algorithms are compared to other model-based and model-free RL algorithms. The results show that the number of trials required to learn the docking task is significantly reduced using LSPI compared to the other RL algorithms investigated, and that LSPI+ further improves on the performance of LSPI.
引用
收藏
页码:594 / 599
页数:6
相关论文
共 50 条
  • [1] Approximate Inverse Reinforcement Learning from Vision-based Imitation Learning
    Lee, Keuntaek
    Vlahov, Bogdan
    Gibson, Jason
    Rehg, James M.
    Theodorou, Evangelos A.
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 10793 - 10799
  • [2] Vision-based Navigation Using Deep Reinforcement Learning
    Kulhanek, Jonas
    Derner, Erik
    de Bruin, Tim
    Babuska, Robert
    [J]. 2019 EUROPEAN CONFERENCE ON MOBILE ROBOTS (ECMR), 2019,
  • [3] Reinforcement Learning Control of a Real Mobile Robot Using Approximate Policy Iteration
    Zhang, Pengchen
    Xu, Xin
    Liu, Chunming
    Yuan, Qiping
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 3, PROCEEDINGS, 2009, 5553 : 278 - 288
  • [4] Vision-based reinforcement learning for robot navigation
    Zhu, WY
    Levinson, S
    [J]. IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 1025 - 1030
  • [5] Multi-agent reinforcement learning using ordinal action selection and approximate policy iteration
    Liu, Daxue
    Wu, Jun
    Xu, Xin
    [J]. INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2016, 14 (06)
  • [6] Vision-Based Deep Reinforcement Learning of UAV-UGV Collaborative Landing Policy Using Automatic Curriculum
    Wang, Chang
    Wang, Jiaqing
    Wei, Changyun
    Zhu, Yi
    Yin, Dong
    Li, Jie
    [J]. DRONES, 2023, 7 (11)
  • [7] Intrinsically Motivated NeuroEvolution for Vision-Based Reinforcement Learning
    Cuccu, Giuseppe
    Luciw, Matthew
    Schmidhuber, Juergen
    Gomez, Faustino
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING (ICDL), 2011,
  • [8] Continual Vision-based Reinforcement Learning with Group Symmetries
    Liu, Shiqi
    Xu, Mengdi
    Huang, Peide
    Zhang, Xilun
    Liu, Yongkang
    Oguchi, Kentaro
    Zhao, Ding
    [J]. CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [9] Vision-based Deep Reinforcement Learning to Control a Manipulator
    Kim, Wonchul
    Kim, Taewan
    Lee, Jonggu
    Kim, H. Jin
    [J]. 2017 11TH ASIAN CONTROL CONFERENCE (ASCC), 2017, : 1046 - 1050
  • [10] Review of vision-based reinforcement learning for drone navigation
    Aburaya, Anas
    Selamat, Hazlina
    Muslim, Mohd Taufiq
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT ROBOTICS AND APPLICATIONS, 2024,