A Reinforcement Learning Based Online Coverage Path Planning Algorithm

被引:1
|
作者
Carvalho, Jose Pedro [1 ,2 ]
Pedro Aguiar, A. [1 ,2 ]
机构
[1] Univ Porto, Fac Engn, SYSTEC ARISE, Porto, Portugal
[2] Univ Porto, Fac Engn, ECE Dept, Porto, Portugal
关键词
Coverage Path Planning; Reinforcement Learning; Temporal Differences Learning; Q-Learning; POMDP; GO;
D O I
10.1109/ICARSC58346.2023.10129591
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Coverage Path Planning (CPP) is a common task in robotics that consists in computing collision-free paths that pass through all the specified points from an area of interest. This task is known to be NP-Hard, and increasingly complex when the agent relies exclusively on sensor information. Reinforcement Learning methods appear as an interesting solution to deal with the complexity of this problem and obtain efficient solutions. This paper presents an online CPP algorithm based on Tabular Temporal Difference Learning methods, for a generic robotic platform with a ranging sensor. The problem is formulated as a Partially Observed Markov Decision Process and an RL scheme that includes a modified policy with a heuristic method is proposed. The presented approach provides a way to mix the concepts of classical algorithms with RL, enabling the tabular algorithm to overcome the shortcomings of the inherent large state space of CPP, and accelerated the training process by optimizing and reducing the policy space. The proposed algorithm is tested and its performance is compared in simulation using different Temporal Difference Learning methods, showing that it can efficiently complete the task with no prior information, with different map sizes, starting positions, and a random number of obstacles.
引用
收藏
页码:81 / 86
页数:6
相关论文
共 50 条
  • [1] An Algorithm of Complete Coverage Path Planning for Unmanned Surface Vehicle Based on Reinforcement Learning
    Xing, Bowen
    Wang, Xiao
    Yang, Liu
    Liu, Zhenchong
    Wu, Qingyun
    [J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (03)
  • [2] ε*: An Online Coverage Path Planning Algorithm
    Song, Junnan
    Gupta, Shalabh
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2018, 34 (02) : 526 - 533
  • [3] UCAV Path Planning Algorithm Based on Deep Reinforcement Learning
    Zheng, Kaiyuan
    Gao, Jingpeng
    Shen, Liangxi
    [J]. IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 702 - 714
  • [4] UAV online path planning technology based on deep reinforcement learning
    Fan, Jiaxuan
    Wang, Zhenya
    Ren, Jinlei
    Lu, Ying
    Liu, Yiheng
    [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 5382 - 5386
  • [5] Coverage path planning for kiwifruit picking robots based on deep reinforcement learning
    Wang, Yinchu
    He, Zhi
    Cao, Dandan
    Ma, Li
    Li, Kai
    Jia, Liangsheng
    Cui, Yongjie
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 205
  • [6] Multi-agent Coverage Path Planning Based on Security Reinforcement Learning
    Li, Song
    Ma, Zhuangzhuang
    Zhang, Yunlin
    Shao, Jinliang
    [J]. Binggong Xuebao/Acta Armamentarii, 2023, 44 : 101 - 113
  • [7] Research on Full Coverage Path Planning Based on Reinforcement Learning in Nuclear Environment
    Wang, Shiqi
    Song, Shuzong
    Liu, Zhenni
    Ma, Lijun
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2024, 33 (04)
  • [8] An Algorithm of Complete Coverage Path Planning for Deep-Sea Mining Vehicle Clusters Based on Reinforcement Learning
    Xing, Bowen
    Wang, Xiao
    Liu, Zhenchong
    [J]. ADVANCED THEORY AND SIMULATIONS, 2024, 7 (04)
  • [9] Coverage Path Planning Optimization Based on Q-Learning Algorithm
    Piardi, Luis
    Lima, Jose
    Pereira, Ana, I
    Costa, Paulo
    [J]. INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS (ICNAAM-2018), 2019, 2116
  • [10] An adaptive gain parameters algorithm for path planning based on reinforcement learning
    Yu, JL
    [J]. Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 3557 - 3562