A Reinforcement Learning Based Online Coverage Path Planning Algorithm

被引：1

作者：

Carvalho, Jose Pedro ^{[1
,2
]}

Pedro Aguiar, A. ^{[1
,2
]}

机构：

[1] Univ Porto, Fac Engn, SYSTEC ARISE, Porto, Portugal

[2] Univ Porto, Fac Engn, ECE Dept, Porto, Portugal

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS, ICARSC | 2023年

关键词：

Coverage Path Planning; Reinforcement Learning; Temporal Differences Learning; Q-Learning; POMDP; GO;

D O I：

10.1109/ICARSC58346.2023.10129591

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Coverage Path Planning (CPP) is a common task in robotics that consists in computing collision-free paths that pass through all the specified points from an area of interest. This task is known to be NP-Hard, and increasingly complex when the agent relies exclusively on sensor information. Reinforcement Learning methods appear as an interesting solution to deal with the complexity of this problem and obtain efficient solutions. This paper presents an online CPP algorithm based on Tabular Temporal Difference Learning methods, for a generic robotic platform with a ranging sensor. The problem is formulated as a Partially Observed Markov Decision Process and an RL scheme that includes a modified policy with a heuristic method is proposed. The presented approach provides a way to mix the concepts of classical algorithms with RL, enabling the tabular algorithm to overcome the shortcomings of the inherent large state space of CPP, and accelerated the training process by optimizing and reducing the policy space. The proposed algorithm is tested and its performance is compared in simulation using different Temporal Difference Learning methods, showing that it can efficiently complete the task with no prior information, with different map sizes, starting positions, and a random number of obstacles.

引用

页码：81 / 86

页数：6

共 50 条

[1] An Algorithm of Complete Coverage Path Planning for Unmanned Surface Vehicle Based on Reinforcement Learning
Xing, Bowen
Wang, Xiao
Yang, Liu
Liu, Zhenchong
Wu, Qingyun
[J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (03)
[2] ε*: An Online Coverage Path Planning Algorithm
Song, Junnan
Gupta, Shalabh
[J]. IEEE TRANSACTIONS ON ROBOTICS, 2018, 34 (02) : 526 - 533
[3] UCAV Path Planning Algorithm Based on Deep Reinforcement Learning
Zheng, Kaiyuan
Gao, Jingpeng
Shen, Liangxi
[J]. IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 702 - 714
[4] UAV online path planning technology based on deep reinforcement learning
Fan, Jiaxuan
Wang, Zhenya
Ren, Jinlei
Lu, Ying
Liu, Yiheng
[J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 5382 - 5386
[5] Coverage path planning for kiwifruit picking robots based on deep reinforcement learning
Wang, Yinchu
He, Zhi
Cao, Dandan
Ma, Li
Li, Kai
Jia, Liangsheng
Cui, Yongjie
[J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 205
[6] Multi-agent Coverage Path Planning Based on Security Reinforcement Learning
Li, Song
Ma, Zhuangzhuang
Zhang, Yunlin
Shao, Jinliang
[J]. Binggong Xuebao/Acta Armamentarii, 2023, 44 : 101 - 113
[7] Research on Full Coverage Path Planning Based on Reinforcement Learning in Nuclear Environment
Wang, Shiqi
Song, Shuzong
Liu, Zhenni
Ma, Lijun
[J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2024, 33 (04)
[8] An Algorithm of Complete Coverage Path Planning for Deep-Sea Mining Vehicle Clusters Based on Reinforcement Learning
Xing, Bowen
Wang, Xiao
Liu, Zhenchong
[J]. ADVANCED THEORY AND SIMULATIONS, 2024, 7 (04)
[9] Coverage Path Planning Optimization Based on Q-Learning Algorithm
Piardi, Luis
Lima, Jose
Pereira, Ana, I
Costa, Paulo
[J]. INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS (ICNAAM-2018), 2019, 2116
[10] An adaptive gain parameters algorithm for path planning based on reinforcement learning
Yu, JL
[J]. Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 3557 - 3562

← 1 2 3 4 5 →