An Algorithm of Complete Coverage Path Planning for Unmanned Surface Vehicle Based on Reinforcement Learning

被引：19

作者：

Xing, Bowen ^{[1
]}

Wang, Xiao ^{[1
,2
]}

Yang, Liu ^{[1
]}

Liu, Zhenchong ^{[3
]}

Wu, Qingyun ^{[1
]}

机构：

[1] Shanghai Ocean Univ, Coll Engn Sci & Technol, Shanghai 201306, Peoples R China

[2] Shanghai Invest Design & Res Inst, Shanghai 200335, Peoples R China

[3] Shanghai Zhongchuan NERC SDT Co Ltd, Shanghai 201114, Peoples R China

来源：

JOURNAL OF MARINE SCIENCE AND ENGINEERING | 2023年 / 11卷 / 03期

关键词：

environment modeling; raster map; screening matrix; DQN; reward function;

D O I：

10.3390/jmse11030645

中图分类号：

U6 [水路运输]; P75 [海洋工程];

学科分类号：

0814 ; 081505 ; 0824 ; 082401 ;

摘要：

A deep reinforcement learning method to achieve complete coverage path planning for an unmanned surface vehicle (USV) is proposed. This paper firstly models the USV and the workspace required for complete coverage. Then, for the full-coverage path planning task, this paper proposes a preprocessing method for raster maps, which can effectively delete the blank areas that are impossible to cover in the raster map. In this paper, the state matrix corresponding to the preprocessed raster map is used as the input of the deep neural network. The deep Q network (DQN) is used to train the complete coverage path planning strategy of the agent. The improvement of the selection of random actions during training is first proposed. Considering the task of complete coverage path planning, this paper replaces random actions with a set of actions toward the nearest uncovered grid. To solve the problem of the slow convergence speed of the deep reinforcement learning network in full-coverage path planning, this paper proposes an improved method of deep reinforcement learning, which superimposes the final output layer with a dangerous actions matrix to reduce the risk of selection of dangerous actions of USVs during the learning process. Finally, the designed method validates via simulation examples.

引用

下载

页数：19

共 50 条

[31] Global Path Planning of Unmanned Surface Vehicle Based on Improved A-Star Algorithm
Zhang, Huixia
Tao, Yadong
Zhu, Wenliang
SENSORS, 2023, 23 (14)
[32] Unmanned aerial vehicle path planning based on TLBO algorithm
Yu, Guolin (guolin_yu@126.com), 1600, Massey University (07):
[33] Global path planning of unmanned vehicle based on improved A* algorithm
Liang, Hao
Du, Xiaofang
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ALGORITHMS, SOFTWARE ENGINEERING, AND NETWORK SECURITY, ASENS 2024, 2024, : 176 - 184
[34] UNMANNED AERIAL VEHICLE PATH PLANNING BASED ON TLBO ALGORITHM
Yu, Guolin
Song, Hui
Gao, Jie
INTERNATIONAL JOURNAL ON SMART SENSING AND INTELLIGENT SYSTEMS, 2014, 7 (03) : 1310 - 1325
[35] Unmanned aircraft vehicle path planning based on SVM algorithm
Chen, Yanhong
Zu, Wei
Fan, Guoliang
Chang, Hongxing
Advances in Intelligent Systems and Computing, 2014, 215 : 705 - 714
[36] Application of Improved Genetic Algorithm to Unmanned Surface Vehicle Path Planning
Long, Yang
Su, Yixin
Zhang, Huajun
Li, Ming
PROCEEDINGS OF 2018 IEEE 7TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS), 2018, : 209 - 212
[37] An Improved Genetic Algorithm for Path-Planning of Unmanned Surface Vehicle
Xin, Junfeng
Zhong, Jiabao
Yang, Fengru
Cui, Ying
Sheng, Jinlu
SENSORS, 2019, 19 (11)
[38] Hybrid bacterial foraging algorithm for unmanned surface vehicle path planning
Long Y.
Su Y.
Lian C.
Zhang D.
Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2022, 50 (03): : 68 - 73
[39] Reinforcement learning-based complete area coverage path planning for a modified htrihex robot
Apuroop, Koppaka Ganesh Sai
Le, Anh Vu
Elara, Mohan Rajesh
Sheu, Bing J.
Sensors (Switzerland), 2021, 21 (04): : 1 - 20
[40] Reinforcement Learning-Based Complete Area Coverage Path Planning for a Modified hTrihex Robot
Apuroop, Koppaka Ganesh Sai
Le, Anh Vu
Elara, Mohan Rajesh
Sheu, Bing J.
SENSORS, 2021, 21 (04) : 1 - 20

← 1 2 3 4 5 →