An Algorithm of Complete Coverage Path Planning for Unmanned Surface Vehicle Based on Reinforcement Learning

被引:19
|
作者
Xing, Bowen [1 ]
Wang, Xiao [1 ,2 ]
Yang, Liu [1 ]
Liu, Zhenchong [3 ]
Wu, Qingyun [1 ]
机构
[1] Shanghai Ocean Univ, Coll Engn Sci & Technol, Shanghai 201306, Peoples R China
[2] Shanghai Invest Design & Res Inst, Shanghai 200335, Peoples R China
[3] Shanghai Zhongchuan NERC SDT Co Ltd, Shanghai 201114, Peoples R China
关键词
environment modeling; raster map; screening matrix; DQN; reward function;
D O I
10.3390/jmse11030645
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
A deep reinforcement learning method to achieve complete coverage path planning for an unmanned surface vehicle (USV) is proposed. This paper firstly models the USV and the workspace required for complete coverage. Then, for the full-coverage path planning task, this paper proposes a preprocessing method for raster maps, which can effectively delete the blank areas that are impossible to cover in the raster map. In this paper, the state matrix corresponding to the preprocessed raster map is used as the input of the deep neural network. The deep Q network (DQN) is used to train the complete coverage path planning strategy of the agent. The improvement of the selection of random actions during training is first proposed. Considering the task of complete coverage path planning, this paper replaces random actions with a set of actions toward the nearest uncovered grid. To solve the problem of the slow convergence speed of the deep reinforcement learning network in full-coverage path planning, this paper proposes an improved method of deep reinforcement learning, which superimposes the final output layer with a dangerous actions matrix to reduce the risk of selection of dangerous actions of USVs during the learning process. Finally, the designed method validates via simulation examples.
引用
下载
收藏
页数:19
相关论文
共 50 条
  • [31] Global Path Planning of Unmanned Surface Vehicle Based on Improved A-Star Algorithm
    Zhang, Huixia
    Tao, Yadong
    Zhu, Wenliang
    SENSORS, 2023, 23 (14)
  • [32] Unmanned aerial vehicle path planning based on TLBO algorithm
    Yu, Guolin (guolin_yu@126.com), 1600, Massey University (07):
  • [33] Global path planning of unmanned vehicle based on improved A* algorithm
    Liang, Hao
    Du, Xiaofang
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ALGORITHMS, SOFTWARE ENGINEERING, AND NETWORK SECURITY, ASENS 2024, 2024, : 176 - 184
  • [34] UNMANNED AERIAL VEHICLE PATH PLANNING BASED ON TLBO ALGORITHM
    Yu, Guolin
    Song, Hui
    Gao, Jie
    INTERNATIONAL JOURNAL ON SMART SENSING AND INTELLIGENT SYSTEMS, 2014, 7 (03) : 1310 - 1325
  • [35] Unmanned aircraft vehicle path planning based on SVM algorithm
    Chen, Yanhong
    Zu, Wei
    Fan, Guoliang
    Chang, Hongxing
    Advances in Intelligent Systems and Computing, 2014, 215 : 705 - 714
  • [36] Application of Improved Genetic Algorithm to Unmanned Surface Vehicle Path Planning
    Long, Yang
    Su, Yixin
    Zhang, Huajun
    Li, Ming
    PROCEEDINGS OF 2018 IEEE 7TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS), 2018, : 209 - 212
  • [37] An Improved Genetic Algorithm for Path-Planning of Unmanned Surface Vehicle
    Xin, Junfeng
    Zhong, Jiabao
    Yang, Fengru
    Cui, Ying
    Sheng, Jinlu
    SENSORS, 2019, 19 (11)
  • [38] Hybrid bacterial foraging algorithm for unmanned surface vehicle path planning
    Long Y.
    Su Y.
    Lian C.
    Zhang D.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2022, 50 (03): : 68 - 73
  • [39] Reinforcement learning-based complete area coverage path planning for a modified htrihex robot
    Apuroop, Koppaka Ganesh Sai
    Le, Anh Vu
    Elara, Mohan Rajesh
    Sheu, Bing J.
    Sensors (Switzerland), 2021, 21 (04): : 1 - 20
  • [40] Reinforcement Learning-Based Complete Area Coverage Path Planning for a Modified hTrihex Robot
    Apuroop, Koppaka Ganesh Sai
    Le, Anh Vu
    Elara, Mohan Rajesh
    Sheu, Bing J.
    SENSORS, 2021, 21 (04) : 1 - 20