An Algorithm of Complete Coverage Path Planning for Unmanned Surface Vehicle Based on Reinforcement Learning

被引:19
|
作者
Xing, Bowen [1 ]
Wang, Xiao [1 ,2 ]
Yang, Liu [1 ]
Liu, Zhenchong [3 ]
Wu, Qingyun [1 ]
机构
[1] Shanghai Ocean Univ, Coll Engn Sci & Technol, Shanghai 201306, Peoples R China
[2] Shanghai Invest Design & Res Inst, Shanghai 200335, Peoples R China
[3] Shanghai Zhongchuan NERC SDT Co Ltd, Shanghai 201114, Peoples R China
关键词
environment modeling; raster map; screening matrix; DQN; reward function;
D O I
10.3390/jmse11030645
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
A deep reinforcement learning method to achieve complete coverage path planning for an unmanned surface vehicle (USV) is proposed. This paper firstly models the USV and the workspace required for complete coverage. Then, for the full-coverage path planning task, this paper proposes a preprocessing method for raster maps, which can effectively delete the blank areas that are impossible to cover in the raster map. In this paper, the state matrix corresponding to the preprocessed raster map is used as the input of the deep neural network. The deep Q network (DQN) is used to train the complete coverage path planning strategy of the agent. The improvement of the selection of random actions during training is first proposed. Considering the task of complete coverage path planning, this paper replaces random actions with a set of actions toward the nearest uncovered grid. To solve the problem of the slow convergence speed of the deep reinforcement learning network in full-coverage path planning, this paper proposes an improved method of deep reinforcement learning, which superimposes the final output layer with a dangerous actions matrix to reduce the risk of selection of dangerous actions of USVs during the learning process. Finally, the designed method validates via simulation examples.
引用
下载
收藏
页数:19
相关论文
共 50 条
  • [21] Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments
    Xie, Ronglei
    Meng, Zhijun
    Wang, Lifeng
    Li, Haochen
    Wang, Kaipeng
    Wu, Zhe
    IEEE ACCESS, 2021, 9 : 24884 - 24900
  • [22] Maximum Information Coverage and Monitoring Path Planning with Unmanned Surface Vehicles Using Deep Reinforcement Learning
    Yanes Luis, Samuel
    Gutierrez Reina, Daniel
    Toral, Sergio
    OPTIMIZATION AND LEARNING, OLA 2022, 2022, 1684 : 13 - 24
  • [23] An Improved A-Star Algorithm for Complete Coverage Path Planning of Unmanned Ships
    Guo, Bo
    Kuang, Zhen
    Guan, Juhua
    Hu, Mengting
    Rao, Lanxiang
    Sun, Xiaoqiang
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (03)
  • [24] Deep reinforcement learning-based controller for path following of an unmanned surface vehicle
    Woo, Joohyun
    Yu, Chanwoo
    Kim, Nakwan
    OCEAN ENGINEERING, 2019, 183 : 155 - 166
  • [25] Coverage path planning of unmanned surface vehicle based on improved biological inspired neural network
    Tang, Fei
    OCEAN ENGINEERING, 2023, 278
  • [26] Asynchronous Multithreading Reinforcement-Learning-Based Path Planning and Tracking for Unmanned Underwater Vehicle
    He, Zichen
    Dong, Lu
    Sun, Changyin
    Wang, Jiawei
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (05): : 2757 - 2769
  • [27] An Improved Deep Reinforcement Learning Algorithm for Path Planning in Unmanned Driving
    Yang, Kai
    Liu, Li
    IEEE ACCESS, 2024, 12 : 67935 - 67944
  • [28] A coverage path planning approach for environmental monitoring using an unmanned surface vehicle
    Ramkumar Sudha S.K.
    Mishra D.
    Hameed I.A.
    Ocean Engineering, 2024, 310
  • [29] Optimal search path planning for unmanned surface vehicle based on an improved genetic algorithm
    Guo, Hui
    Mao, Zhaoyong
    Ding, Wenjun
    Liu, Peiliang
    COMPUTERS & ELECTRICAL ENGINEERING, 2019, 79
  • [30] Path Planning for Unmanned Surface Vehicle based on genetic algorithm and sequential quadratic programming
    Zhuang, Yufei
    Wang, Cheng
    Huang, Haibin
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 3513 - 3518