Deep reinforcement learning with dynamic window approach based collision avoidance path planning for maritime autonomous surface ships

被引:8
|
作者
Wu, Chuanbo [1 ,3 ]
Yu, Wangneng [1 ,2 ,3 ]
Li, Guangze [1 ,3 ]
Liao, Weiqiang [1 ,2 ,3 ]
机构
[1] Jimei Univ, Sch Marine Engn, Xiamen 361021, Peoples R China
[2] Fujian Prov Key Lab Naval Architecture & Ocean Eng, Xiamen 361021, Peoples R China
[3] Fujian Engn & Res Ctr Offshore Small Green Intelli, Xiamen 361021, Peoples R China
基金
中国国家自然科学基金;
关键词
Ship collision avoidance; Dynamic window approach; Deep reinforcement learning; Maritime autonomous surface ships; OPTIMIZATION; ALGORITHM;
D O I
10.1016/j.oceaneng.2023.115208
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Automatic obstacle avoidance technology is one of the key technologies for ship intelligence. The purpose of this paper is to investigate the obstacle avoidance problem of maritime autonomous surface ships(MASS) in a complex offshore environment, and an obstacle avoidance strategy based on deep reinforcement learning and a dynamic window algorithm was proposed. To solve the collision avoidance problems that may occur during intelligent ship navigation, the action space of the proximal policy optimization (PPO) algorithm is defined according to the description of ship motion by linear and angular velocity in the dynamic window approach (DWA). The maximum detection distance of the MASS is utilized to construct the ship safety domain, which determines the state space containing the information of this ship and the nearest obstacle. To solve the problem of sparse reward, the reward function of the PPO is improved by combining the evaluation functions for distance, velocity and heading in the DWA. To verify the effectiveness of the algorithm, simulation experiments are performed in various situations. It is also shown that the improved algorithm can make the optimal collision avoidance decision from the complex environment and can effectively realize autonomous collision avoidance path planning for the MASS.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Deep reinforcement learning based collision avoidance system for autonomous ships
    Wang, Yong
    Xu, Haixiang
    Feng, Hui
    He, Jianhua
    Yang, Haojie
    Li, Fen
    Yang, Zhen
    [J]. OCEAN ENGINEERING, 2024, 292
  • [2] Path Planning of Maritime Autonomous Surface Ships in Unknown Environment with Reinforcement Learning
    Wang, Chengbo
    Zhang, Xinyu
    Li, Ruijie
    Dong, Peifang
    [J]. COGNITIVE SYSTEMS AND SIGNAL PROCESSING, PT II, 2019, 1006 : 127 - 137
  • [3] An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning
    Guo, Siyu
    Zhang, Xiuguo
    Zheng, Yisong
    Du, Yiquan
    [J]. SENSORS, 2020, 20 (02)
  • [4] Dynamic Tabu Search for Collision Avoidance in Autonomous Maritime Ships
    Alptekin, Burak
    Kahraman, Nihan
    [J]. IEEE ACCESS, 2024, 12 : 89763 - 89775
  • [5] Integrating situation-aware knowledge maps and dynamic window approach for safe path planning by maritime autonomous surface ships
    Song, Rongxin
    Papadimitriou, Eleonora
    Negenborn, Rudy R.
    van Gelder, Pieter
    [J]. OCEAN ENGINEERING, 2024, 311
  • [6] Collaborative collision avoidance for Maritime Autonomous Surface Ships: A review
    Akdag, Melih
    Solnor, Petter
    Johansen, Tor Arne
    [J]. OCEAN ENGINEERING, 2022, 250
  • [7] Adaptive Path Planning for Autonomous Ships Based on Deep Reinforcement Learning Combined with Images
    Zheng, Kangjie
    Zhang, Xinyu
    Wang, Chengbo
    Cui, Hao
    Wang, Leihao
    [J]. PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 1706 - 1715
  • [8] Taming an Autonomous Surface Vehicle for Path Following and Collision Avoidance Using Deep Reinforcement Learning
    Meyer, Eivind
    Robinson, Haakon
    Rasheed, Adil
    San, Omer
    [J]. IEEE ACCESS, 2020, 8 : 41466 - 41481
  • [9] A novel path planning approach for unmanned ships based on deep reinforcement learning
    Chen, Chen
    Ma, Feng
    Liu, Jia-Lun
    Yan, Xin-Ping
    Chen, Xian-Qiao
    [J]. DATA SCIENCE AND KNOWLEDGE ENGINEERING FOR SENSING DECISION SUPPORT, 2018, 11 : 626 - 633
  • [10] Path planning and dynamic collision avoidance algorithm under COLREGs via deep reinforcement learning
    Xu, Xinli
    Cai, Peng
    Ahmed, Zahoor
    Yellapu, Vidya Sagar
    Zhang, Weidong
    [J]. NEUROCOMPUTING, 2022, 468 : 181 - 197