Deep reinforcement learning based collision avoidance system for autonomous ships

被引:5
|
作者
Wang, Yong [2 ]
Xu, Haixiang [1 ,2 ]
Feng, Hui [1 ,2 ]
He, Jianhua [3 ]
Yang, Haojie [2 ]
Li, Fen [4 ]
Yang, Zhen [5 ]
机构
[1] Wuhan Univ Technol, Key Lab High Performance Ship Technol, Minist Educ, Wuhan 430063, Peoples R China
[2] Wuhan Univ Technol, Sch Naval Architecture Ocean & Energy Power Engn, Wuhan 430063, Peoples R China
[3] Univ Essex, Sch Comp Sci & Elect Engn, Colchester CO4 3SQ, England
[4] Wuhan Univ Technol, Personnel Dept, Wuhan 430070, Peoples R China
[5] Southern Marine Sci & Engn Guangdong Lab Zhuhai, Zhuhai 519000, Peoples R China
基金
英国工程与自然科学研究理事会; 中国国家自然科学基金;
关键词
Ship collision avoidance; Deep reinforcement learning; Parameter sharing; Navigation safety; Collision map;
D O I
10.1016/j.oceaneng.2023.116527
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Autonomous ships is a key to avoid accidents caused by human errors and improve maritime safety. However, unlike the autonomous vehicles counterpart, collision avoidance for autonomous ships faces many challenges due to the hash driving environments, difficult ship control and large stopping distance. In this paper, we investigate a collision avoidance system for autonomous ships under complex encounter scenarios, such as busy ports. In the system various sensors are used to detect objects and perceive the maritime environments. To help the autonomous ships handle the complex and dynamic scenarios that may be encountered, a collision map used to describe the ships encounter scenarios is generated and utilized as the input of a deep reinforcement learning (DRL) model. The DRL model is applied to make collision avoidance and safe driving decisions. New reward functions are proposed to train the DRL model to generate safe ship maneuver actions to reduce collisions and ensure compliance with the Convention on the International Regulations for Preventing Collisions at Sea (COLREGs). Furthermore, a self-adaptive parameters sharing approach is designed for fast convergence and collision avoidance performance of the DRL model, where the parameters of the fully connected layers are shared and the correlation layers are self adapted for the DRL critic and actor networks. Simulation results show that the proposed system has high DRL convergence speed and excellent collision avoidance.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Deep Reinforcement Learning for Collision Avoidance of Autonomous Vehicle
    Tseng, Hsiao-Ting
    Hsieh, Chen-Chiung
    Lin, Wei-Ting
    Lin, Jyun-Ting
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TAIWAN), 2020,
  • [2] Deep reinforcement learning-based collision avoidance for an autonomous ship
    Chun, Do-Hyun
    Roh, Myung-Il
    Lee, Hye-Won
    Ha, Jisang
    Yu, Donghun
    [J]. OCEAN ENGINEERING, 2021, 234
  • [3] Deep reinforcement learning with dynamic window approach based collision avoidance path planning for maritime autonomous surface ships
    Wu, Chuanbo
    Yu, Wangneng
    Li, Guangze
    Liao, Weiqiang
    [J]. OCEAN ENGINEERING, 2023, 284
  • [4] Deep-Reinforcement-Learning-Based Collision Avoidance of Autonomous Driving System for Vulnerable Road User Safety
    Chen, Haochong
    Cao, Xincheng
    Guvenc, Levent
    Aksun-Guvenc, Bilin
    [J]. ELECTRONICS, 2024, 13 (10)
  • [5] A Collision Avoidance Method Based on Deep Reinforcement Learning
    Feng, Shumin
    Sebastian, Bijo
    Ben-Tzvi, Pinhas
    [J]. ROBOTICS, 2021, 10 (02)
  • [6] Deep-reinforcement-learning-based UAV autonomous navigation and collision avoidance in unknown environments
    Fei WANG
    Xiaoping ZHU
    Zhou ZHOU
    Yang TANG
    [J]. Chinese Journal of Aeronautics, 2024, 37 (03) : 237 - 257
  • [7] CONTROL METHOD FOR PATH FOLLOWING AND COLLISION AVOIDANCE OF AUTONOMOUS SHIP BASED ON DEEP REINFORCEMENT LEARNING
    Zhao, Luman
    Roh, Myung-Il
    Lee, Sung-Jun
    [J]. JOURNAL OF MARINE SCIENCE AND TECHNOLOGY-TAIWAN, 2019, 27 (04): : 293 - 310
  • [8] Deep-reinforcement-learning-based UAV autonomous navigation and collision avoidance in unknown environments
    Wang, Fei
    Zhu, Xiaoping
    Zhou, Zhou
    Tang, Yang
    [J]. CHINESE JOURNAL OF AERONAUTICS, 2024, 37 (03) : 237 - 257
  • [9] Deep reinforcement learning with predictive auxiliary task for autonomous train collision avoidance
    Plissonneau, Antoine
    Jourdan, Luca
    Trentesaux, Damien
    Abdi, Lotfi
    Sallak, Mohamed
    Bekrar, Abdelghani
    Quost, Benjamin
    Schoen, Walter
    [J]. JOURNAL OF RAIL TRANSPORT PLANNING & MANAGEMENT, 2024, 31
  • [10] An Aircraft Collision Avoidance Method Based on Deep Reinforcement Learning
    Liu, Zuocheng
    Neretin, Evgeny
    Gao, Xiaoguang
    Wan, Kaifang
    [J]. 2024 9TH INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTICS ENGINEERING, ICCRE 2024, 2024, : 241 - 246