Ship Collision Avoidance Using Constrained Deep Reinforcement Learning

被引:0
|
作者
Zhang, Rui [1 ]
Wang, Xiao [2 ]
Liu, Kezhong [3 ]
Wu, Xiaolie [4 ]
Lu, Tianyou [2 ]
Chao Zhaohui [2 ]
机构
[1] Wuhan Univ Technol, Sch Comp Sci & Technol, Hubei Key Lab Transportat Internet Things, Wuhan 434070, Hubei, Peoples R China
[2] Wuhan Univ Technol, Sch Comp Sci & Technol, Wuhan 434070, Hubei, Peoples R China
[3] Wuhan Univ Technol, Sch Nav, Hubei Key Lab Inland Shipping Technol, Wuhan 434070, Hubei, Peoples R China
[4] Wuhan Univ Technol, Sch Nav, Wuhan 434070, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
reinforcement learning; constraint; collision avoidance; Deep Q Network;
D O I
10.1109/BESC.2018.00031
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In recent years, the rapid development of mobile technology and application platforms has provided better services for life and work. Artificial intelligence and mobile technology have made traffic ever more convenient. As an artificial intelligence method that intersects with multiple disciplines and fields, reinforcement learning has been proved to be highly effective in the automatic driving of vehicles. However, there are still many difficulties in ship collision avoidance, because it involves continuous actions and complicated regulations. We find that by constraining the states, actions and regulation of reinforcement learning, we can well apply reinforcement learning to ship collision avoidance with vast states and actions at the same time. Hence, we propose Constrained-DQN(Deep Q Network), which is used to limit the state and action set, and separate reward value via different regulations. Experiments show that Constrained-DQN is more stable and adaptive in handling continuous space than traditional path planning algorithms.
引用
收藏
页码:115 / 120
页数:6
相关论文
共 50 条
  • [41] Large Scale Pursuit-Evasion Under Collision Avoidance Using Deep Reinforcement Learning
    Yang, Helei
    Ge, Peng
    Cao, Junjie
    Yang, Yifan
    Liu, Yong
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 2232 - 2239
  • [42] Toward Observation Based Least Restrictive Collision Avoidance Using Deep Meta Reinforcement Learning
    Asayesh, Salar
    Chen, Mo
    Mehrandezh, Mehran
    Gupta, Kamal
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04): : 7445 - 7452
  • [43] Knowledge transfer enabled reinforcement learning for efficient and safe autonomous ship collision avoidance
    Wang, Chengbo
    Wang, Ning
    Gao, Hongbo
    Wang, Leihao
    Zhao, Yizhuo
    Fang, Mingxing
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (09) : 3715 - 3731
  • [44] WORK PROCESS TRANSFER REINFORCEMENT LEARNING: FEATURE EXTRACTION AND FINETUNING IN SHIP COLLISION AVOIDANCE
    Wang, Xinrui
    Jin, Yan
    PROCEEDINGS OF ASME 2022 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2022, VOL 2, 2022,
  • [45] Collision Avoidance in Crowded Zone Using Adversarial Reinforcement Learning
    Natesan, Balaji
    Liu, Chuan-Ming
    2021 INTERNATIONAL CONFERENCE ON SECURITY AND INFORMATION TECHNOLOGIES WITH AI, INTERNET COMPUTING AND BIG-DATA APPLICATIONS, 2023, 314 : 276 - 283
  • [46] A composite learning method for multi-ship collision avoidance based on reinforcement learning and inverse control
    Xie, Shuo
    Chu, Xiumin
    Zheng, Mao
    Liu, Chenguang
    NEUROCOMPUTING, 2020, 411 (411) : 375 - 392
  • [47] COLREGs-compliant multiship collision avoidance based on deep reinforcement learning
    Zhao, Luman
    Roh, Myung-Il
    OCEAN ENGINEERING, 2019, 191
  • [48] Research on MASS Collision Avoidance in Complex Waters Based on Deep Reinforcement Learning
    Liu, Jiao
    Shi, Guoyou
    Zhu, Kaige
    Shi, Jiahui
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (04)
  • [49] Collision Avoidance for Indoor Service Robots Through Multimodal Deep Reinforcement Learning
    Leiva, Francisco
    Lobos-Tsunekawa, Kenzo
    Ruiz-del-Solar, Javier
    ROBOT WORLD CUP XXIII, ROBOCUP 2019, 2019, 11531 : 140 - 153
  • [50] Deep reinforcement learning with predictive auxiliary task for autonomous train collision avoidance
    Plissonneau, Antoine
    Jourdan, Luca
    Trentesaux, Damien
    Abdi, Lotfi
    Sallak, Mohamed
    Bekrar, Abdelghani
    Quost, Benjamin
    Schoen, Walter
    JOURNAL OF RAIL TRANSPORT PLANNING & MANAGEMENT, 2024, 31