Ship Collision Avoidance Using Constrained Deep Reinforcement Learning

Cited by: 0

Authors:

Zhang, Rui [1]

Wang, Xiao [2]

Liu, Kezhong [3]

Wu, Xiaolie [4]

Lu, Tianyou [2]

Chao, Zhaohui [2]

Affiliations:

[1] Wuhan Univ Technol, Sch Comp Sci & Technol, Hubei Key Lab Transportat Internet Things, Wuhan 434070, Hubei, Peoples R China

[2] Wuhan Univ Technol, Sch Comp Sci & Technol, Wuhan 434070, Hubei, Peoples R China

[3] Wuhan Univ Technol, Sch Nav, Hubei Key Lab Inland Shipping Technol, Wuhan 434070, Hubei, Peoples R China

[4] Wuhan Univ Technol, Sch Nav, Wuhan 434070, Hubei, Peoples R China

Source:

2018 5TH INTERNATIONAL CONFERENCE ON BEHAVIORAL, ECONOMIC, AND SOCIO-CULTURAL COMPUTING (BESC) | 2018

Funding:

National Natural Science Foundation of China;

Keywords:

reinforcement learning; constraint; collision avoidance; Deep Q Network;

DOI:

10.1109/BESC.2018.00031

Chinese Library Classification number:

TP39 [Computer applications];

Subject classification codes:

081203 ; 0835 ;

Abstract:

In recent years, the rapid development of mobile technology and application platforms has provided better services for life and work, and artificial intelligence combined with mobile technology has made transportation more convenient. Reinforcement learning, an artificial intelligence method that intersects with multiple disciplines and fields, has proven highly effective in the automatic driving of vehicles. However, ship collision avoidance remains difficult because it involves continuous actions and complicated regulations. We find that by constraining the states, actions, and regulations of reinforcement learning, it can be applied effectively to ship collision avoidance even when the state and action spaces are both vast. Hence, we propose Constrained-DQN (Deep Q Network), which limits the state and action sets and separates the reward value according to different regulations. Experiments show that Constrained-DQN is more stable and adaptive in handling continuous space than traditional path planning algorithms.


Pages: 115-120

Page count: 6

