A learning method for AUV collision avoidance through deep reinforcement learning

被引:12
|
作者
Xu, Jian [1 ]
Huang, Fei [1 ]
Wu, Di [1 ]
Cui, Yunfei [1 ]
Yan, Zheping [1 ]
Du, Xue [1 ]
机构
[1] Harbin Engn Univ, Coll Intelligent Syst Sci & Engn, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Autonomous underwater vehicle; Collision avoidance; Deep reinforcement learning; Soft actor-critic; Event-triggered;
D O I
10.1016/j.oceaneng.2022.112038
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
This paper investigates the collision avoidance problem for autonomous underwater vehicle (AUV) with limited detection range via deep reinforcement learning method in an unknown underwater environment. Firstly, to deal with both unknown static and dynamic obstacles, we introduce two different event-triggered mechanism to generate obstacle-related states and reward the safe collision avoidance action of AUV, respectively. Secondly, the complete state space is customized by combining the state of AUV itself and the state related to a target area. In addition, considering the task requirement of reaching a target area with the shortest path, the complete reward function is designed. Then, we propose a novel event-triggered soft actor-critic (ET-SAC) algorithm for AUV collision avoidance, and give its detailed pseudo code and implementation architecture. Finally, training and evaluation are carried out in the simulation platform we designed, including unknown static obstacle environment and unknown dynamic obstacle environment in 2D/3D space, and the results confirm that the proposed algorithm of this paper is feasible and effective for AUV to avoid collision safely.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] A Collision Avoidance Method Based on Deep Reinforcement Learning
    Feng, Shumin
    Sebastian, Bijo
    Ben-Tzvi, Pinhas
    [J]. ROBOTICS, 2021, 10 (02)
  • [2] Smooth Trajectory Collision Avoidance through Deep Reinforcement Learning
    Song, Sirui
    Saunders, Kirk
    Yue, Ye
    Liu, Jundong
    [J]. 2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 914 - 919
  • [3] Formation Control with Collision Avoidance through Deep Reinforcement Learning
    Sui, Zezhi
    Pu, Zhiqiang
    Yi, Jianqiang
    Xiong, Tianyi
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [4] An Aircraft Collision Avoidance Method Based on Deep Reinforcement Learning
    Liu, Zuocheng
    Neretin, Evgeny
    Gao, Xiaoguang
    Wan, Kaifang
    [J]. 2024 9TH INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTICS ENGINEERING, ICCRE 2024, 2024, : 241 - 246
  • [5] Vision-guided Collision Avoidance through Deep Reinforcement Learning
    Song, Sirui
    Zhang, Yuanhang
    Qin, Xi
    Saunders, Kirk
    Liu, Jundong
    [J]. PROCEEDINGS OF THE 2021 IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE (NAECON), 2021, : 191 - 194
  • [6] AUV Obstacle Avoidance Planning Based on Deep Reinforcement Learning
    Yuan, Jianya
    Wang, Hongjian
    Zhang, Honghan
    Lin, Changjian
    Yu, Dan
    Li, Chengfeng
    [J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2021, 9 (11)
  • [7] Collision avoidance controller for AUV systems using stochastic real value reinforcement learning method
    Sayyaadi, H
    Ura, T
    Fujii, T
    [J]. SICE 2000: PROCEEDINGS OF THE 39TH SICE ANNUAL CONFERENCE, INTERNATIONAL SESSION PAPERS, 2000, : 165 - 170
  • [8] Deep Reinforcement Learning for Collision Avoidance of Robotic Manipulators
    Sangiovanni, Bianca
    Rendiniello, Angelo
    Incremona, Gian Paolo
    Ferrara, Antonella
    Piastra, Marco
    [J]. 2018 EUROPEAN CONTROL CONFERENCE (ECC), 2018, : 2063 - 2068
  • [9] Pedestrian Collision Avoidance Using Deep Reinforcement Learning
    Alireza Rafiei
    Amirhossein Oliaei Fasakhodi
    Farshid Hajati
    [J]. International Journal of Automotive Technology, 2022, 23 : 613 - 622
  • [10] Pedestrian Collision Avoidance Using Deep Reinforcement Learning
    Rafiei, Alireza
    Fasakhodi, Amirhossein Oliaei
    Hajati, Farshid
    [J]. INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2022, 23 (03) : 613 - 622