Deep Reinforcement Learning for Continuous Docking Control of Autonomous Underwater Vehicles: A Benchmarking Study

被引:0
|
作者
Patil, Mihir [1 ,2 ]
Wehbe, Bilal [1 ]
Valdenegro-Toro, Matias [1 ]
机构
[1] Bonn Rhein Sieg Univ Appl Sci, D-53757 St Augustin, Germany
[2] German Res Ctr Artificial Intelligence, D-28359 Bremen, Germany
关键词
D O I
暂无
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Docking control of an autonomous underwater vehicle (AUV) is a task that is integral to achieving persistent long term autonomy. This work explores the application of state-of-the-art model-free deep reinforcement learning (DRL) approaches to the task of AUV docking in the continuous domain. We provide a detailed formulation of the reward function, utilized to successfully dock the AUV onto a fixed docking platform. A major contribution that distinguishes our work from the previous approaches is the usage of a physics simulator to define and simulate the underwater environment as well as the DeepLeng AUV. We propose a new reward function formulation for the docking task, incorporating several components, that outperforms previous reward formulations. We evaluate proximal policy optimization (PPO), twin delayed deep deterministic policy gradients (TD3) and soft actor-critic (SAC) in combination with our reward function. Our evaluation yielded results that conclusively show the TD3 agent to be most efficient and consistent in terms of docking the AUV, over multiple evaluation runs it achieved a 100% success rate and episode return of 10667.1 +/- 688.8. We also show how our reward function formulation improves over the state of the art.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Benchmarking Deep Reinforcement Learning for Continuous Control
    Duan, Yan
    Chen, Xi
    Houthooft, Rein
    Schulman, John
    Abbeel, Pieter
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [2] Docking Control of an Autonomous Underwater Vehicle Using Reinforcement Learning
    Anderlini, Enrico
    Parker, Gordon G.
    Thomas, Giles
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (17):
  • [3] Reinforcement learning: The application to autonomous biomimetic underwater vehicles control
    Magalhaes, J.
    Damas, B.
    Lobo, V.
    [J]. 4TH INTERNATIONAL SCIENTIFIC CONFERENCE SEA-CONF 2018, 2018, 172
  • [4] Continuous Control of Autonomous Vehicles using Plan-assisted Deep Reinforcement Learning
    Dwivedi, Tanay
    Betz, Tobias
    Sauerbeck, Florian
    Manivannan, P., V
    Lienkamp, Markus
    [J]. 2022 22ND INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2022), 2022, : 244 - 250
  • [5] Trajectory tracking control of vectored thruster autonomous underwater vehicles based on deep reinforcement learning
    Liu, Tao
    Zhao, Jintao
    Hu, Yuli
    Huang, Junhao
    [J]. SHIPS AND OFFSHORE STRUCTURES, 2024,
  • [6] Adaptive low-level control of autonomous underwater vehicles using deep reinforcement learning
    Carlucho, Ignacio
    De Paula, Mariano
    Wang, Sen
    Petillot, Yvan
    Acosta, Gerardo G.
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 107 : 71 - 86
  • [7] Design of formation control algorithm for multiple autonomous underwater vehicles based on deep reinforcement learning
    Yan J.
    Xu L.
    Cao W.-Q.
    Yang X.
    Luo X.-Y.
    [J]. Kongzhi yu Juece/Control and Decision, 2023, 38 (05): : 1457 - 1463
  • [8] Adaptive Formation Motion Planning and Control of Autonomous Underwater Vehicles Using Deep Reinforcement Learning
    Hadi, Behnaz
    Khosravi, Alireza
    Sarhadi, Pouria
    [J]. IEEE JOURNAL OF OCEANIC ENGINEERING, 2024, 49 (01) : 311 - 328
  • [9] The docking control system of an autonomous underwater vehicle combining intelligent object recognition and deep reinforcement learning
    Yu, Chao-Ming
    Lin, Yu-Hsien
    [J]. Engineering Applications of Artificial Intelligence, 2025, 139
  • [10] Deep reinforcement learning based control for Autonomous Vehicles in CARLA
    Perez-Gil, Oscar
    Barea, Rafael
    Lopez-Guillen, Elena
    Bergasa, Luis M.
    Gomez-Huelamo, Carlos
    Gutierrez, Rodrigo
    Diaz-Diaz, Alejandro
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (03) : 3553 - 3576