Deep Reinforcement Learning for Continuous Docking Control of Autonomous Underwater Vehicles: A Benchmarking Study

被引：0

作者：

Patil, Mihir ^{[1
,2
]}

Wehbe, Bilal ^{[1
]}

Valdenegro-Toro, Matias ^{[1
]}

机构：

[1] Bonn Rhein Sieg Univ Appl Sci, D-53757 St Augustin, Germany

[2] German Res Ctr Artificial Intelligence, D-28359 Bremen, Germany

来源：

OCEANS 2021: SAN DIEGO - PORTO | 2021年

关键词：

D O I：

暂无

中图分类号：

U6 [水路运输]; P75 [海洋工程];

学科分类号：

0814 ; 081505 ; 0824 ; 082401 ;

摘要：

Docking control of an autonomous underwater vehicle (AUV) is a task that is integral to achieving persistent long term autonomy. This work explores the application of state-of-the-art model-free deep reinforcement learning (DRL) approaches to the task of AUV docking in the continuous domain. We provide a detailed formulation of the reward function, utilized to successfully dock the AUV onto a fixed docking platform. A major contribution that distinguishes our work from the previous approaches is the usage of a physics simulator to define and simulate the underwater environment as well as the DeepLeng AUV. We propose a new reward function formulation for the docking task, incorporating several components, that outperforms previous reward formulations. We evaluate proximal policy optimization (PPO), twin delayed deep deterministic policy gradients (TD3) and soft actor-critic (SAC) in combination with our reward function. Our evaluation yielded results that conclusively show the TD3 agent to be most efficient and consistent in terms of docking the AUV, over multiple evaluation runs it achieved a 100% success rate and episode return of 10667.1 +/- 688.8. We also show how our reward function formulation improves over the state of the art.

引用

页数：7

共 50 条

[1] Benchmarking Deep Reinforcement Learning for Continuous Control
Duan, Yan
Chen, Xi
Houthooft, Rein
Schulman, John
Abbeel, Pieter
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
[2] Docking Control of an Autonomous Underwater Vehicle Using Reinforcement Learning
Anderlini, Enrico
Parker, Gordon G.
Thomas, Giles
[J]. APPLIED SCIENCES-BASEL, 2019, 9 (17):
[3] Reinforcement learning: The application to autonomous biomimetic underwater vehicles control
Magalhaes, J.
Damas, B.
Lobo, V.
[J]. 4TH INTERNATIONAL SCIENTIFIC CONFERENCE SEA-CONF 2018, 2018, 172
[4] Continuous Control of Autonomous Vehicles using Plan-assisted Deep Reinforcement Learning
Dwivedi, Tanay
Betz, Tobias
Sauerbeck, Florian
Manivannan, P., V
Lienkamp, Markus
[J]. 2022 22ND INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2022), 2022, : 244 - 250
[5] Trajectory tracking control of vectored thruster autonomous underwater vehicles based on deep reinforcement learning
Liu, Tao
Zhao, Jintao
Hu, Yuli
Huang, Junhao
[J]. SHIPS AND OFFSHORE STRUCTURES, 2024,
[6] Adaptive low-level control of autonomous underwater vehicles using deep reinforcement learning
Carlucho, Ignacio
De Paula, Mariano
Wang, Sen
Petillot, Yvan
Acosta, Gerardo G.
[J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 107 : 71 - 86
[7] Design of formation control algorithm for multiple autonomous underwater vehicles based on deep reinforcement learning
Yan J.
Xu L.
Cao W.-Q.
Yang X.
Luo X.-Y.
[J]. Kongzhi yu Juece/Control and Decision, 2023, 38 (05): : 1457 - 1463
[8] Adaptive Formation Motion Planning and Control of Autonomous Underwater Vehicles Using Deep Reinforcement Learning
Hadi, Behnaz
Khosravi, Alireza
Sarhadi, Pouria
[J]. IEEE JOURNAL OF OCEANIC ENGINEERING, 2024, 49 (01) : 311 - 328
[9] The docking control system of an autonomous underwater vehicle combining intelligent object recognition and deep reinforcement learning
Yu, Chao-Ming
Lin, Yu-Hsien
[J]. Engineering Applications of Artificial Intelligence, 2025, 139
[10] Deep reinforcement learning based control for Autonomous Vehicles in CARLA
Perez-Gil, Oscar
Barea, Rafael
Lopez-Guillen, Elena
Bergasa, Luis M.
Gomez-Huelamo, Carlos
Gutierrez, Rodrigo
Diaz-Diaz, Alejandro
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (03) : 3553 - 3576

← 1 2 3 4 5 →