Target Tracking Control of a Biomimetic Underwater Vehicle Through Deep Reinforcement Learning

被引:31
|
作者
Wang, Yu [1 ]
Tang, Chong [2 ,3 ]
Wang, Shuo [1 ,4 ,5 ]
Cheng, Long [1 ]
Wang, Rui [1 ]
Tan, Min [1 ,4 ]
Hou, Zengguang [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] NUCTECH Co Ltd, Beijing 100084, Peoples R China
[3] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[4] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[5] CAS Ctr Excellence Brain Sci & Intelligence Techn, Shanghai 200031, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Target tracking; Robots; Sports; Aerospace electronics; Mobile robots; Underwater vehicles; Biomimetic underwater vehicle (BUV); reinforcement learning; target tracking control; MOVING-TARGET; MOBILE ROBOT;
D O I
10.1109/TNNLS.2021.3054402
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, the underwater target tracking control problem of a biomimetic underwater vehicle (BUV) is addressed. Since it is difficult to build an effective mathematic model of a BUV due to the uncertainty of hydrodynamics, target tracking control is converted into the Markov decision process and is further achieved via deep reinforcement learning. The system state and reward function of underwater target tracking control are described. Based on the actor-critic reinforcement learning framework, the deep deterministic policy gradient actor-critic algorithm with supervision controller is proposed. The training tricks, including prioritized experience replay, actor network indirect supervision training, target network updating with different periods, and expansion of exploration space by applying random noise, are presented. Indirect supervision training is designed to address the issues of low stability and slow convergence of reinforcement learning in the continuous state and action space. Comparative simulations are performed to show the effectiveness of the training tricks. Finally, the proposed actor-critic reinforcement learning algorithm with supervision controller is applied to the physical BUV. Swimming pool experiments of underwater object tracking of the BUV are conducted in multiple scenarios to verify the effectiveness and robustness of the proposed method.
引用
收藏
页码:3741 / 3752
页数:12
相关论文
共 50 条
  • [1] Position and Attitude Tracking Control of a Biomimetic Underwater Vehicle via Deep Reinforcement Learning
    Ma, Ruichen
    Wang, Yu
    Tang, Chong
    Wang, Shuo
    Wang, Rui
    [J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2023, 28 (05) : 2810 - 2819
  • [2] Dynamic Target Tracking of Autonomous Underwater Vehicle Based on Deep Reinforcement Learning
    Shi, Jiaxiang
    Fang, Jianer
    Zhang, Qizhong
    Wu, Qiuxuan
    Zhang, Botao
    Gao, Farong
    [J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (10)
  • [3] Target Tracking Control of UAV Through Deep Reinforcement Learning
    Ma, Bodi
    Liu, Zhenbao
    Zhao, Wen
    Yuan, Jinbiao
    Long, Hao
    Wang, Xiao
    Yuan, Zhirong
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (06) : 5983 - 6000
  • [4] Deep Reinforcement Learning Based Optimal Trajectory Tracking Control of Autonomous Underwater Vehicle
    Yu, Runsheng
    Shi, Zhenyu
    Huang, Chaoxing
    Li, Tenglong
    Ma, Qiongxiong
    [J]. PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 4958 - 4965
  • [5] Three-Dimensional Path Tracking Control of Autonomous Underwater Vehicle Based on Deep Reinforcement Learning
    Sun, Yushan
    Zhang, Chenming
    Zhang, Guocheng
    Xu, Hao
    Ran, Xiangrui
    [J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2019, 7 (12)
  • [6] Adversarial deep reinforcement learning based robust depth tracking control for underactuated autonomous underwater vehicle
    Wang, Zhao
    Xiang, Xianbo
    Duan, Yu
    Yang, Shaolong
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 130
  • [7] Optimal measurement strategies for target tracking by a biomimetic underwater vehicle
    Guo, Jenhwa
    [J]. OCEAN ENGINEERING, 2008, 35 (5-6) : 473 - 483
  • [8] Optimal measurement strategies for target tracking by a biomimetic underwater vehicle
    Guo, Jenhwa
    Tsai, Jin-Fa
    Chiu, Forng-Chen
    Lee, Ya-Jung
    [J]. PROCEEDINGS OF THE SEVENTEENTH (2007) INTERNATIONAL OFFSHORE AND POLAR ENGINEERING CONFERENCE, VOL 1- 4, PROCEEDINGS, 2007, : 1155 - +
  • [9] Guidance and control of autonomous surface underwater vehicles for target tracking in ocean environment by deep reinforcement learning
    Song, Dalei
    Gan, Wenhao
    Yao, Peng
    Zang, Wenchuan
    Zhang, Zhixuan
    Qu, Xiuqing
    [J]. OCEAN ENGINEERING, 2022, 250
  • [10] Deep Reinforcement Learning for Vectored Thruster Autonomous Underwater Vehicle Control
    Liu, Tao
    Hu, Yuli
    Xu, Hui
    [J]. COMPLEXITY, 2021, 2021