Target Tracking Control of a Biomimetic Underwater Vehicle Through Deep Reinforcement Learning

被引:31
|
作者
Wang, Yu [1 ]
Tang, Chong [2 ,3 ]
Wang, Shuo [1 ,4 ,5 ]
Cheng, Long [1 ]
Wang, Rui [1 ]
Tan, Min [1 ,4 ]
Hou, Zengguang [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] NUCTECH Co Ltd, Beijing 100084, Peoples R China
[3] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[4] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[5] CAS Ctr Excellence Brain Sci & Intelligence Techn, Shanghai 200031, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Target tracking; Robots; Sports; Aerospace electronics; Mobile robots; Underwater vehicles; Biomimetic underwater vehicle (BUV); reinforcement learning; target tracking control; MOVING-TARGET; MOBILE ROBOT;
D O I
10.1109/TNNLS.2021.3054402
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, the underwater target tracking control problem of a biomimetic underwater vehicle (BUV) is addressed. Since it is difficult to build an effective mathematic model of a BUV due to the uncertainty of hydrodynamics, target tracking control is converted into the Markov decision process and is further achieved via deep reinforcement learning. The system state and reward function of underwater target tracking control are described. Based on the actor-critic reinforcement learning framework, the deep deterministic policy gradient actor-critic algorithm with supervision controller is proposed. The training tricks, including prioritized experience replay, actor network indirect supervision training, target network updating with different periods, and expansion of exploration space by applying random noise, are presented. Indirect supervision training is designed to address the issues of low stability and slow convergence of reinforcement learning in the continuous state and action space. Comparative simulations are performed to show the effectiveness of the training tricks. Finally, the proposed actor-critic reinforcement learning algorithm with supervision controller is applied to the physical BUV. Swimming pool experiments of underwater object tracking of the BUV are conducted in multiple scenarios to verify the effectiveness and robustness of the proposed method.
引用
收藏
页码:3741 / 3752
页数:12
相关论文
共 50 条
  • [41] A visual tracking system for a biomimetic autonomous underwater vehicle
    Guo, JH
    Wu, CH
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON UNDERWATER TECHNOLOGY, 2004, : 317 - 322
  • [42] Research on Path Tracking Control Method of Unmanned Surface Vehicle Based on Deep Reinforcement Learning
    Guo, Rui
    Yuan, Wei
    [J]. INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2021, 2021, 11884
  • [43] Trajectory tracking control of an unmanned aerial vehicle with deep reinforcement learning for tasks inside the EAST
    Yu, Chao
    Yang, Yang
    Cheng, Yong
    Wang, Zheng
    Shi, Mingming
    [J]. FUSION ENGINEERING AND DESIGN, 2023, 194
  • [44] Deep reinforcement learning control of electric vehicle charging in the of
    Dorokhova, Marina
    Martinson, Yann
    Ballif, Christophe
    Wyrsch, Nicolas
    [J]. APPLIED ENERGY, 2021, 301
  • [45] Learning Transferable Policy in Reinforcement Learning for Vehicle Velocity Tracking Control
    Natsu, Yukiya
    Hamagami, Tomoki
    Kanke, Masayasu
    Yoshida, Kento
    Niwakawa, Makoto
    [J]. IEEJ Transactions on Electronics, Information and Systems, 2021, 141 (12) : 1492 - 1499
  • [46] A Deep Reinforcement Learning Visual Servoing Control Strategy for Target Tracking Using a Multirotor UAV
    Mitakidis, Andreas
    Aspragkathos, Sotirios N.
    Panetsos, Fotis
    Karras, George C.
    Kyriakopoulos, Kostas J.
    [J]. 2023 9TH INTERNATIONAL CONFERENCE ON AUTOMATION, ROBOTICS AND APPLICATIONS, ICARA, 2023, : 219 - 224
  • [47] Deep reinforcement learning based path tracking controller for autonomous vehicle
    Chen, I-Ming
    Chan, Ching-Yao
    [J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2021, 235 (2-3) : 541 - 551
  • [48] Vehicle Tracking in Wireless Sensor Networks via Deep Reinforcement Learning
    Li, Jun
    Xing, Zhichao
    Zhang, Weibin
    Lin, Yan
    Shu, Feng
    [J]. IEEE SENSORS LETTERS, 2020, 4 (03) : 1 - 4
  • [49] Maneuvering and control of a biomimetic autonomous underwater vehicle
    Jenhwa Guo
    [J]. Autonomous Robots, 2009, 26 : 241 - 249
  • [50] Maneuvering and control of a biomimetic autonomous underwater vehicle
    Guo, Jenhwa
    [J]. AUTONOMOUS ROBOTS, 2009, 26 (04) : 241 - 249