UAV Maneuvering Target Tracking in Uncertain Environments Based on Deep Reinforcement Learning and Meta-Learning

被引:47
|
作者
Li, Bo [1 ]
Gan, Zhigang [1 ]
Chen, Daqing [2 ]
Sergey Aleksandrovich, Dyachenko [3 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
[2] London South Bank Univ, Sch Engn, London SE1 0AA, England
[3] Moscow Inst Aviat Technol, Sch Robot & Intelligent Syst, Moscow 125993, Russia
关键词
UAV; maneuvering target tracking; deep reinforcement learning; meta-learning; multi-tasks; SYSTEM;
D O I
10.3390/rs12223789
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This paper combines deep reinforcement learning (DRL) with meta-learning and proposes a novel approach, named meta twin delayed deep deterministic policy gradient (Meta-TD3), to realize the control of unmanned aerial vehicle (UAV), allowing a UAV to quickly track a target in an environment where the motion of a target is uncertain. This approach can be applied to a variety of scenarios, such as wildlife protection, emergency aid, and remote sensing. We consider a multi-task experience replay buffer to provide data for the multi-task learning of the DRL algorithm, and we combine meta-learning to develop a multi-task reinforcement learning update method to ensure the generalization capability of reinforcement learning. Compared with the state-of-the-art algorithms, namely the deep deterministic policy gradient (DDPG) and twin delayed deep deterministic policy gradient (TD3), experimental results show that the Meta-TD3 algorithm has achieved a great improvement in terms of both convergence value and convergence rate. In a UAV target tracking problem, Meta-TD3 only requires a few steps to train to enable a UAV to adapt quickly to a new target movement mode more and maintain a better tracking effectiveness.
引用
收藏
页码:1 / 20
页数:20
相关论文
共 50 条
  • [41] Sensor Management Method Based on Deep Reinforcement Learning in Extended Target Tracking
    Zhang, Hong-Yun
    Chen, Hui
    Zhang, Wen-Xu
    Zidonghua Xuebao/Acta Automatica Sinica, 2024, 50 (07): : 1417 - 1431
  • [42] Deep Reinforcement Learning Based Radar Parameter Adaptation for Multiple Target Tracking
    Huang Y.
    Guo R.
    Zhang Y.
    Chen Z.
    IEEE Transactions on Aerospace and Electronic Systems, 2024, 60 (05) : 1 - 18
  • [43] Target Tracking and Path Planning of Mobile Sensor Based on Deep Reinforcement Learning
    Zhang, Kun
    Hu, Yuanjiang
    Huang, Deqing
    Yin, Zijie
    2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 190 - 195
  • [44] SAR Target Recognition Based on Probabilistic Meta-Learning
    Wang, Ke
    Zhang, Gong
    Xu, Yanbing
    Leung, Henry
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (04) : 682 - 686
  • [45] UAV Dynamic Object Tracking with Lightweight Deep Vision Reinforcement Learning
    Nguyen, Hy
    Thudumu, Srikanth
    Du, Hung
    Mouzakis, Kon
    Vasa, Rajesh
    ALGORITHMS, 2023, 16 (05)
  • [46] DeepMTT: A deep learning maneuvering target-tracking algorithm based on bidirectional LSTM network
    Liu, Jingxian
    Wang, Zulin
    Xu, Mai
    INFORMATION FUSION, 2020, 53 : 289 - 304
  • [47] Autonomous UAV-based Target Search, Tracking and Following using Reinforcement Learning and YOLOFlow
    Ajmera, Yug
    Singh, Surya Pratap
    2020 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR 2020), 2020, : 15 - 20
  • [48] Multi-UAV simultaneous target assignment and path planning based on deep reinforcement learning in dynamic multiple obstacles environments
    Kong, Xiaoran
    Zhou, Yatong
    Li, Zhe
    Wang, Shaohai
    FRONTIERS IN NEUROROBOTICS, 2024, 17
  • [49] UAV maneuvering decision-making algorithm based on deep reinforcement learning under the guidance of expert experience
    ZHAN Guang
    ZHANG Kun
    LI Ke
    PIAO Haiyin
    Journal of Systems Engineering and Electronics, 2024, (03) : 644 - 665
  • [50] UAV Maneuvering Decision-Making Algorithm Based on Deep Reinforcement Learning Under the Guidance of Expert Experience
    Zhan, Guang
    Zhang, Kun
    Li, Ke
    Piao, Haiyin
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2024, 35 (03) : 644 - 665