UAV Maneuvering Target Tracking in Uncertain Environments Based on Deep Reinforcement Learning and Meta-Learning

被引:47
|
作者
Li, Bo [1 ]
Gan, Zhigang [1 ]
Chen, Daqing [2 ]
Sergey Aleksandrovich, Dyachenko [3 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
[2] London South Bank Univ, Sch Engn, London SE1 0AA, England
[3] Moscow Inst Aviat Technol, Sch Robot & Intelligent Syst, Moscow 125993, Russia
关键词
UAV; maneuvering target tracking; deep reinforcement learning; meta-learning; multi-tasks; SYSTEM;
D O I
10.3390/rs12223789
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This paper combines deep reinforcement learning (DRL) with meta-learning and proposes a novel approach, named meta twin delayed deep deterministic policy gradient (Meta-TD3), to realize the control of unmanned aerial vehicle (UAV), allowing a UAV to quickly track a target in an environment where the motion of a target is uncertain. This approach can be applied to a variety of scenarios, such as wildlife protection, emergency aid, and remote sensing. We consider a multi-task experience replay buffer to provide data for the multi-task learning of the DRL algorithm, and we combine meta-learning to develop a multi-task reinforcement learning update method to ensure the generalization capability of reinforcement learning. Compared with the state-of-the-art algorithms, namely the deep deterministic policy gradient (DDPG) and twin delayed deep deterministic policy gradient (TD3), experimental results show that the Meta-TD3 algorithm has achieved a great improvement in terms of both convergence value and convergence rate. In a UAV target tracking problem, Meta-TD3 only requires a few steps to train to enable a UAV to adapt quickly to a new target movement mode more and maintain a better tracking effectiveness.
引用
收藏
页码:1 / 20
页数:20
相关论文
共 50 条
  • [21] Deep Reinforcement Learning-Based End-to-End Control for UAV Dynamic Target Tracking
    Zhao, Jiang
    Liu, Han
    Sun, Jiaming
    Wu, Kun
    Cai, Zhihao
    Ma, Yan
    Wang, Yingxun
    BIOMIMETICS, 2022, 7 (04)
  • [22] Deep Reinforcement Learning for Active Target Tracking
    Jeong, Heejin
    Hassani, Hamed
    Morari, Manfred
    Lee, Daniel D.
    Pappas, George J.
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 1825 - 1831
  • [23] Research on Deep Learning Methods of UUV Maneuvering Target Tracking
    Dai, Tao
    Wang, Hongjian
    Ruan, Li
    Tong, Haiyan
    Wang, Haibin
    GLOBAL OCEANS 2020: SINGAPORE - U.S. GULF COAST, 2020,
  • [24] Image-Based Deep Reinforcement Meta-Learning for Autonomous Lunar Landing
    Scorsoglio, Andrea
    D'Ambrosio, Andrea
    Ghilardi, Luca
    Gaudet, Brian
    Curti, Fabio
    Furfaro, Roberto
    JOURNAL OF SPACECRAFT AND ROCKETS, 2022, 59 (01) : 153 - 165
  • [25] Deep Reinforcement Meta-learning Guidance with Impact Angle Constraint
    Liang C.
    Wang W.-H.
    Lai C.
    Yuhang Xuebao/Journal of Astronautics, 2021, 42 (05): : 611 - 620
  • [26] Visual Active Tracking Algorithm for UAV Cluster Based on Deep Reinforcement Learning
    Hu, Runqiao
    Wang, Shaofan
    Li, Ke
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 1047 - 1061
  • [27] Vision Based UAV Tracking Using Deep Reinforcement Learning With Simulated Data
    Zaier, Mayssa
    Miled, Wided
    Akhloufi, Moulay A.
    AUTONOMOUS SYSTEMS: SENSORS, PROCESSING AND SECURITY FOR GROUND, AIR, SEA AND SPACE VEHICLES AND INFRASTRUCTURE 2022, 2022, 12115
  • [28] Deep reinforcement learning based missile guidance law design for maneuvering target interception
    Du, Mingjian
    Peng, Chi
    Ma, Jianjun
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 3733 - 3738
  • [29] A Deep Reinforcement Learning Framework for UAV Navigation in Indoor Environments
    Walker, Ory
    Vanegas, Fernando
    Gonzalez, Felipe
    Koenig, Sven
    2019 IEEE AEROSPACE CONFERENCE, 2019,
  • [30] Research on Target Capturing of UAV Circumnavigation Formation Based on Deep Reinforcement Learning
    Xia, Qianxin
    Li, Peng
    Shi, Xufeng
    Li, Qian
    Cai, Weijun
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 3751 - 3762