UAV Maneuvering Target Tracking in Uncertain Environments Based on Deep Reinforcement Learning and Meta-Learning

Cited by: 47
Authors
Li, Bo [1]
Gan, Zhigang [1]
Chen, Daqing [2]
Sergey Aleksandrovich, Dyachenko [3]
Affiliations
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
[2] London South Bank Univ, Sch Engn, London SE1 0AA, England
[3] Moscow Inst Aviat Technol, Sch Robot & Intelligent Syst, Moscow 125993, Russia
Keywords
UAV; maneuvering target tracking; deep reinforcement learning; meta-learning; multi-tasks; SYSTEM;
DOI
10.3390/rs12223789
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
This paper combines deep reinforcement learning (DRL) with meta-learning and proposes a novel approach, named meta twin delayed deep deterministic policy gradient (Meta-TD3), for controlling an unmanned aerial vehicle (UAV) so that it can quickly track a target whose motion is uncertain. The approach can be applied to a variety of scenarios, such as wildlife protection, emergency aid, and remote sensing. We use a multi-task experience replay buffer to provide data for the multi-task learning of the DRL algorithm, and we combine it with meta-learning to develop a multi-task reinforcement learning update method that ensures the generalization capability of reinforcement learning. Compared with the state-of-the-art algorithms, namely the deep deterministic policy gradient (DDPG) and the twin delayed deep deterministic policy gradient (TD3), experimental results show that Meta-TD3 achieves a substantial improvement in both convergence value and convergence rate. In a UAV target tracking problem, Meta-TD3 requires only a few training steps to enable a UAV to adapt quickly to a new target movement mode and to track the target more effectively.
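The abstract does not describe how the multi-task experience replay buffer is built. The following is only a minimal sketch of one plausible layout, assuming one bounded queue of transitions per task (here, per target movement mode) and uniform sampling within each task. The class name `MultiTaskReplayBuffer`, its methods, and the task labels `"straight"` and `"circle"` are hypothetical and are not taken from the paper.

```python
import random
from collections import deque


class MultiTaskReplayBuffer:
    """Hypothetical per-task replay buffer; not the paper's implementation."""

    def __init__(self, task_ids, capacity=10000, seed=0):
        # One bounded FIFO queue of transitions per task.
        self.buffers = {t: deque(maxlen=capacity) for t in task_ids}
        self.rng = random.Random(seed)

    def add(self, task_id, state, action, reward, next_state, done):
        # Store one transition under the task that produced it.
        self.buffers[task_id].append((state, action, reward, next_state, done))

    def sample(self, task_id, batch_size):
        # Uniformly sample up to batch_size transitions from one task.
        buf = self.buffers[task_id]
        k = min(batch_size, len(buf))
        return self.rng.sample(list(buf), k)

    def sample_all(self, batch_size):
        # One mini-batch per non-empty task, e.g. for a multi-task update.
        return {t: self.sample(t, batch_size)
                for t, buf in self.buffers.items() if buf}


# Usage: fill two assumed tasks with dummy transitions and draw batches.
buffer = MultiTaskReplayBuffer(["straight", "circle"], capacity=100)
for step in range(5):
    buffer.add("straight", [step], [0.1], 1.0, [step + 1], False)
    buffer.add("circle", [step], [-0.1], 0.5, [step + 1], False)
batches = buffer.sample_all(batch_size=3)
```

Keeping transitions separated by task lets a multi-task update draw a balanced batch from every movement mode, which is one way such a buffer could support the generalization goal the abstract mentions.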
Pages: 1-20
Page count: 20