UAV Maneuvering Target Tracking in Uncertain Environments Based on Deep Reinforcement Learning and Meta-Learning

被引：47

作者：

Li, Bo ^{[1
]}

Gan, Zhigang ^{[1
]}

Chen, Daqing ^{[2
]}

Sergey Aleksandrovich, Dyachenko ^{[3
]}

机构：

[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China

[2] London South Bank Univ, Sch Engn, London SE1 0AA, England

[3] Moscow Inst Aviat Technol, Sch Robot & Intelligent Syst, Moscow 125993, Russia

来源：

REMOTE SENSING | 2020年 / 12卷 / 22期

关键词：

UAV; maneuvering target tracking; deep reinforcement learning; meta-learning; multi-tasks; SYSTEM;

D O I：

10.3390/rs12223789

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

This paper combines deep reinforcement learning (DRL) with meta-learning and proposes a novel approach, named meta twin delayed deep deterministic policy gradient (Meta-TD3), to realize the control of unmanned aerial vehicle (UAV), allowing a UAV to quickly track a target in an environment where the motion of a target is uncertain. This approach can be applied to a variety of scenarios, such as wildlife protection, emergency aid, and remote sensing. We consider a multi-task experience replay buffer to provide data for the multi-task learning of the DRL algorithm, and we combine meta-learning to develop a multi-task reinforcement learning update method to ensure the generalization capability of reinforcement learning. Compared with the state-of-the-art algorithms, namely the deep deterministic policy gradient (DDPG) and twin delayed deep deterministic policy gradient (TD3), experimental results show that the Meta-TD3 algorithm has achieved a great improvement in terms of both convergence value and convergence rate. In a UAV target tracking problem, Meta-TD3 only requires a few steps to train to enable a UAV to adapt quickly to a new target movement mode more and maintain a better tracking effectiveness.

引用

页码：1 / 20

页数：20

共 50 条

[31] A Deep Reinforcement Learning Visual Servoing Control Strategy for Target Tracking Using a Multirotor UAV
Mitakidis, Andreas
Aspragkathos, Sotirios N.
Panetsos, Fotis
Karras, George C.
Kyriakopoulos, Kostas J.
2023 9TH INTERNATIONAL CONFERENCE ON AUTOMATION, ROBOTICS AND APPLICATIONS, ICARA, 2023, : 219 - 224
[32] Meta-Learning for Multi-objective Reinforcement Learning
Chen, Xi
Ghadirzadeh, Ali
Bjorkman, Marten
Jensfelt, Patric
2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 977 - 983
[33] Learn to chill - Intelligent Chiller Scheduling using Meta-learning and Deep Reinforcement Learning
Manoharan, Praveen
Venkat, Malini Pooni
Nagarathinam, Srinarayana
Vasan, Arunchandar
BUILDSYS'21: PROCEEDINGS OF THE 2021 ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILT ENVIRONMENTS, 2021, : 21 - 30
[34] A survey of deep meta-learning
Mike Huisman
Jan N. van Rijn
Aske Plaat
Artificial Intelligence Review, 2021, 54 : 4483 - 4541
[35] A survey of deep meta-learning
Huisman, Mike
van Rijn, Jan N.
Plaat, Aske
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (06) : 4483 - 4541
[36] Multi-UAV Target-Finding in Simulated Indoor Environments using Deep Reinforcement Learning
Walker, Ory
Vanegas, Fernando
Gonzalez, Felipe
Koenig, Sven
2020 IEEE AEROSPACE CONFERENCE (AEROCONF 2020), 2020,
[37] UAV navigation in high dynamic environments: A deep reinforcement learning approach
Guo, Tong
Jiang, Nan
Li, Biyue
Zhu, Xi
Wang, Ya
Du, Wenbo
CHINESE JOURNAL OF AERONAUTICS, 2021, 34 (02) : 479 - 489
[38] UAV navigation in high dynamic environments:A deep reinforcement learning approach
Tong GUO
Nan JIANG
Biyue LI
Xi ZHU
Ya WANG
Wenbo DU
Chinese Journal of Aeronautics, 2021, 34 (02) : 479 - 489
[39] UAV Autonomous Target Search Based on Deep Reinforcement Learning in Complex Disaster Scene
Wu, Chunxue
Ju, Bobo
Wu, Yan
Lin, Xiao
Xiong, Naixue
Xu, Guangquan
Li, Hongyan
Liang, Xuefeng
IEEE ACCESS, 2019, 7 : 117227 - 117245
[40] Dynamic Target Tracking of Autonomous Underwater Vehicle Based on Deep Reinforcement Learning
Shi, Jiaxiang
Fang, Jianer
Zhang, Qizhong
Wu, Qiuxuan
Zhang, Botao
Gao, Farong
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (10)

← 1 2 3 4 5 →