Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning

被引：38

作者：

Zhou, Wenhong ^{[1
]}

LI, Jie ^{[1
]}

Liu, Zhihong ^{[1
]}

Shen, Lincheng ^{[1
]}

机构：

[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China

来源：

CHINESE JOURNAL OF AERONAUTICS | 2022年 / 35卷 / 07期

基金：

中国国家自然科学基金;

关键词：

Decentralized cooperation; Maximum reciprocal reward; Multi-agent actor-critic; Pointwise mutual informa-; Reinforcement learning; ALGORITHMS; SEARCH; ROBOTS; GAMES;

D O I：

10.1016/j.cja.2021.09.008

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

Multi-Target Tracking Guidance (MTTG) in unknown environments has great potential values in applications for Unmanned Aerial Vehicle (UAV) swarms. Although Multi-Agent Deep Reinforcement Learning (MADRL) is a promising technique for learning cooperation, most of the existing methods cannot scale well to decentralized UAV swarms due to their computational complexity or global information requirement. This paper proposes a decentralized MADRL method using the maximum reciprocal reward to learn cooperative tracking policies for UAV swarms. This method reshapes each UAV's reward with a regularization term that is defined as the dot product of the reward vector of all neighbor UAVs and the corresponding dependency vector between the UAV and the neighbors. And the dependence between UAVs can be directly captured by the Pointwise Mutual Information (PMI) neural network without complicated aggregation statistics. Then, the experience sharing Reciprocal Reward Multi-Agent Actor-Critic (MAAC-R) algorithm is proposed to learn the cooperative sharing policy for all homogeneous UAVs. Experiments demonstrate that the proposed algorithm can improve the UAVs' cooperation more effectively than the baseline algorithms, and can stimulate a rich form of cooperative tracking behaviors of UAV swarms. Besides, the learned policy can better scale to other scenarios with more UAVs and targets. (c) 2021 Chinese Society of Aeronautics and Astronautics. Production and hosting by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

引用

页码：100 / 112

页数：13

共 50 条

[1] Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning
Wenhong ZHOU
Jie LI
Zhihong LIU
Lincheng SHEN
Chinese Journal of Aeronautics, 2022, 35 (07) : 100 - 112
[2] Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning
Wenhong ZHOU
Jie LI
Zhihong LIU
Lincheng SHEN
Chinese Journal of Aeronautics, 2022, (07) : 100 - 112
[3] Improving Cooperative Multi-Target Tracking Control for UAV Swarm Using Multi-Agent Reinforcement Learning
Yue, Longfei
Lv, Maolong
Yan, Mengda
Zhao, Xiaoru
Wu, Ao
Li, Leyan
Zuo, Jialiang
2023 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS, ICCAR, 2023, : 179 - 186
[4] Factored Multi-Agent Soft Actor-Critic for Cooperative Multi-Target Tracking of UAV Swarms
Yue, Longfei
Yang, Rennong
Zuo, Jialiang
Yan, Mengda
Zhao, Xiaoru
Lv, Maolong
DRONES, 2023, 7 (03)
[5] Joint Communication and Action Learning in Multi-Target Tracking of UAV Swarms with Deep Reinforcement Learning
Zhou, Wenhong
Li, Jie
Zhang, Qingjie
DRONES, 2022, 6 (11)
[6] Multi-Target Pursuit by a Decentralized Heterogeneous UAV Swarm using Deep Multi-Agent Reinforcement Learning
Kouzeghar, Maryam
Song, Youngbin
Meghjani, Malika
Bouffanais, Roland
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 3289 - 3295
[7] UAV Swarm Cooperative Target Search: A Multi-Agent Reinforcement Learning Approach
Hou, Yukai
Zhao, Jin
Zhang, Rongqing
Cheng, Xiang
Yang, Liuqing
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 568 - 578
[8] Cooperative multi-target hunting by unmanned surface vehicles based on multi-agent reinforcement learning
Xia, Jiawei
Luo, Yasong
Liu, Zhikun
Zhang, Yalun
Shi, Haoran
Liu, Zhong
DEFENCE TECHNOLOGY, 2023, 29 : 80 - 94
[9] Cooperative multi-target hunting by unmanned surface vehicles based on multi-agent reinforcement learning
Jiawei Xia
Yasong Luo
Zhikun Liu
Yalun Zhang
Haoran Shi
Zhong Liu
Defence Technology, 2023, 29 (11) : 80 - 94
[10] Multi-Agent Reinforcement Learning Aided Intelligent UAV Swarm for Target Tracking
Xia, Zhaoyue
Du, Jun
Wang, Jingjing
Jiang, Chunxiao
Ren, Yong
Li, Gang
Han, Zhu
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (01) : 931 - 945

← 1 2 3 4 5 →