Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning

被引:38
|
作者
Zhou, Wenhong [1 ]
LI, Jie [1 ]
Liu, Zhihong [1 ]
Shen, Lincheng [1 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
基金
中国国家自然科学基金;
关键词
Decentralized cooperation; Maximum reciprocal reward; Multi-agent actor-critic; Pointwise mutual informa-; Reinforcement learning; ALGORITHMS; SEARCH; ROBOTS; GAMES;
D O I
10.1016/j.cja.2021.09.008
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Multi-Target Tracking Guidance (MTTG) in unknown environments has great potential values in applications for Unmanned Aerial Vehicle (UAV) swarms. Although Multi-Agent Deep Reinforcement Learning (MADRL) is a promising technique for learning cooperation, most of the existing methods cannot scale well to decentralized UAV swarms due to their computational complexity or global information requirement. This paper proposes a decentralized MADRL method using the maximum reciprocal reward to learn cooperative tracking policies for UAV swarms. This method reshapes each UAV's reward with a regularization term that is defined as the dot product of the reward vector of all neighbor UAVs and the corresponding dependency vector between the UAV and the neighbors. And the dependence between UAVs can be directly captured by the Pointwise Mutual Information (PMI) neural network without complicated aggregation statistics. Then, the experience sharing Reciprocal Reward Multi-Agent Actor-Critic (MAAC-R) algorithm is proposed to learn the cooperative sharing policy for all homogeneous UAVs. Experiments demonstrate that the proposed algorithm can improve the UAVs' cooperation more effectively than the baseline algorithms, and can stimulate a rich form of cooperative tracking behaviors of UAV swarms. Besides, the learned policy can better scale to other scenarios with more UAVs and targets. (c) 2021 Chinese Society of Aeronautics and Astronautics. Production and hosting by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:100 / 112
页数:13
相关论文
共 50 条
  • [1] Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning
    Wenhong ZHOU
    Jie LI
    Zhihong LIU
    Lincheng SHEN
    Chinese Journal of Aeronautics, 2022, 35 (07) : 100 - 112
  • [2] Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning
    Wenhong ZHOU
    Jie LI
    Zhihong LIU
    Lincheng SHEN
    Chinese Journal of Aeronautics, 2022, (07) : 100 - 112
  • [3] Improving Cooperative Multi-Target Tracking Control for UAV Swarm Using Multi-Agent Reinforcement Learning
    Yue, Longfei
    Lv, Maolong
    Yan, Mengda
    Zhao, Xiaoru
    Wu, Ao
    Li, Leyan
    Zuo, Jialiang
    2023 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS, ICCAR, 2023, : 179 - 186
  • [4] Factored Multi-Agent Soft Actor-Critic for Cooperative Multi-Target Tracking of UAV Swarms
    Yue, Longfei
    Yang, Rennong
    Zuo, Jialiang
    Yan, Mengda
    Zhao, Xiaoru
    Lv, Maolong
    DRONES, 2023, 7 (03)
  • [5] Joint Communication and Action Learning in Multi-Target Tracking of UAV Swarms with Deep Reinforcement Learning
    Zhou, Wenhong
    Li, Jie
    Zhang, Qingjie
    DRONES, 2022, 6 (11)
  • [6] Multi-Target Pursuit by a Decentralized Heterogeneous UAV Swarm using Deep Multi-Agent Reinforcement Learning
    Kouzeghar, Maryam
    Song, Youngbin
    Meghjani, Malika
    Bouffanais, Roland
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 3289 - 3295
  • [7] UAV Swarm Cooperative Target Search: A Multi-Agent Reinforcement Learning Approach
    Hou, Yukai
    Zhao, Jin
    Zhang, Rongqing
    Cheng, Xiang
    Yang, Liuqing
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 568 - 578
  • [8] Cooperative multi-target hunting by unmanned surface vehicles based on multi-agent reinforcement learning
    Xia, Jiawei
    Luo, Yasong
    Liu, Zhikun
    Zhang, Yalun
    Shi, Haoran
    Liu, Zhong
    DEFENCE TECHNOLOGY, 2023, 29 : 80 - 94
  • [9] Cooperative multi-target hunting by unmanned surface vehicles based on multi-agent reinforcement learning
    Jiawei Xia
    Yasong Luo
    Zhikun Liu
    Yalun Zhang
    Haoran Shi
    Zhong Liu
    Defence Technology, 2023, 29 (11) : 80 - 94
  • [10] Multi-Agent Reinforcement Learning Aided Intelligent UAV Swarm for Target Tracking
    Xia, Zhaoyue
    Du, Jun
    Wang, Jingjing
    Jiang, Chunxiao
    Ren, Yong
    Li, Gang
    Han, Zhu
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (01) : 931 - 945