Cooperative Pursuit of UAV Cluster Based on Graph Embedding Reinforcement Learning

被引:1
|
作者
Guo, Wan-chun [1 ]
Xie, Wu-jie [1 ]
Dong, Wen-han [1 ]
He, Lei [1 ]
机构
[1] Air Force Engn Univ, Xian, Peoples R China
关键词
UAV cluster; mufti-Agent reinforcement learning; graph neural network;
D O I
10.1109/ICCEIC54227.2021.00032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Aiming at the cooperative behavior decision-making problem of UAV cluster cooperative pursuit of escaping UAVs, a cooperative pursuit target decision-making process model based on Dec-POMDP is established.Aiming at the uncertain and partially observable scenes caused by the limited perception and communication ability of UAV cluster system state, a dynamic graph embedding method is proposed.The cluster state embedded in the graph is represented as the network input under the AC framework. Through information fusion, the individuals in the cluster system can perceive the information of the surrounding UAVs, so as to produce the global situation.Based on the idea of centralized evaluation and distributed execution, a multi-agent strategy gradient method for double delay depth determination based on empirical playback region reconstruction is proposed.This method can be effectively combined with the graph embedding method to represent the state of cluster system. The above method is applied to the target pursuit of UAV cluster, and the learning process has good convergence.
引用
收藏
页码:123 / 128
页数:6
相关论文
共 50 条
  • [1] Reinforcement Learning-Based Cooperative Adversarial Algorithm for UAV Cluster
    Li, Yan
    Gao, Yanlong
    Dai, Xunhua
    Nian, Xiaohong
    Wang, Haibo
    Xiong, HongYun
    [J]. PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 1129 - 1138
  • [2] UAV Pursuit using Reinforcement Learning
    Bonnet, Alexandre
    Akhloufi, Moulay A.
    [J]. UNMANNED SYSTEMS TECHNOLOGY XXI, 2019, 11021
  • [3] Autonomous and cooperative control of UAV cluster with multi-agent reinforcement learning
    Xu, D.
    Chen, G.
    [J]. AERONAUTICAL JOURNAL, 2022, 126 (1300): : 932 - 951
  • [4] Autonomous and cooperative control of UAV cluster with multi-agent reinforcement learning
    Xu, D.
    Chen, G.
    [J]. Aeronautical Journal, 2022,
  • [5] The research on intelligent cooperative combat of UAV cluster with multi-agent reinforcement learning
    Xu D.
    Chen G.
    [J]. Aerospace Systems, 2022, 5 (1) : 107 - 121
  • [6] Virtual Network Embedding Based on Hierarchical Cooperative Multiagent Reinforcement Learning
    Lim, Hyun-Kyo
    Ullah, Ihsan
    Kim, Ju-Bong
    Han, Youn-Hee
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (05): : 8552 - 8568
  • [7] Research on cooperative collision avoidance problem of multiple UAV based on Reinforcement Learning
    Fang Bin
    Feng XiaoFeng
    Xu Shuo
    [J]. 2017 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION (ICICTA 2017), 2017, : 103 - 109
  • [8] A Method of Multi-UAV Cooperative Task Assignment Based on Reinforcement Learning
    Zhao, Xiaohu
    Jiang, Hanli
    An, Chenyang
    Wu, Ruocheng
    Guo, Yijun
    Yang, Daquan
    [J]. MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [9] Knowledge Distillation based Cooperative Reinforcement Learning for Connectivity Preservation in UAV Networks
    Wang, Zhihong
    Wei, Yunkai
    Wu, Fan
    [J]. 2021 6TH INTERNATIONAL CONFERENCE ON UK-CHINA EMERGING TECHNOLOGIES (UCET 2021), 2021, : 171 - 176
  • [10] Application of reinforcement learning in UAV cluster task scheduling
    Yang, Jun
    You, Xinghui
    Wu, Gaoxiang
    Hassan, Mohammad Mehedi
    Almogren, Ahmad
    Guna, Joze
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 95 : 140 - 148