Cooperative Pursuit of UAV Cluster Based on Graph Embedding Reinforcement Learning

被引:1
|
作者
Guo, Wan-chun [1 ]
Xie, Wu-jie [1 ]
Dong, Wen-han [1 ]
He, Lei [1 ]
机构
[1] Air Force Engn Univ, Xian, Peoples R China
关键词
UAV cluster; mufti-Agent reinforcement learning; graph neural network;
D O I
10.1109/ICCEIC54227.2021.00032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Aiming at the cooperative behavior decision-making problem of UAV cluster cooperative pursuit of escaping UAVs, a cooperative pursuit target decision-making process model based on Dec-POMDP is established.Aiming at the uncertain and partially observable scenes caused by the limited perception and communication ability of UAV cluster system state, a dynamic graph embedding method is proposed.The cluster state embedded in the graph is represented as the network input under the AC framework. Through information fusion, the individuals in the cluster system can perceive the information of the surrounding UAVs, so as to produce the global situation.Based on the idea of centralized evaluation and distributed execution, a multi-agent strategy gradient method for double delay depth determination based on empirical playback region reconstruction is proposed.This method can be effectively combined with the graph embedding method to represent the state of cluster system. The above method is applied to the target pursuit of UAV cluster, and the learning process has good convergence.
引用
收藏
页码:123 / 128
页数:6
相关论文
共 50 条
  • [31] Multi-UAV Cooperative Search Based on Reinforcement Learning With a Digital Twin Driven Training Framework
    Shen, Gaoqing
    Lei, Lei
    Zhang, Xinting
    Li, Zhilin
    Cai, Shengsuo
    Zhang, Lijuan
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (07) : 8354 - 8368
  • [32] Deep Reinforcement Learning-Based Resource Allocation in Cooperative UAV-Assisted Wireless Networks
    Luong, Phuong
    Gagnon, Francois
    Tran, Le-Nam
    Labeau, Fabrice
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (11) : 7610 - 7625
  • [33] iADA*-RL: Anytime Graph-Based Path Planning with Deep Reinforcement Learning for an Autonomous UAV
    Maw, Aye Aye
    Tyan, Maxim
    Nguyen, Tuan Anh
    Lee, Jae-Woo
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (09):
  • [34] Virtual Network Embedding with Changeable Action Space: An Approach based on Graph Neural Network and Reinforcement Learning
    Tan, Yawen
    Wang, Jiadai
    Liu, Jiajia
    [J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 3646 - 3651
  • [35] Leveraging Joint-Action Embedding in Multiagent Reinforcement Learning for Cooperative Games
    Lou, Xingzhou
    Zhang, Junge
    Du, Yali
    Yu, Chao
    He, Zhaofeng
    Huang, Kaiqi
    [J]. IEEE TRANSACTIONS ON GAMES, 2024, 16 (02) : 470 - 482
  • [36] Graph Convolutional Multi-Agent Reinforcement Learning for UAV Coverage Control
    Dai, Anna
    Li, Rongpeng
    Zhaot, Zhifeng
    Zhang, Honggang
    [J]. 2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 1106 - 1111
  • [37] DISCOVERING UNPRECEDENTED HEURISTICS FOR HUB IDENTIFICATION BY JOINT GRAPH EMBEDDING AND REINFORCEMENT LEARNING
    Kim, Minjeong
    Yang, Defu
    Wu, Guorong
    [J]. 2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 1573 - 1576
  • [38] ACR-GNN: Adaptive Cluster Reinforcement Graph Neural Network Based on Contrastive Learning
    Jianpeng Hu
    Shengfu Ning
    Meng Yan
    Yifan Cao
    Zhishen Nie
    Ying Lin
    [J]. Neural Processing Letters, 2023, 55 : 8215 - 8236
  • [39] Multi-agent Reinforcement Learning-based Offloading Decision for UAV Cluster Combat Tasks
    Li J.
    Shi Y.
    Yang Y.
    Li B.
    Zhao X.
    [J]. Binggong Xuebao/Acta Armamentarii, 2023, 44 (11): : 3295 - 3309
  • [40] ASSEMBLY SEQUENCE OPTIMIZATION OF SPATIAL TRUSSES USING GRAPH EMBEDDING AND REINFORCEMENT LEARNING
    Hayashi, Kazuki
    Ohsaki, Makoto
    Kotera, Masaya
    [J]. JOURNAL OF THE INTERNATIONAL ASSOCIATION FOR SHELL AND SPATIAL STRUCTURES, 2022, 63 (04): : 232 - 240