Cooperative Pursuit of UAV Cluster Based on Graph Embedding Reinforcement Learning

被引:1
|
作者
Guo, Wan-chun [1 ]
Xie, Wu-jie [1 ]
Dong, Wen-han [1 ]
He, Lei [1 ]
机构
[1] Air Force Engn Univ, Xian, Peoples R China
关键词
UAV cluster; mufti-Agent reinforcement learning; graph neural network;
D O I
10.1109/ICCEIC54227.2021.00032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Aiming at the cooperative behavior decision-making problem of UAV cluster cooperative pursuit of escaping UAVs, a cooperative pursuit target decision-making process model based on Dec-POMDP is established.Aiming at the uncertain and partially observable scenes caused by the limited perception and communication ability of UAV cluster system state, a dynamic graph embedding method is proposed.The cluster state embedded in the graph is represented as the network input under the AC framework. Through information fusion, the individuals in the cluster system can perceive the information of the surrounding UAVs, so as to produce the global situation.Based on the idea of centralized evaluation and distributed execution, a multi-agent strategy gradient method for double delay depth determination based on empirical playback region reconstruction is proposed.This method can be effectively combined with the graph embedding method to represent the state of cluster system. The above method is applied to the target pursuit of UAV cluster, and the learning process has good convergence.
引用
收藏
页码:123 / 128
页数:6
相关论文
共 50 条
  • [41] ASSEMBLY SEQUENCE OPTIMIZATION OF SPATIAL TRUSSES USING GRAPH EMBEDDING AND REINFORCEMENT LEARNING
    Hayashi, Kazuki
    Ohsaki, Makoto
    Kotera, Masaya
    [J]. JOURNAL OF THE INTERNATIONAL ASSOCIATION FOR SHELL AND SPATIAL STRUCTURES, 2022, 63 (04): : 232 - 240
  • [42] A Cooperative Control Method Based on Reinforcement Learning
    Zhuang, Zhiwei
    Xu, Yongheng
    Zou, Luobao
    Zhang, Weidong
    Zhao, Qunfei
    Yao, Ruiwen
    [J]. 2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 1586 - 1591
  • [43] Fuzzy PID Controller for UAV Based on Reinforcement Learning
    Zhang, Benyi
    Zhang, Weiping
    Mou, Jiawang
    Yang, Runmin
    Zhang, Yichen
    [J]. PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 1724 - 1732
  • [44] Reinforcement-Learning-based Miniature UAV Identification
    She Xiaoyu
    Guan Zhenyu
    Mao Ruizhi
    Li Jie
    Yang Chengwei
    [J]. PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON UNMANNED SYSTEMS (ICUS), 2017, : 237 - 242
  • [45] Energy Management of Hybrid UAV Based on Reinforcement Learning
    Shen, Huan
    Zhang, Yao
    Mao, Jianguo
    Yan, Zhiwei
    Wu, Linwei
    [J]. ELECTRONICS, 2021, 10 (16)
  • [46] Reinforcement Learning Based Topology Control for UAV Networks
    Yoo, Taehoon
    Lee, Sangmin
    Yoo, Kyeonghyun
    Kim, Hwangnam
    [J]. SENSORS, 2023, 23 (02)
  • [47] Reinforcement Learning based Scheduling for Heterogeneous UAV Networking
    Wang, Jian
    Liu, Yongxin
    Niu, Shuteng
    Song, Houbing
    [J]. 2021 17TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2021), 2021, : 420 - 427
  • [48] Survey of Knowledge Graph Based on Reinforcement Learning
    Ma, Ang
    Yu, Yanhua
    Yang, Shengli
    Shi, Chuan
    Li, Jie
    Cai, Xiuxiu
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (08): : 1694 - 1722
  • [49] A Fast and Robust Algorithm with Reinforcement Learning for Large UAV Cluster Mission Planning
    Zuo, Lei
    Gao, Shan
    Li, Yachao
    Li, Lianghai
    Li, Ming
    Lu, Xiaofei
    [J]. REMOTE SENSING, 2022, 14 (06)
  • [50] Deep Reinforcement Learning Based Computation Offloading and Trajectory Planning for Multi-UAV Cooperative Target Search
    Luo, Quyuan
    Luan, Tom H.
    Shi, Weisong
    Fan, Pingzhi
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2023, 41 (02) : 504 - 520