Short-range air combat maneuver decision of UAV swarm based on multi-agent Transformer introducing virtual objects

被引:12
|
作者
Jiang, Feilong [1 ]
Xu, Minqiang [1 ]
Li, Yuqing [1 ]
Cui, Hutao [1 ]
Wang, Rixin [1 ]
机构
[1] Harbin Inst Technol, Sch Astronaut, Harbin 150090, Peoples R China
关键词
UAV swarm; Maneuver decision; Short-range air combat; Multi-agent Transformer; Virtual object; Reinforcement learning;
D O I
10.1016/j.engappai.2023.106358
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the development of Unmanned Aerial Vehicle (UAV) swarm technology, there has been a growing interest in using Artificial Intelligence (AI) to drive UAV swarms for short-range air combat. However, due to the complexity of situation information in UAV swarm air combat, making accurate air combat decisions based on air combat situation information has become a challenge. In this paper, we propose the multi-agent Transformer introducing virtual objects (MTVO) to address this issue. First, the proposed approach designs a multi-agent Transformer network structure by exploiting the homogeneity feature of the UAV state information in the swarm. This structure enables the structured processing of complex situation information. Specifically, the local situation information of each UAV is calculated by self-attention, which reduces the size of the swarm situation information while retaining the information of key UAVs. This approach reduces the difficulty of processing UAV swarm situation information. Moreover, we add a virtual object to the UAV swarm information to assist in calculating the weight distribution of the local situation. The weighted fusion of local situations allows us to obtain a more effective representation of the global situation, which serves as the basis for more accurate air combat maneuver decisions. We demonstrate the performance of the proposed method through air combat simulation results using Reinforcement Learning (RL) methods, and validate the applicability and effectiveness of the MTVO network for the short-range air combat problem of UAV swarms.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Multi-UAV Redeployment Optimization Based on Multi-Agent Deep Reinforcement Learning Oriented to Swarm Performance Restoration
    Wu, Qilong
    Geng, Zitao
    Ren, Yi
    Feng, Qiang
    Zhong, Jilong
    SENSORS, 2023, 23 (23)
  • [42] Multi-Dimensional Decision-Making for UAV Air Combat Based on Hierarchical Reinforcement Learning
    Zhang J.
    Wang D.
    Yang Q.
    Shi G.
    Lu Y.
    Zhang Y.
    Binggong Xuebao/Acta Armamentarii, 2023, 44 (06): : 1547 - 1563
  • [43] Multi-agent Decision Model and Application Based on Recurrent Neural Network and Particle Swarm Optimization
    Li, Ming
    Liu, Wei-bing
    Wang, Xian-jia
    ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2008, : 469 - 473
  • [44] Method of Formation Cooperative Air Defense Decision Based on Multi-agent System Cooperation
    Wang, Chao
    Wang, Bo
    Zhang, Guo
    Liang, Yizhi
    COMMUNICATIONS AND INFORMATION PROCESSING, PT 2, 2012, 289 : 529 - 538
  • [45] Cooperative Occupancy Decision Making of Multi-UAV in Beyond-Visual-Range Air Combat: A Game Theory Approach
    Ma, Yingying
    Wang, Guoqiang
    Hu, Xiaoxuan
    Luo, He
    Lei, Xing
    IEEE ACCESS, 2020, 8 : 11624 - 11634
  • [46] Collaborative Decision-making in Heterogeneous UAV Swarms based on Multi-agent Deep Reinforcement Learning
    Yang, Feng
    Li, Zhi
    Fu, Jiahao
    39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 2138 - 2145
  • [47] Mean policy-based proximal policy optimization for maneuvering decision in multi-UAV air combat
    Zheng, Yifan
    Xin, Bin
    He, Bin
    Ding, Yulong
    Neural Computing and Applications, 2024, 36 (31) : 19667 - 19690
  • [48] Swarm and Multi-agent Time-based A* Path Planning for Lighter-Than-Air Systems
    Gibson, Jason
    Schuler, Tristan
    McGuire, Loy
    Lofaro, Daniel M.
    Sofge, Donald
    UNMANNED SYSTEMS, 2020, 8 (03) : 253 - 260
  • [49] Multi-UAV air combat cooperative game based on virtual opponent and value attention decomposition policy gradient
    Xu, Xiaojie
    Wang, Yunfan
    Guo, Xian
    Huang, Kuihua
    Zhang, Xuebo
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 267
  • [50] Vision-Based 3D Aerial Target Detection and Tracking for Maneuver Decision in Close-Range Air Combat
    Zhong, Leisheng
    Zhao, Leiming
    Ding, Chencong
    Ge, Xueshi
    Chen, Jialin
    Zhang, Yu
    Zhang, Li
    IEEE ACCESS, 2022, 10 : 4157 - 4168