UAV swarm air combat maneuver decision-making method based on multi-agent reinforcement learning and transferring

被引:0
|
作者
Zhiqiang ZHENG
Chen WEI
Haibin DUAN
机构
[1] StateKeyLaboratoryofVirtualRealityTechnologyandSystems,SchoolofAutomationScienceandElectricalEngineering,BeihangUniversity
关键词
D O I
暂无
中图分类号
学科分类号
摘要
During short-range air combat involving unmanned aircraft vehicle(UAV) swarms, UAVs must make accurate maneuver decisions based on information from both enemy and friendly UAVs. This dual requirement of competition and cooperation presents a significant challenge in the field of unmanned air combat. In this paper, a method based on multi-agent reinforcement learning(MARL) is proposed to address this issue. An actor network containing three subnetworks that can handle different types of situational information is designed. Hence, the results from simpler one-on-one scenarios are leveraged to enhance the complex swarm air combat training process. Separate state spaces for local and global information are designed for the actor and critic networks. A detailed reward function is proposed to encourage participation.To prevent lazy participants in air combat, a reward assignment operation is applied to distribute these dense rewards. Simulation testing and ablation experiments demonstrate that both the transfer operation and reward assignment operation can effectively deal with the swarm air combat scenario, and reflect the effectiveness of the proposed method.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Air combat autonomous maneuver decision for one-on-one within visual range engagement base on robust multi-agent reinforcement learning
    Kong, Weiren
    Zhou, Deyun
    Zhang, Kai
    Yang, Zhen
    [J]. 2020 IEEE 16TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION (ICCA), 2020, : 506 - 512
  • [42] Air Combat Maneuver Decision Based on Deep Reinforcement Learning and Game Theory
    Yin, Shuhui
    Kang, Yu
    Zhao, Yunbo
    Xue, Jian
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6939 - 6943
  • [43] Air combat maneuver decision based on deep reinforcement learning with auxiliary reward
    Tingyu Zhang
    Yongshuai Wang
    Mingwei Sun
    Zengqiang Chen
    [J]. Neural Computing and Applications, 2024, 36 (21) : 13341 - 13356
  • [44] Cooperative maneuver decision making for multi-UAV air combat based on incomplete information dynamic game
    Zhi Ren
    Dong Zhang
    Shuo Tang
    Wei Xiong
    Shu-heng Yang
    [J]. Defence Technology, 2023, 27 (09) : 308 - 317
  • [45] Cooperative maneuver decision making for multi-UAV air combat based on incomplete information dynamic game
    Ren, Zhi
    Zhang, Dong
    Tang, Shuo
    Xiong, Wei
    Yang, Shu-heng
    [J]. DEFENCE TECHNOLOGY, 2023, 27 : 308 - 317
  • [46] Autonomous guidance maneuver control and decision-making algorithm based on deep reinforcement learning UAV route
    Zhang, Kun
    Li, Ke
    Shi, Haotian
    Zhang, Zhenchong
    Liu, Zekun
    [J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2020, 42 (07): : 1567 - 1574
  • [47] A method of network attack-defense game and collaborative defense decision-making based on hierarchical multi-agent reinforcement learning
    Tang, Yunlong
    Sun, Jing
    Wang, Huan
    Deng, Junyi
    Tong, Liang
    Xu, Wenhong
    [J]. COMPUTERS & SECURITY, 2024, 142
  • [48] Maneuver Decision-Making for Autonomous Air Combat Based on FRE-PPO
    Zhang, Hongpeng
    Wei, Yujie
    Zhou, Huan
    Huang, Changqiang
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [49] Maneuver Strategy Generation of UCAV for within Visual Range Air Combat Based on Multi-Agent Reinforcement Learning and Target Position Prediction
    Kong, Weiren
    Zhou, Deyun
    Yang, Zhen
    Zhang, Kai
    Zeng, Lina
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (15):
  • [50] Agent Coordination in Air Combat Simulation using Multi-Agent Deep Reinforcement Learning
    Kallstrom, Johan
    Heintz, Fredrik
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 2157 - 2164