Cooperative decision-making algorithm with efficient convergence for UCAV formation in beyond-visual-range air combat based on multi-agent reinforcement learning

被引:1
|
作者
Zhou, Yaoming [1 ]
Yang, Fan [1 ]
Zhang, Chaoyue [1 ]
Li, Shida [1 ]
Wang, Yongchao [2 ]
机构
[1] Beihang Univ, Sch Aeronaut Sci & Engn, Beijing 100191, Peoples R China
[2] Zhejiang Univ, Inst Cyber Syst & Control, Key Lab Ind Control Technol, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
Unmanned combat aerial vehicle (UCAV) formation; Decision-making; Beyond-visual-range (BVR) air combat; Advantage highlight; Multi-agent reinforcement learning (MARL);
D O I
10.1016/j.cja.2024.04.008
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Highly intelligent Unmanned Combat Aerial Vehicle (UCAV) formation is expected to bring out strengths in Beyond-Visual-Range (BVR) air combat. Although Multi-Agent Reinforcement Learning (MARL) shows outstanding performance in cooperative decision-making, it is challenging for existing MARL algorithms to quickly converge to an optimal strategy for UCAV formation in BVR air combat where confrontation is complicated and reward is extremely sparse and delayed. Aiming to solve this problem, this paper proposes an Advantage Highlight MultiAgent Proximal Policy Optimization (AHMAPPO) algorithm. First, at every step, the AHMAPPO records the degree to which the best formation exceeds the average of formations in parallel environments and carries out additional advantage sampling according to it. Then, the sampling result is introduced into the updating process of the actor network to improve its optimization efficiency. Finally, the simulation results reveal that compared with some state-of-the-art MARL algorithms, the AHMAPPO can obtain a more excellent strategy utilizing fewer sample episodes in the UCAV formation BVR air combat simulation environment built in this paper, which can reflect the critical features of BVR air combat. The AHMAPPO can significantly increase the convergence efficiency
引用
收藏
页码:311 / 328
页数:18
相关论文
共 50 条
  • [41] UAV maneuver decision-making via deep reinforcement learning for short-range air combat
    Zheng, Zhiqiang
    Duan, Haibin
    INTELLIGENCE & ROBOTICS, 2023, 3 (01): : 76 - 94
  • [42] Decision-Making Strategies for Close-Range Air Combat Based on Reinforcement Learning with Variable-Scale Actions
    Wang, Lixin
    Wang, Jin
    Liu, Hailiang
    Yue, Ting
    AEROSPACE, 2023, 10 (05)
  • [43] Autonomous maneuver decision-making for a UCAV in short-range aerial combat based on an MS-DDQN algorithm
    Li, Yong-feng
    Shi, Jing-ping
    Jiang, Wei
    Zhang, Wei-guo
    Lyu, Yong-xi
    DEFENCE TECHNOLOGY, 2022, 18 (09) : 1697 - 1714
  • [44] Multi-agent Reinforcement Learning-based Offloading Decision for UAV Cluster Combat Tasks
    Li J.
    Shi Y.
    Yang Y.
    Li B.
    Zhao X.
    Binggong Xuebao/Acta Armamentarii, 2023, 44 (11): : 3295 - 3309
  • [45] Utility-based sequential decision-making in evidential cooperative multi-agent systems
    Rogova, G
    Lollett, C
    Scott, P
    FUSION 2003: PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE OF INFORMATION FUSION, VOLS 1 AND 2, 2003, : 823 - 830
  • [46] Intelligent maneuvering decision-making in two-UCAV cooperative air combat based on improved MADDPG with hybrid hyper network
    Li, Wentao
    Fang, Feng
    Wang, Zhenya
    Zhu, Yichao
    Peng, Dongliang
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2024, 45 (17):
  • [47] Autonomous Dogfight Decision-Making for Air Combat Based on Reinforcement Learning with Automatic Opponent Sampling
    Chen, Can
    Song, Tao
    Mo, Li
    Lv, Maolong
    Lin, Defu
    AEROSPACE, 2025, 12 (03)
  • [48] Graph Convolution-Based Deep Reinforcement Learning for Multi-Agent Decision-Making in Interactive Traffic Scenarios
    Liu, Qi
    Li, Zirui
    Li, Xueyuan
    Wu, Jingda
    Yuan, Shihua
    IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC, 2022, 2022-October : 4074 - 4081
  • [49] Graph Convolution-Based Deep Reinforcement Learning for Multi-Agent Decision-Making in Interactive Traffic Scenarios
    Liu, Qi
    Li, Zirui
    Li, Xueyuan
    Wu, Jingda
    Yuan, Shihua
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 4074 - 4081
  • [50] Multi-Agent Reinforcement Learning-Based Trading Decision-Making in Platooning-Assisted Vehicular Networks
    Xiao, Tingting
    Chen, Chen
    Dong, Mianxiong
    Ota, Kaoru
    Liu, Lei
    Dustdar, Schahram
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2024, 32 (03) : 2143 - 2158