Cooperative decision-making algorithm with efficient convergence for UCAV formation in beyond-visual-range air combat based on multi-agent reinforcement learning

被引:1
|
作者
Zhou, Yaoming [1 ]
Yang, Fan [1 ]
Zhang, Chaoyue [1 ]
Li, Shida [1 ]
Wang, Yongchao [2 ]
机构
[1] Beihang Univ, Sch Aeronaut Sci & Engn, Beijing 100191, Peoples R China
[2] Zhejiang Univ, Inst Cyber Syst & Control, Key Lab Ind Control Technol, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
Unmanned combat aerial vehicle (UCAV) formation; Decision-making; Beyond-visual-range (BVR) air combat; Advantage highlight; Multi-agent reinforcement learning (MARL);
D O I
10.1016/j.cja.2024.04.008
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Highly intelligent Unmanned Combat Aerial Vehicle (UCAV) formation is expected to bring out strengths in Beyond-Visual-Range (BVR) air combat. Although Multi-Agent Reinforcement Learning (MARL) shows outstanding performance in cooperative decision-making, it is challenging for existing MARL algorithms to quickly converge to an optimal strategy for UCAV formation in BVR air combat where confrontation is complicated and reward is extremely sparse and delayed. Aiming to solve this problem, this paper proposes an Advantage Highlight MultiAgent Proximal Policy Optimization (AHMAPPO) algorithm. First, at every step, the AHMAPPO records the degree to which the best formation exceeds the average of formations in parallel environments and carries out additional advantage sampling according to it. Then, the sampling result is introduced into the updating process of the actor network to improve its optimization efficiency. Finally, the simulation results reveal that compared with some state-of-the-art MARL algorithms, the AHMAPPO can obtain a more excellent strategy utilizing fewer sample episodes in the UCAV formation BVR air combat simulation environment built in this paper, which can reflect the critical features of BVR air combat. The AHMAPPO can significantly increase the convergence efficiency
引用
收藏
页码:311 / 328
页数:18
相关论文
共 50 条
  • [31] Enhancing Situation Awareness in Beyond Visual Range Air Combat with Reinforcement Learning-based Decision Support
    Scukins, Edvards
    Klein, Markus
    Ogren, Petter
    2023 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS, 2023, : 56 - 62
  • [32] Multi-agent Decision-making at Unsignalized Intersections with Reinforcement Learning from Demonstrations
    Huang, Chang
    Zhao, Junqiao
    Zhou, Hongtu
    Zhang, Hai
    Zhang, Xiao
    Ye, Chen
    2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [33] LLM-guided decision-making toolkit for multi-agent reinforcement learning
    Li, Zhemin
    Zhang, Ruobing
    Wang, Zhengming
    Xie, Zheng
    Song, Yiping
    NEUROCOMPUTING, 2025, 638
  • [34] Multi-intent autonomous decision-making for air combat with deep reinforcement learning
    Luyu Jia
    Chengtao Cai
    Xingmei Wang
    Zhengkun Ding
    Junzheng Xu
    Kejun Wu
    Jiaqi Liu
    Applied Intelligence, 2023, 53 : 29076 - 29093
  • [35] Multi-intent autonomous decision-making for air combat with deep reinforcement learning
    Jia, Luyu
    Cai, Chengtao
    Wang, Xingmei
    Ding, Zhengkun
    Xu, Junzheng
    Wu, Kejun
    Liu, Jiaqi
    APPLIED INTELLIGENCE, 2023, 53 (23) : 29076 - 29093
  • [36] Collaborative Decision-making in Heterogeneous UAV Swarms based on Multi-agent Deep Reinforcement Learning
    Yang, Feng
    Li, Zhi
    Fu, Jiahao
    39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 2138 - 2145
  • [37] Hierarchical multi-agent reinforcement learning for multi-aircraft close-range air combat
    Kong, Wei-ren
    Zhou, De-yun
    Du, Yong-jie
    Zhou, Ying
    Zhao, Yi-yang
    IET CONTROL THEORY AND APPLICATIONS, 2023, 17 (13): : 1840 - 1862
  • [38] Cooperative Reinforcement Learning Algorithm to Distributed Power System Based on Multi-Agent
    Gao, La-mei
    Zeng, Jun
    Wu, Jie
    Li, Min
    2009 3RD INTERNATIONAL CONFERENCE ON POWER ELECTRONICS SYSTEMS AND APPLICATIONS: ELECTRIC VEHICLE AND GREEN ENERGY, 2009, : 53 - 53
  • [39] Decision-Making Strategy Using Multi-Agent Reinforcement Learning for Platoon Formation in Agreement-Seeking Cooperation
    Hyeon, Eunjeong
    Karbowski, Dominik
    Rousseau, Aymeric
    2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [40] Attack Target Sorting Technology for Beyond Visual Range Air Combat Based on Grey Incidence Decision-Making Method
    Chen, Mou
    Zou, Qing-yuan
    Jiang, Chang-sheng
    2008 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-11, 2008, : 1940 - 1944