Multi-Unmanned Aerial Vehicle Confrontation in Intelligent Air Combat: A Multi-Agent Deep Reinforcement Learning Approach

Cited by: 0
Authors
Yang, Jianfeng [1]
Yang, Xinwei [2]
Yu, Tianqi [1]
Affiliations
[1] Soochow Univ, Sch Elect & Informat Engn, Suzhou 215006, Peoples R China
[2] Guangdong Power Grid Corp, Dongguan Power Supply Bur, Dongguan, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
multi-UAV confrontation; intelligent decision-making; multi-agent deep reinforcement learning; DECISION-MAKING;
DOI
10.3390/drones8080382
Chinese Library Classification (CLC) number
TP7 [Remote Sensing Technology];
Discipline classification codes
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
Abstract
Multiple unmanned aerial vehicle (multi-UAV) confrontation is becoming an increasingly important combat mode in intelligent air combat. The confrontation relies heavily on intelligent collaboration and real-time decision-making among the UAVs. This paper therefore proposes a decomposed and prioritized experience replay (PER)-based multi-agent deep deterministic policy gradient (DP-MADDPG) algorithm for the moving and attacking decisions of UAVs. Specifically, the confrontation is formulated as a partially observable Markov game. To solve it, DP-MADDPG integrates the decomposed and PER mechanisms into the traditional MADDPG. To address convergence to a local optimum and to a single dominant policy, the decomposed mechanism modifies the MADDPG framework with local and global dual critic networks. Furthermore, to improve the convergence rate of MADDPG training, the PER mechanism is used to improve the efficiency of sampling from the experience replay buffer. Simulations were conducted on the Multi-agent Combat Arena (MaCA) platform, with the traditional MADDPG and independent learning DDPG (ILDDPG) algorithms as benchmarks. The simulation results indicate that the proposed DP-MADDPG improves both the convergence rate and the converged reward value. In confrontations against blue parties driven by a vanilla distance-prioritized rule and by the intelligent ILDDPG algorithm, the DP-MADDPG-empowered red party raises its win rate to 96% and 80.5%, respectively.
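As a rough illustration of the PER mechanism named in the abstract, the sketch below shows generic proportional prioritized experience replay: transitions with a larger temporal-difference (TD) error are sampled more often, and importance-sampling weights correct the resulting bias. This is not the authors' implementation. The class name `PrioritizedReplayBuffer`, the parameters `alpha`, `beta`, and `eps`, and their default values are all assumptions chosen for illustration.

```python
import random


class PrioritizedReplayBuffer:
    """Generic proportional PER sketch (hypothetical; not taken from the paper)."""

    def __init__(self, capacity, alpha=0.6, eps=1e-6):
        self.capacity = capacity
        self.alpha = alpha  # assumed default; 0 would give uniform sampling
        self.eps = eps      # keeps every priority strictly positive
        self.data = []
        self.priorities = []
        self.pos = 0        # next slot to overwrite once the buffer is full

    def add(self, transition):
        # New transitions receive the current maximum priority so that
        # they are likely to be sampled before their TD error is known.
        p = max(self.priorities, default=1.0)
        if len(self.data) < self.capacity:
            self.data.append(transition)
            self.priorities.append(p)
        else:
            self.data[self.pos] = transition
            self.priorities[self.pos] = p
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size, beta=0.4, rng=random):
        # Sampling probability is proportional to priority ** alpha.
        scaled = [p ** self.alpha for p in self.priorities]
        total = sum(scaled)
        probs = [s / total for s in scaled]
        n = len(self.data)
        idxs = rng.choices(range(n), weights=probs, k=batch_size)
        # Importance-sampling weights, normalized by the largest weight
        # in this batch so that they only scale updates downward.
        weights = [(n * probs[i]) ** (-beta) for i in idxs]
        max_w = max(weights)
        weights = [w / max_w for w in weights]
        batch = [self.data[i] for i in idxs]
        return batch, idxs, weights

    def update_priorities(self, idxs, td_errors):
        # After a learning step, refresh priorities with the new |TD error|.
        for i, err in zip(idxs, td_errors):
            self.priorities[i] = abs(err) + self.eps
```

In a training loop of this kind, the critic loss for each sampled transition would typically be multiplied by its importance-sampling weight, and `update_priorities` would be called with the freshly computed TD errors. How DP-MADDPG combines this with its local and global critics is not specified in the abstract.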
Pages: 18