Maneuver Strategy Generation of UCAV for within Visual Range Air Combat Based on Multi-Agent Reinforcement Learning and Target Position Prediction

被引:29
|
作者
Kong, Weiren [1 ]
Zhou, Deyun [1 ]
Yang, Zhen [1 ]
Zhang, Kai [1 ]
Zeng, Lina [1 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2020年 / 10卷 / 15期
基金
中国国家自然科学基金;
关键词
air combat; multi-agent deep reinforcement learning; maneuver strategy; network training; unmanned combat aerial vehicle;
D O I
10.3390/app10155198
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
With the development of unmanned combat air vehicles (UCAVs) and artificial intelligence (AI), within visual range (WVR) air combat confrontations utilizing intelligent UCAVs are expected to be widely used in future air combats. As controlling highly dynamic and uncertain WVR air combats from the ground stations of the UCAV is not feasible, it is necessary to develop an algorithm that can generate highly intelligent air combat strategies in order to enable UCAV to independently complete air combat missions. In this paper, a 1-vs.-1 WVR air combat strategy generation algorithm is proposed using the multi-agent deep deterministic policy gradient (MADDPG). A 1-vs.-1 WVR air combat is modeled as a two-player zero-sum Markov game (ZSMG). A method for predicting the position of the target is introduced into the model in order to enable the UCAV to predict the target's actions and position. Moreover, to ensure that the UCAV is not limited by the constraints of the basic fighter maneuver (BFM) library, the action space is considered to be a continuous one. At the same time, a potential-based reward shaping method is proposed in order to improve the efficiency of the air combat strategy generation algorithm. Finally, the efficiency of the air combat strategy generation algorithm and the intelligence level of the resulting strategy is verified through simulation experiments. The results show that an air combat strategy using target position prediction is superior to the one that does not use target position prediction.
引用
收藏
页数:23
相关论文
共 50 条
  • [31] Multi-Unmanned Aerial Vehicle Confrontation in Intelligent Air Combat: A Multi-Agent Deep Reinforcement Learning Approach
    Yang, Jianfeng
    Yang, Xinwei
    Yu, Tianqi
    DRONES, 2024, 8 (08)
  • [32] Multi-agent Reinforcement Learning-based Offloading Decision for UAV Cluster Combat Tasks
    Li J.
    Shi Y.
    Yang Y.
    Li B.
    Zhao X.
    Binggong Xuebao/Acta Armamentarii, 2023, 44 (11): : 3295 - 3309
  • [33] Maneuvering strategy generation algorithm for multi-UAV in close-range air combat based on deep reinforcement learning and self-play
    Kong W.-R.
    Zhou D.-Y.
    Zhao Y.-Y.
    Yang W.-S.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2022, 39 (02): : 352 - 362
  • [34] Multi-agent target search strategy optimization: Hierarchical reinforcement learning with multi-criteria negative feedback*
    Cao, Xin
    Luo, He
    Tai, Jianwei
    Jiang, Ruhao
    Wang, Guoqiang
    APPLIED SOFT COMPUTING, 2023, 149
  • [35] A Multi-UCAV cooperative occupation method based on weapon engagement zones for beyond-visual-range air combat
    Wei-hua Li
    Jing-ping Shi
    Yun-yan Wu
    Yue-ping Wang
    Yong-xi Lyu
    Defence Technology, 2022, 18 (06) : 1006 - 1022
  • [36] Target round-up control for multi-agent systems based on reinforcement learning
    Fan Z.
    Yang H.
    Han Y.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2023, 44
  • [37] A Multi-UCAV cooperative occupation method based on weapon engagement zones for beyond-visual-range air combat
    Li, Wei-hua
    Shi, Jing-ping
    Wu, Yun-yan
    Wang, Yue-ping
    Lyu, Yong-xi
    DEFENCE TECHNOLOGY, 2022, 18 (06) : 1006 - 1022
  • [38] A Data Enhancement Strategy for Multi-Agent Cooperative Hunting based on Deep Reinforcement Learning
    Gao, Zhenkun
    Dai, Xiaoyan
    Yao, Meibao
    Xiao, Xueming
    2023 IEEE 6TH INTERNATIONAL CONFERENCE ON INDUSTRIAL CYBER-PHYSICAL SYSTEMS, ICPS, 2023,
  • [39] Edge server deployment strategy based on multi-agent reinforcement learning in the internet of vehicles
    Li, Chuang
    Ji, Jianqiao
    Hu, Zhigang
    Zhou, Zhou
    Zhongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Central South University (Science and Technology), 2024, 55 (07): : 2567 - 2577
  • [40] Cooperative countermeasure strategy based on active risk defense multi-agent reinforcement learning
    Sun H.-H.
    Hu C.-H.
    Zhang J.-G.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (05): : 1420 - 1429