Maneuver Strategy Generation of UCAV for within Visual Range Air Combat Based on Multi-Agent Reinforcement Learning and Target Position Prediction

被引:26
|
作者
Kong, Weiren [1 ]
Zhou, Deyun [1 ]
Yang, Zhen [1 ]
Zhang, Kai [1 ]
Zeng, Lina [1 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2020年 / 10卷 / 15期
基金
中国国家自然科学基金;
关键词
air combat; multi-agent deep reinforcement learning; maneuver strategy; network training; unmanned combat aerial vehicle;
D O I
10.3390/app10155198
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
With the development of unmanned combat air vehicles (UCAVs) and artificial intelligence (AI), within visual range (WVR) air combat confrontations utilizing intelligent UCAVs are expected to be widely used in future air combats. As controlling highly dynamic and uncertain WVR air combats from the ground stations of the UCAV is not feasible, it is necessary to develop an algorithm that can generate highly intelligent air combat strategies in order to enable UCAV to independently complete air combat missions. In this paper, a 1-vs.-1 WVR air combat strategy generation algorithm is proposed using the multi-agent deep deterministic policy gradient (MADDPG). A 1-vs.-1 WVR air combat is modeled as a two-player zero-sum Markov game (ZSMG). A method for predicting the position of the target is introduced into the model in order to enable the UCAV to predict the target's actions and position. Moreover, to ensure that the UCAV is not limited by the constraints of the basic fighter maneuver (BFM) library, the action space is considered to be a continuous one. At the same time, a potential-based reward shaping method is proposed in order to improve the efficiency of the air combat strategy generation algorithm. Finally, the efficiency of the air combat strategy generation algorithm and the intelligence level of the resulting strategy is verified through simulation experiments. The results show that an air combat strategy using target position prediction is superior to the one that does not use target position prediction.
引用
收藏
页数:23
相关论文
共 50 条
  • [21] UAV intelligent attack strategy generation model based on multi-agent game reinforcement learning
    Zhao Z.
    Cao L.
    Chen X.
    Lai J.
    Zhang L.
    [J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2023, 45 (10): : 3165 - 3171
  • [22] Short-range air combat maneuver decision of UAV swarm based on multi-agent Transformer introducing virtual objects
    Jiang, Feilong
    Xu, Minqiang
    Li, Yuqing
    Cui, Hutao
    Wang, Rixin
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [23] Beyond-Visual-Range Air Combat Tactics Auto-Generation by Reinforcement Learning
    Piao, Haiyin
    Sun, Zhixiao
    Meng, Guanglei
    Chen, Hechang
    Qu, Bohao
    Lang, Kuijun
    Sun, Yang
    Yang, Shengqi
    Peng, Xuanqi
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [24] A Modified Multi-Agent Reinforcement Learning Protocol Based on Prediction for UAANETs
    Li, Chao
    Liu, Jing
    [J]. 2020 IEEE 92ND VEHICULAR TECHNOLOGY CONFERENCE (VTC2020-FALL), 2020,
  • [25] Deep Reinforcement Learning-Based Decision Making for Six Degree of Freedom UCAV Close Range Air Combat
    Zhou, Pan
    Li, Ni
    Huang, Jiangtao
    Zhang, Sheng
    Zhou, Xiaoyu
    Liu, Gang
    [J]. 2023 ASIA-PACIFIC INTERNATIONAL SYMPOSIUM ON AEROSPACE TECHNOLOGY, VOL II, APISAT 2023, 2024, 1051 : 320 - 334
  • [26] Cross coordination of behavior clone and reinforcement learning for autonomous within-visual-range air combat
    Li, Lun
    Zhang, Xuebo
    Qian, Chenxu
    Zhao, Minghui
    Wang, Runhua
    [J]. NEUROCOMPUTING, 2024, 584
  • [27] Virtual Machine Migration Strategy Based on Multi-Agent Deep Reinforcement Learning
    Dai, Yu
    Zhang, Qiuhong
    Yang, Lei
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (17):
  • [28] Deep Reinforcement Learning based Autonomous Air-to-Air Combat using Target Trajectory Prediction
    Yoo, Jaewoong
    Kim, Donghwi
    Shim, David Hyunchul
    [J]. 2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 2172 - 2176
  • [29] Prediction of Breast Cancer Pathogenic Genes Based on Multi-agent Reinforcement Learning
    Liu J.
    Gu Y.
    Cheng Y.-H.
    Wang X.-S.
    [J]. Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (05): : 1246 - 1258
  • [30] Multi-agent Reinforcement Learning-based Offloading Decision for UAV Cluster Combat Tasks
    Li J.
    Shi Y.
    Yang Y.
    Li B.
    Zhao X.
    [J]. Binggong Xuebao/Acta Armamentarii, 2023, 44 (11): : 3295 - 3309