Maneuver Strategy Generation of UCAV for within Visual Range Air Combat Based on Multi-Agent Reinforcement Learning and Target Position Prediction

被引:26
|
作者
Kong, Weiren [1 ]
Zhou, Deyun [1 ]
Yang, Zhen [1 ]
Zhang, Kai [1 ]
Zeng, Lina [1 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2020年 / 10卷 / 15期
基金
中国国家自然科学基金;
关键词
air combat; multi-agent deep reinforcement learning; maneuver strategy; network training; unmanned combat aerial vehicle;
D O I
10.3390/app10155198
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
With the development of unmanned combat air vehicles (UCAVs) and artificial intelligence (AI), within visual range (WVR) air combat confrontations utilizing intelligent UCAVs are expected to be widely used in future air combats. As controlling highly dynamic and uncertain WVR air combats from the ground stations of the UCAV is not feasible, it is necessary to develop an algorithm that can generate highly intelligent air combat strategies in order to enable UCAV to independently complete air combat missions. In this paper, a 1-vs.-1 WVR air combat strategy generation algorithm is proposed using the multi-agent deep deterministic policy gradient (MADDPG). A 1-vs.-1 WVR air combat is modeled as a two-player zero-sum Markov game (ZSMG). A method for predicting the position of the target is introduced into the model in order to enable the UCAV to predict the target's actions and position. Moreover, to ensure that the UCAV is not limited by the constraints of the basic fighter maneuver (BFM) library, the action space is considered to be a continuous one. At the same time, a potential-based reward shaping method is proposed in order to improve the efficiency of the air combat strategy generation algorithm. Finally, the efficiency of the air combat strategy generation algorithm and the intelligence level of the resulting strategy is verified through simulation experiments. The results show that an air combat strategy using target position prediction is superior to the one that does not use target position prediction.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] UAV cooperative air combat maneuver decision based on multi-agent reinforcement learning
    ZHANG Jiandong
    YANG Qiming
    SHI Guoqing
    LU Yi
    WU Yong
    [J]. Journal of Systems Engineering and Electronics, 2021, 32 (06) : 1421 - 1438
  • [2] UAV cooperative air combat maneuver decision based on multi-agent reinforcement learning
    Zhang Jiandong
    Yang Qiming
    Shi Guoqing
    Lu Yi
    Wu Yong
    [J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2021, 32 (06) : 1421 - 1438
  • [3] Air combat autonomous maneuver decision for one-on-one within visual range engagement base on robust multi-agent reinforcement learning
    Kong, Weiren
    Zhou, Deyun
    Zhang, Kai
    Yang, Zhen
    [J]. 2020 IEEE 16TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION (ICCA), 2020, : 506 - 512
  • [4] Maneuver decision of UCAV in air combat based on deep reinforcement learning
    Li, Yongfeng
    Shi, Jingping
    Zhang, Weiguo
    Jiang, Wei
    [J]. Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2021, 53 (12): : 33 - 41
  • [5] Cooperative decision-making algorithm with efficient convergence for UCAV formation in beyond-visual-range air combat based on multi-agent reinforcement learning
    Zhou, Yaoming
    Yang, Fan
    Zhang, Chaoyue
    Li, Shida
    Wang, Yongchao
    [J]. CHINESE JOURNAL OF AERONAUTICS, 2024, 37 (08) : 311 - 328
  • [6] Evasive Maneuver Strategy for UCAV in Beyond-Visual-Range Air Combat Based on Hierarchical Multi-Objective Evolutionary Algorithm
    Yang, Zhen
    Zhou, Deyun
    Piao, Haiyin
    Zhang, Kai
    Kong, Weiren
    Pan, Qian
    [J]. IEEE ACCESS, 2020, 8 : 46605 - 46623
  • [7] Cooperative decision-making algorithm with beyond-visual-range air combat based on multi-agent reinforcement learning
    Yaoming ZHOU
    Fan YANG
    Chaoyue ZHANG
    Shida LI
    Yongchao WANG
    [J]. Chinese Journal of Aeronautics, 2024, 37 (08) : 311 - 328
  • [8] Hierarchical Multi-Agent Reinforcement Learning for Air Combat Maneuvering
    Selmonaj, Ardian
    Szehr, Oleg
    Del Rio, Giacomo
    Antonucci, Alessandro
    Schneider, Adrian
    Ruegsegger, Michael
    [J]. 22ND IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA 2023, 2023, : 1031 - 1038
  • [9] UAV swarm air combat maneuver decision-making method based on multi-agent reinforcement learning and transferring
    Zhiqiang ZHENG
    Chen WEI
    Haibin DUAN
    [J]. ScienceChina(InformationSciences), 2024, 67 (08) : 49 - 66
  • [10] UAV swarm air combat maneuver decision-making method based on multi-agent reinforcement learning and transferring
    Zheng, Zhiqiang
    Wei, Chen
    Duan, Haibin
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (08)