Autonomous Maneuver Decision Making of Dual-UAV Cooperative Air Combat Based on Deep Reinforcement Learning

被引:39
|
作者
Hu, Jinwen [1 ]
Wang, Luhe [1 ]
Hu, Tianmi [1 ]
Guo, Chubing [2 ,3 ]
Wang, Yanxiong [4 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
[2] China Elect Technol Grp Corp, Res Inst 20, Key Lab Data Link Technol, Xian 710068, Peoples R China
[3] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China
[4] AVIC Chengdu Aircraft Design & Res Inst, Chengdu 610091, Peoples R China
基金
中国国家自然科学基金;
关键词
air combat; maneuver decision; reinforcement learning; priority sampling; situation assessment;
D O I
10.3390/electronics11030467
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Autonomous maneuver decision making is the core of intelligent warfare, which has become the main research direction to enable unmanned aerial vehicles (UAVs) to independently generate control commands and complete air combat tasks according to environmental situation information. In this paper, an autonomous maneuver decision making method is proposed for air combat by two cooperative UAVs, which is showcased by using the typical olive formation strategy as a practical example. First, a UAV situation assessment model based on the relative situation is proposed, which uses the real-time target and UAV location information to assess the current situation or threat. Second, the continuous air combat state space is discretized into a 13 dimensional space for dimension reduction and quantitative description, and 15 typical action commands instead of a continuous control space are designed to reduce the difficulty of UAV training. Third, a reward function is designed based on the situation assessment which includes the real-time gain due to maneuver and the final combat winning/losing gain. Fourth, an improved training data sampling strategy is proposed, which samples the data in the experience pool based on priority to accelerate the training convergence. Fifth, a hybrid autonomous maneuver decision strategy for dual-UAV olive formation air combat is proposed which realizes the UAV capability of obstacle avoidance, formation and confrontation. Finally, the air combat task of dual-UAV olive formation is simulated and the results show that the proposed method can help the UAVs defeat the enemy effectively and outperforms the deep Q network (DQN) method without priority sampling in terms of the convergence speed.
引用
收藏
页数:22
相关论文
共 50 条
  • [31] UAV Autonomous Air Combat Decision-making Based on AM-SAC
    Li, Zenglin
    Li, Bo
    Bai, Shuangxia
    Meng, Bobo
    [J]. Binggong Xuebao/Acta Armamentarii, 2023, 44 (09): : 2849 - 2858
  • [32] Enhancing multi-UAV air combat decision making via hierarchical reinforcement learning
    Wang, Huan
    Wang, Jintao
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01)
  • [33] Enhancing multi-UAV air combat decision making via hierarchical reinforcement learning
    Huan Wang
    Jintao Wang
    [J]. Scientific Reports, 14
  • [34] Deep Reinforcement Learning-Based Air-to-Air Combat Maneuver Generation in a Realistic Environment
    Bae, Jung Ho
    Jung, Hoseong
    Kim, Seogbong
    Kim, Sungho
    Kim, Yong-Duk
    [J]. IEEE ACCESS, 2023, 11 : 26427 - 26440
  • [35] Research on UAV Air Combat Maneuver Decision Based on Decision Tree CART Algorithm
    Liu, Haotian
    Jin, Jiangfeng
    Liu, Kun
    Zhang, Jiaping
    Niu, Yanan
    [J]. PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 2638 - 2650
  • [36] 2-D Air Combat Maneuver Decision Using Reinforcement Learning
    Tasbas, Ahmet Semih
    Aydinli, Sevket Utku
    [J]. 2021 7TH INTERNATIONAL CONFERENCE ON ENGINEERING AND EMERGING TECHNOLOGIES (ICEET 2021), 2021, : 740 - 745
  • [37] UAV air combat autonomous trajectory planning method based on robust adversarial reinforcement learning
    Wang, Lixin
    Zheng, Sizhuang
    Tai, Shang
    Liu, Hailiang
    Yue, Ting
    [J]. AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 153
  • [38] UAV Cooperative Air Combat Maneuvering Confrontation Based on Multi-agent Reinforcement Learning
    Gong, Zihao
    Xu, Yang
    Luo, Delin
    [J]. UNMANNED SYSTEMS, 2023, 11 (03) : 273 - 286
  • [39] Research on Air Confrontation Maneuver Decision-Making Method Based on Reinforcement Learning
    Zhang, Xianbing
    Liu, Guoqing
    Yang, Chaojie
    Wu, Jiang
    [J]. ELECTRONICS, 2018, 7 (11):
  • [40] An Evolutionary Reinforcement Learning Approach for Autonomous Maneuver Decision in One-to-One Short-Range Air Combat
    Baykal, Yasin
    Baspinar, Baris
    [J]. 2023 IEEE/AIAA 42ND DIGITAL AVIONICS SYSTEMS CONFERENCE, DASC, 2023,