Autonomous Maneuver Decision Making of Dual-UAV Cooperative Air Combat Based on Deep Reinforcement Learning

被引:41
|
作者
Hu, Jinwen [1 ]
Wang, Luhe [1 ]
Hu, Tianmi [1 ]
Guo, Chubing [2 ,3 ]
Wang, Yanxiong [4 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
[2] China Elect Technol Grp Corp, Res Inst 20, Key Lab Data Link Technol, Xian 710068, Peoples R China
[3] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China
[4] AVIC Chengdu Aircraft Design & Res Inst, Chengdu 610091, Peoples R China
基金
中国国家自然科学基金;
关键词
air combat; maneuver decision; reinforcement learning; priority sampling; situation assessment;
D O I
10.3390/electronics11030467
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Autonomous maneuver decision making is the core of intelligent warfare, which has become the main research direction to enable unmanned aerial vehicles (UAVs) to independently generate control commands and complete air combat tasks according to environmental situation information. In this paper, an autonomous maneuver decision making method is proposed for air combat by two cooperative UAVs, which is showcased by using the typical olive formation strategy as a practical example. First, a UAV situation assessment model based on the relative situation is proposed, which uses the real-time target and UAV location information to assess the current situation or threat. Second, the continuous air combat state space is discretized into a 13 dimensional space for dimension reduction and quantitative description, and 15 typical action commands instead of a continuous control space are designed to reduce the difficulty of UAV training. Third, a reward function is designed based on the situation assessment which includes the real-time gain due to maneuver and the final combat winning/losing gain. Fourth, an improved training data sampling strategy is proposed, which samples the data in the experience pool based on priority to accelerate the training convergence. Fifth, a hybrid autonomous maneuver decision strategy for dual-UAV olive formation air combat is proposed which realizes the UAV capability of obstacle avoidance, formation and confrontation. Finally, the air combat task of dual-UAV olive formation is simulated and the results show that the proposed method can help the UAVs defeat the enemy effectively and outperforms the deep Q network (DQN) method without priority sampling in terms of the convergence speed.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] UAV cooperative air combat maneuver decision based on multi-agent reinforcement learning
    ZHANG Jiandong
    YANG Qiming
    SHI Guoqing
    LU Yi
    WU Yong
    [J]. Journal of Systems Engineering and Electronics, 2021, 32 (06) : 1421 - 1438
  • [2] UAV cooperative air combat maneuver decision based on multi-agent reinforcement learning
    Zhang Jiandong
    Yang Qiming
    Shi Guoqing
    Lu Yi
    Wu Yong
    [J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2021, 32 (06) : 1421 - 1438
  • [3] Maneuver Decision of UAV in Short-Range Air Combat Based on Deep Reinforcement Learning
    Yang, Qiming
    Zhang, Jiandong
    Shi, Guoqing
    Hu, Jinwen
    Wu, Yong
    [J]. IEEE ACCESS, 2020, 8 : 363 - 378
  • [4] A UAV Maneuver Decision-Making Algorithm for Autonomous Airdrop Based on Deep Reinforcement Learning
    Li, Ke
    Zhang, Kun
    Zhang, Zhenchong
    Liu, Zekun
    Hua, Shuai
    He, Jianliang
    [J]. SENSORS, 2021, 21 (06)
  • [5] Maneuver decision of UCAV in air combat based on deep reinforcement learning
    Li, Yongfeng
    Shi, Jingping
    Zhang, Weiguo
    Jiang, Wei
    [J]. Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2021, 53 (12): : 33 - 41
  • [6] Air Combat Maneuver Decision Based on Deep Reinforcement Learning and Game Theory
    Yin, Shuhui
    Kang, Yu
    Zhao, Yunbo
    Xue, Jian
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6939 - 6943
  • [7] Air combat maneuver decision based on deep reinforcement learning with auxiliary reward
    Tingyu Zhang
    Yongshuai Wang
    Mingwei Sun
    Zengqiang Chen
    [J]. Neural Computing and Applications, 2024, 36 (21) : 13341 - 13356
  • [8] UAV Air Combat Autonomous Maneuver Decision Based on DDPG Algorithm
    Yang, Qiming
    Zhu, Yan
    Zhang, Jiandong
    Qiao, Shasha
    Liu, Jieling
    [J]. 2019 IEEE 15TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2019, : 37 - 42
  • [9] Autonomous guidance maneuver control and decision-making algorithm based on deep reinforcement learning UAV route
    Zhang, Kun
    Li, Ke
    Shi, Haotian
    Zhang, Zhenchong
    Liu, Zekun
    [J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2020, 42 (07): : 1567 - 1574
  • [10] Autonomous air combat decision-making of UAV based on parallel self-play reinforcement learning
    Li, Bo
    Huang, Jingyi
    Bai, Shuangxia
    Gan, Zhigang
    Liang, Shiyang
    Evgeny, Neretin
    Yao, Shouwen
    [J]. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023, 8 (01) : 64 - 81