Deep reinforcement learning-based air combat maneuver decision-making: literature review, implementation tutorial and future direction

被引:0
|
作者
Xinwei Wang
Yihui Wang
Xichao Su
Lei Wang
Chen Lu
Haijun Peng
Jie Liu
机构
[1] Dalian University of Technology,Department of Engineering Mechanics, State Key Laboratory of Structural Analysis, Optimization and CAE Software for Industrial Equipment
[2] Naval Aviation University,School of Mathematical Science
[3] Dalian University of Technology,School of Reliability and Systems Engineering
[4] Science and Technology on Reliability and Environmental Engineering Laboratory,Institute of Reliability Engineering
[5] Beihang University,War Research Institute
[6] Beihang University,undefined
[7] Academy of Military Sciences,undefined
来源
关键词
Artificial intelligence; Unmanned aerial vehicle (UAV); Deep reinforcement learning (DRL); Air combat maneuver decision-making (ACMD);
D O I
暂无
中图分类号
学科分类号
摘要
Nowadays, various innovative air combat paradigms that rely on unmanned aerial vehicles (UAVs), i.e., UAV swarm and UAV-manned aircraft cooperation, have received great attention worldwide. During the operation, UAVs are expected to perform agile and safe maneuvers according to the dynamic mission requirement and complicated battlefield environment. Deep reinforcement learning (DRL), which is suitable for sequential decision-making process, provides a powerful solution tool for air combat maneuver decision-making (ACMD), and hundreds of related research papers have been published in the last five years. However, as an emerging topic, there lacks a systematic review and tutorial. For this reason, this paper first provides a comprehensive literature review to help people grasp a whole picture of this field. It starts from the DRL itself and then extents to its application in ACMD. And special attentions are given to the design of reward function, which is the core of DRL-based ACMD. Then, a maneuver decision-making method based on one-to-one dogfight scenarios is proposed to enable UAV to win short-range air combat. The model establishment, program design, training methods and performance evaluation are described in detail. And the associated Python codes are available at gitee.com/wangyyhhh, thus enabling a quick-start for researchers to build their own ACMD applications by slight modifications. Finally, limitations of the considered model, as well as the possible future research direction for intelligent air combat, are also discussed.
引用
收藏
相关论文
共 50 条
  • [31] Autonomous air combat decision-making of UAV based on parallel self-play reinforcement learning
    Li, Bo
    Huang, Jingyi
    Bai, Shuangxia
    Gan, Zhigang
    Liang, Shiyang
    Evgeny, Neretin
    Yao, Shouwen
    [J]. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023, 8 (01) : 64 - 81
  • [32] UAV cooperative air combat maneuver decision based on multi-agent reinforcement learning
    ZHANG Jiandong
    YANG Qiming
    SHI Guoqing
    LU Yi
    WU Yong
    [J]. Journal of Systems Engineering and Electronics, 2021, 32 (06) : 1421 - 1438
  • [33] UAV cooperative air combat maneuver decision based on multi-agent reinforcement learning
    Zhang Jiandong
    Yang Qiming
    Shi Guoqing
    Lu Yi
    Wu Yong
    [J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2021, 32 (06) : 1421 - 1438
  • [34] 2-D Air Combat Maneuver Decision Using Reinforcement Learning
    Tasbas, Ahmet Semih
    Aydinli, Sevket Utku
    [J]. 2021 7TH INTERNATIONAL CONFERENCE ON ENGINEERING AND EMERGING TECHNOLOGIES (ICEET 2021), 2021, : 740 - 745
  • [35] Explainable, Deep Reinforcement Learning-Based Decision Making for Operations and Maintenance
    Spangler, Ryan M.
    Raeisinezhad, Mahsa
    Cole, Daniel G.
    [J]. NUCLEAR TECHNOLOGY, 2024,
  • [36] Research on Autonomous Decision-Making of UCAV Based on Deep Reinforcement Learning
    Wang, Linxiang
    Wei, Hongtao
    [J]. 2022 3RD INFORMATION COMMUNICATION TECHNOLOGIES CONFERENCE (ICTC 2022), 2022, : 122 - 126
  • [37] Close air combat maneuver decision based on deep stochastic game
    Ma, Wen
    Li, Hui
    Wang, Zhuang
    Huang, Zhiyong
    Wu, Zhaoxin
    Chen, Xiliang
    [J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2021, 43 (02): : 443 - 451
  • [38] Reinforcement learning for decision-making under deep uncertainty
    Pei, Zhihao
    Rojas-Arevalo, Angela M.
    de Haan, Fjalar J.
    Lipovetzky, Nir
    Moallemi, Enayat A.
    [J]. JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2024, 359
  • [39] A Deep Reinforcement Learning Algorithm Based on Short-Term Advantage for Air Game Decision-Making
    Xie, RongLei
    Huang, ChengJing
    Wang, Ziyi
    Han, Jin
    [J]. PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 3884 - 3894
  • [40] Deep Learning-based Decision-Making Model for the Submarine Evade Movement
    Ping, Huang
    Aiping, Huang
    Linwei, Tao
    [J]. OCEANS 2021: SAN DIEGO - PORTO, 2021,