Deep reinforcement learning-based air combat maneuver decision-making: literature review, implementation tutorial and future direction

被引：0

作者：

Xinwei Wang

Yihui Wang

Xichao Su

Lei Wang

Chen Lu

Haijun Peng

Jie Liu

机构：

[1] Dalian University of Technology,Department of Engineering Mechanics, State Key Laboratory of Structural Analysis, Optimization and CAE Software for Industrial Equipment

[2] Naval Aviation University,School of Mathematical Science

[3] Dalian University of Technology,School of Reliability and Systems Engineering

[4] Science and Technology on Reliability and Environmental Engineering Laboratory,Institute of Reliability Engineering

[5] Beihang University,War Research Institute

[6] Beihang University,undefined

[7] Academy of Military Sciences,undefined

来源：

Artificial Intelligence Review | 2024年 / 57卷

关键词：

Artificial intelligence; Unmanned aerial vehicle (UAV); Deep reinforcement learning (DRL); Air combat maneuver decision-making (ACMD);

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Nowadays, various innovative air combat paradigms that rely on unmanned aerial vehicles (UAVs), i.e., UAV swarm and UAV-manned aircraft cooperation, have received great attention worldwide. During the operation, UAVs are expected to perform agile and safe maneuvers according to the dynamic mission requirement and complicated battlefield environment. Deep reinforcement learning (DRL), which is suitable for sequential decision-making process, provides a powerful solution tool for air combat maneuver decision-making (ACMD), and hundreds of related research papers have been published in the last five years. However, as an emerging topic, there lacks a systematic review and tutorial. For this reason, this paper first provides a comprehensive literature review to help people grasp a whole picture of this field. It starts from the DRL itself and then extents to its application in ACMD. And special attentions are given to the design of reward function, which is the core of DRL-based ACMD. Then, a maneuver decision-making method based on one-to-one dogfight scenarios is proposed to enable UAV to win short-range air combat. The model establishment, program design, training methods and performance evaluation are described in detail. And the associated Python codes are available at gitee.com/wangyyhhh, thus enabling a quick-start for researchers to build their own ACMD applications by slight modifications. Finally, limitations of the considered model, as well as the possible future research direction for intelligent air combat, are also discussed.

引用

共 50 条

[31] Autonomous air combat decision-making of UAV based on parallel self-play reinforcement learning
Li, Bo
Huang, Jingyi
Bai, Shuangxia
Gan, Zhigang
Liang, Shiyang
Evgeny, Neretin
Yao, Shouwen
[J]. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023, 8 (01) : 64 - 81
[32] UAV cooperative air combat maneuver decision based on multi-agent reinforcement learning
ZHANG Jiandong
YANG Qiming
SHI Guoqing
LU Yi
WU Yong
[J]. Journal of Systems Engineering and Electronics, 2021, 32 (06) : 1421 - 1438
[33] UAV cooperative air combat maneuver decision based on multi-agent reinforcement learning
Zhang Jiandong
Yang Qiming
Shi Guoqing
Lu Yi
Wu Yong
[J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2021, 32 (06) : 1421 - 1438
[34] 2-D Air Combat Maneuver Decision Using Reinforcement Learning
Tasbas, Ahmet Semih
Aydinli, Sevket Utku
[J]. 2021 7TH INTERNATIONAL CONFERENCE ON ENGINEERING AND EMERGING TECHNOLOGIES (ICEET 2021), 2021, : 740 - 745
[35] Explainable, Deep Reinforcement Learning-Based Decision Making for Operations and Maintenance
Spangler, Ryan M.
Raeisinezhad, Mahsa
Cole, Daniel G.
[J]. NUCLEAR TECHNOLOGY, 2024,
[36] Research on Autonomous Decision-Making of UCAV Based on Deep Reinforcement Learning
Wang, Linxiang
Wei, Hongtao
[J]. 2022 3RD INFORMATION COMMUNICATION TECHNOLOGIES CONFERENCE (ICTC 2022), 2022, : 122 - 126
[37] Close air combat maneuver decision based on deep stochastic game
Ma, Wen
Li, Hui
Wang, Zhuang
Huang, Zhiyong
Wu, Zhaoxin
Chen, Xiliang
[J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2021, 43 (02): : 443 - 451
[38] Reinforcement learning for decision-making under deep uncertainty
Pei, Zhihao
Rojas-Arevalo, Angela M.
de Haan, Fjalar J.
Lipovetzky, Nir
Moallemi, Enayat A.
[J]. JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2024, 359
[39] A Deep Reinforcement Learning Algorithm Based on Short-Term Advantage for Air Game Decision-Making
Xie, RongLei
Huang, ChengJing
Wang, Ziyi
Han, Jin
[J]. PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 3884 - 3894
[40] Deep Learning-based Decision-Making Model for the Submarine Evade Movement
Ping, Huang
Aiping, Huang
Linwei, Tao
[J]. OCEANS 2021: SAN DIEGO - PORTO, 2021,

← 1 2 3 4 5 →