Autonomous Maneuver Decision Making of Dual-UAV Cooperative Air Combat Based on Deep Reinforcement Learning

被引：39

作者：

Hu, Jinwen ^{[1
]}

Wang, Luhe ^{[1
]}

Hu, Tianmi ^{[1
]}

Guo, Chubing ^{[2
,3
]}

Wang, Yanxiong ^{[4
]}

机构：

[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China

[2] China Elect Technol Grp Corp, Res Inst 20, Key Lab Data Link Technol, Xian 710068, Peoples R China

[3] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China

[4] AVIC Chengdu Aircraft Design & Res Inst, Chengdu 610091, Peoples R China

来源：

ELECTRONICS | 2022年 / 11卷 / 03期

基金：

中国国家自然科学基金;

关键词：

air combat; maneuver decision; reinforcement learning; priority sampling; situation assessment;

D O I：

10.3390/electronics11030467

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Autonomous maneuver decision making is the core of intelligent warfare, which has become the main research direction to enable unmanned aerial vehicles (UAVs) to independently generate control commands and complete air combat tasks according to environmental situation information. In this paper, an autonomous maneuver decision making method is proposed for air combat by two cooperative UAVs, which is showcased by using the typical olive formation strategy as a practical example. First, a UAV situation assessment model based on the relative situation is proposed, which uses the real-time target and UAV location information to assess the current situation or threat. Second, the continuous air combat state space is discretized into a 13 dimensional space for dimension reduction and quantitative description, and 15 typical action commands instead of a continuous control space are designed to reduce the difficulty of UAV training. Third, a reward function is designed based on the situation assessment which includes the real-time gain due to maneuver and the final combat winning/losing gain. Fourth, an improved training data sampling strategy is proposed, which samples the data in the experience pool based on priority to accelerate the training convergence. Fifth, a hybrid autonomous maneuver decision strategy for dual-UAV olive formation air combat is proposed which realizes the UAV capability of obstacle avoidance, formation and confrontation. Finally, the air combat task of dual-UAV olive formation is simulated and the results show that the proposed method can help the UAVs defeat the enemy effectively and outperforms the deep Q network (DQN) method without priority sampling in terms of the convergence speed.

引用

页数：22

共 50 条

[31] UAV Autonomous Air Combat Decision-making Based on AM-SAC
Li, Zenglin
Li, Bo
Bai, Shuangxia
Meng, Bobo
[J]. Binggong Xuebao/Acta Armamentarii, 2023, 44 (09): : 2849 - 2858
[32] Enhancing multi-UAV air combat decision making via hierarchical reinforcement learning
Wang, Huan
Wang, Jintao
[J]. SCIENTIFIC REPORTS, 2024, 14 (01)
[33] Enhancing multi-UAV air combat decision making via hierarchical reinforcement learning
Huan Wang
Jintao Wang
[J]. Scientific Reports, 14
[34] Deep Reinforcement Learning-Based Air-to-Air Combat Maneuver Generation in a Realistic Environment
Bae, Jung Ho
Jung, Hoseong
Kim, Seogbong
Kim, Sungho
Kim, Yong-Duk
[J]. IEEE ACCESS, 2023, 11 : 26427 - 26440
[35] Research on UAV Air Combat Maneuver Decision Based on Decision Tree CART Algorithm
Liu, Haotian
Jin, Jiangfeng
Liu, Kun
Zhang, Jiaping
Niu, Yanan
[J]. PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 2638 - 2650
[36] 2-D Air Combat Maneuver Decision Using Reinforcement Learning
Tasbas, Ahmet Semih
Aydinli, Sevket Utku
[J]. 2021 7TH INTERNATIONAL CONFERENCE ON ENGINEERING AND EMERGING TECHNOLOGIES (ICEET 2021), 2021, : 740 - 745
[37] UAV air combat autonomous trajectory planning method based on robust adversarial reinforcement learning
Wang, Lixin
Zheng, Sizhuang
Tai, Shang
Liu, Hailiang
Yue, Ting
[J]. AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 153
[38] UAV Cooperative Air Combat Maneuvering Confrontation Based on Multi-agent Reinforcement Learning
Gong, Zihao
Xu, Yang
Luo, Delin
[J]. UNMANNED SYSTEMS, 2023, 11 (03) : 273 - 286
[39] Research on Air Confrontation Maneuver Decision-Making Method Based on Reinforcement Learning
Zhang, Xianbing
Liu, Guoqing
Yang, Chaojie
Wu, Jiang
[J]. ELECTRONICS, 2018, 7 (11):
[40] An Evolutionary Reinforcement Learning Approach for Autonomous Maneuver Decision in One-to-One Short-Range Air Combat
Baykal, Yasin
Baspinar, Baris
[J]. 2023 IEEE/AIAA 42ND DIGITAL AVIONICS SYSTEMS CONFERENCE, DASC, 2023,

← 1 2 3 4 5 →