Autonomous Maneuver Decision Making of Dual-UAV Cooperative Air Combat Based on Deep Reinforcement Learning

被引：41

作者：

Hu, Jinwen ^{[1
]}

Wang, Luhe ^{[1
]}

Hu, Tianmi ^{[1
]}

Guo, Chubing ^{[2
,3
]}

Wang, Yanxiong ^{[4
]}

机构：

[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China

[2] China Elect Technol Grp Corp, Res Inst 20, Key Lab Data Link Technol, Xian 710068, Peoples R China

[3] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China

[4] AVIC Chengdu Aircraft Design & Res Inst, Chengdu 610091, Peoples R China

来源：

ELECTRONICS | 2022年 / 11卷 / 03期

基金：

中国国家自然科学基金;

关键词：

air combat; maneuver decision; reinforcement learning; priority sampling; situation assessment;

D O I：

10.3390/electronics11030467

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Autonomous maneuver decision making is the core of intelligent warfare, which has become the main research direction to enable unmanned aerial vehicles (UAVs) to independently generate control commands and complete air combat tasks according to environmental situation information. In this paper, an autonomous maneuver decision making method is proposed for air combat by two cooperative UAVs, which is showcased by using the typical olive formation strategy as a practical example. First, a UAV situation assessment model based on the relative situation is proposed, which uses the real-time target and UAV location information to assess the current situation or threat. Second, the continuous air combat state space is discretized into a 13 dimensional space for dimension reduction and quantitative description, and 15 typical action commands instead of a continuous control space are designed to reduce the difficulty of UAV training. Third, a reward function is designed based on the situation assessment which includes the real-time gain due to maneuver and the final combat winning/losing gain. Fourth, an improved training data sampling strategy is proposed, which samples the data in the experience pool based on priority to accelerate the training convergence. Fifth, a hybrid autonomous maneuver decision strategy for dual-UAV olive formation air combat is proposed which realizes the UAV capability of obstacle avoidance, formation and confrontation. Finally, the air combat task of dual-UAV olive formation is simulated and the results show that the proposed method can help the UAVs defeat the enemy effectively and outperforms the deep Q network (DQN) method without priority sampling in terms of the convergence speed.

引用

页数：22

共 50 条

[1] UAV cooperative air combat maneuver decision based on multi-agent reinforcement learning
ZHANG Jiandong
YANG Qiming
SHI Guoqing
LU Yi
WU Yong
[J]. Journal of Systems Engineering and Electronics, 2021, 32 (06) : 1421 - 1438
[2] UAV cooperative air combat maneuver decision based on multi-agent reinforcement learning
Zhang Jiandong
Yang Qiming
Shi Guoqing
Lu Yi
Wu Yong
[J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2021, 32 (06) : 1421 - 1438
[3] Maneuver Decision of UAV in Short-Range Air Combat Based on Deep Reinforcement Learning
Yang, Qiming
Zhang, Jiandong
Shi, Guoqing
Hu, Jinwen
Wu, Yong
[J]. IEEE ACCESS, 2020, 8 : 363 - 378
[4] A UAV Maneuver Decision-Making Algorithm for Autonomous Airdrop Based on Deep Reinforcement Learning
Li, Ke
Zhang, Kun
Zhang, Zhenchong
Liu, Zekun
Hua, Shuai
He, Jianliang
[J]. SENSORS, 2021, 21 (06)
[5] Maneuver decision of UCAV in air combat based on deep reinforcement learning
Li, Yongfeng
Shi, Jingping
Zhang, Weiguo
Jiang, Wei
[J]. Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2021, 53 (12): : 33 - 41
[6] Air Combat Maneuver Decision Based on Deep Reinforcement Learning and Game Theory
Yin, Shuhui
Kang, Yu
Zhao, Yunbo
Xue, Jian
[J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6939 - 6943
[7] Air combat maneuver decision based on deep reinforcement learning with auxiliary reward
Tingyu Zhang
Yongshuai Wang
Mingwei Sun
Zengqiang Chen
[J]. Neural Computing and Applications, 2024, 36 (21) : 13341 - 13356
[8] UAV Air Combat Autonomous Maneuver Decision Based on DDPG Algorithm
Yang, Qiming
Zhu, Yan
Zhang, Jiandong
Qiao, Shasha
Liu, Jieling
[J]. 2019 IEEE 15TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2019, : 37 - 42
[9] Autonomous guidance maneuver control and decision-making algorithm based on deep reinforcement learning UAV route
Zhang, Kun
Li, Ke
Shi, Haotian
Zhang, Zhenchong
Liu, Zekun
[J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2020, 42 (07): : 1567 - 1574
[10] Autonomous air combat decision-making of UAV based on parallel self-play reinforcement learning
Li, Bo
Huang, Jingyi
Bai, Shuangxia
Gan, Zhigang
Liang, Shiyang
Evgeny, Neretin
Yao, Shouwen
[J]. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023, 8 (01) : 64 - 81

← 1 2 3 4 5 →