Research on Intelligent Maneuvering Decision in Close Air Combat Based on Deep Q Network

Cited by: 0

Authors:

Zhang, Tingyu ^{[1]}

Zheng, Chen ^{[2]}

Sun, Mingwei ^{[1]}

Wang, Yongshuai ^{[1]}

Chen, Zengqiang ^{[1]}

Affiliations:

[1] Nankai Univ, Coll Artificial Intelligence, Tianjin 300350, Peoples R China

[2] Beijing Inst Astronaut Syst Engn, Beijing 100076, Peoples R China

Source:

2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS | 2023

Funding:

National Natural Science Foundation of China;

Keywords:

air combat; autonomous maneuvering decision; deep reinforcement learning; DQN; reward function;

DOI:

10.1109/DDCLS58216.2023.10166948

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

For Unmanned Combat Aerial Vehicle (UCAV) maneuvering decisions in close air combat, the design of the reinforcement learning (RL) reward function and the selection of hyperparameters are studied based on the deep Q network (DQN) algorithm. Considering angle, range, altitude, and speed factors, an auxiliary reward function is proposed to address the sparse reward problem of RL. To address hyperparameter selection in RL, the influence of the learning rate, the number of network nodes, and the number of layers on the decision-making system is explored, and a suitable range of parameters is given as a reference for subsequent research on parameter selection. The simulation results show that the trained agent can obtain the optimal maneuver strategy in different air combat situations, but that it is sensitive to RL hyperparameters.
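The abstract names the four factors of the auxiliary reward (angle, range, altitude, speed) but this record does not give the formula. As a rough illustration only, a dense shaping reward of that kind might be sketched as below. Every name, weight, scale, and functional form here is a hypothetical assumption for illustration and is not taken from the paper.

```python
import math


def auxiliary_reward(off_boresight_angle, distance, altitude_diff, speed_diff,
                     weights=(0.4, 0.3, 0.15, 0.15),
                     ideal_range=1000.0, range_scale=500.0,
                     alt_scale=1000.0, speed_scale=100.0):
    """Hypothetical dense shaping reward in [-1, 1] (not the paper's formula).

    off_boresight_angle: angle in radians between own heading and the line
        of sight to the target (0 = pointing at the target).
    distance: range to the target in metres.
    altitude_diff: own altitude minus target altitude in metres.
    speed_diff: own speed minus target speed in m/s.
    All default weights and scales are illustrative assumptions.
    """
    # Angle term: +1 when pointing at the target, -1 when pointing away.
    r_angle = 1.0 - 2.0 * abs(off_boresight_angle) / math.pi
    # Range term: peaks at +1 at the assumed ideal range, decays toward -1.
    r_range = 2.0 * math.exp(-((distance - ideal_range) / range_scale) ** 2) - 1.0
    # Altitude and speed advantages, squashed into (-1, 1) with tanh.
    r_alt = math.tanh(altitude_diff / alt_scale)
    r_speed = math.tanh(speed_diff / speed_scale)

    w_angle, w_range, w_alt, w_speed = weights
    return (w_angle * r_angle + w_range * r_range
            + w_alt * r_alt + w_speed * r_speed)


if __name__ == "__main__":
    # Favourable geometry: on the target's tail at the assumed ideal range.
    print(auxiliary_reward(0.0, 1000.0, 0.0, 0.0))
    # Unfavourable geometry: pointing away, far from the ideal range.
    print(auxiliary_reward(math.pi, 5000.0, -500.0, -50.0))
```

In a DQN training loop, a term like this would typically be added to the sparse win/loss reward at every step, so that the agent receives a learning signal before an engagement ends.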


Pages: 1044 - 1049

Number of pages: 6

50 records in total

[31] Research on Maneuvering Decision Algorithm Based on Improved Deep Deterministic Policy Gradient
Jing, Xianyong
Hou, Manyi
Wu, Gaolong
Ma, Zongcheng
Tao, Zhongxiang
IEEE ACCESS, 2022, 10 : 92426 - 92445
[32] Manual-Based Automated Maneuvering Decisions for Air-to-Air Combat
Yang, Kwangjin
Kim, Songhyon
Lee, Younggun
Jang, Changyoung
Kim, Yong-Duk
JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2024, 21 (01) : 28 - 36
[33] Air Combat Maneuver Decision Based on Deep Reinforcement Learning and Game Theory
Yin, Shuhui
Kang, Yu
Zhao, Yunbo
Xue, Jian
2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022 : 6939 - 6943
[34] Air combat maneuver decision based on deep reinforcement learning with auxiliary reward
Zhang T.
Wang Y.
Sun M.
Chen Z.
Neural Computing and Applications, 2024, 36 (21) : 13341 - 13356
[35] Reconstruction and evaluation of close air combat decision-making process based on fuzzy clustering
Zuo, Jialiang
Yang, Rennong
Zhang, Ying
Wu, Meng
Xiao, Yuze
Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2015, 36 (05) : 1650 - 1660
[36] Mean policy-based proximal policy optimization for maneuvering decision in multi-UAV air combat
Zheng, Yifan
Xin, Bin
He, Bin
Ding, Yulong
Neural Computing and Applications, 2024, 36 (31) : 19667 - 19690
[37] The decision method research on air combat game based on uncertain interval information
Chen, Xia
Zhao, Mingming
2012 FIFTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2012), VOL 1, 2012 : 456 - 459
[38] Value-filter based air-combat maneuvering optimization
Fu Y.
Deng X.
Zhu Z.
Zhang L.
Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2023, 44 (22):
[39] An Intelligent TCP Congestion Control Method Based on Deep Q Network
Wang, Yinfeng
Wang, Longxiang
Dong, Xiaoshe
FUTURE INTERNET, 2021, 13 (10):
[40] Research on Combat Deduction Platform Technology for Intelligent Operational Decision
Liao, Xin
Sun, Zheng-hao
PROCEEDINGS OF 2019 CHINESE INTELLIGENT AUTOMATION CONFERENCE, 2020, 586 : 1 - 13
