Research on Intelligent Maneuvering Decision in Close Air Combat Based on Deep Q Network

Cited by: 0
Authors
Zhang, Tingyu [1]
Zheng, Chen [2]
Sun, Mingwei [1]
Wang, Yongshuai [1]
Chen, Zengqiang [1]
Affiliations
[1] Nankai Univ, Coll Artificial Intelligence, Tianjin 300350, Peoples R China
[2] Beijing Inst Astronaut Syst Engn, Beijing 100076, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
air combat; autonomous maneuvering decision; deep reinforcement learning; DQN; reward function;
DOI
10.1109/DDCLS58216.2023.10166948
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
For Unmanned Combat Aerial Vehicle (UCAV) maneuvering decisions in close air combat, the design of the reinforcement learning (RL) reward function and the selection of hyperparameters are studied based on the deep Q network algorithm. Considering the angle, range, altitude, and speed factors, an auxiliary reward function is proposed to solve the sparse reward problem of RL. Meanwhile, to address the issue of hyperparameter selection in RL, the influence of the learning rate, the number of network nodes, and the number of layers on the decision-making system is explored, and a suitable range of parameters is given, which provides a reference for subsequent research on parameter selection. In addition, the simulation results show that the trained agent can obtain the optimal maneuver strategy in different air combat situations, but it is sensitive to RL hyperparameters.
Pages: 1044-1049
Number of pages: 6
Related papers
50 in total
  • [31] Research on Maneuvering Decision Algorithm Based on Improved Deep Deterministic Policy Gradient
    Jing, Xianyong
    Hou, Manyi
    Wu, Gaolong
    Ma, Zongcheng
    Tao, Zhongxiang
    IEEE ACCESS, 2022, 10 : 92426 - 92445
  • [32] Manual-Based Automated Maneuvering Decisions for Air-to-Air Combat
    Yang, Kwangjin
    Kim, Songhyon
    Lee, Younggun
    Jang, Changyoung
    Kim, Yong-Duk
    JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2024, 21 (01): : 28 - 36
  • [33] Air Combat Maneuver Decision Based on Deep Reinforcement Learning and Game Theory
    Yin, Shuhui
    Kang, Yu
    Zhao, Yunbo
    Xue, Jian
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6939 - 6943
  • [34] Air combat maneuver decision based on deep reinforcement learning with auxiliary reward
    Zhang T.
    Wang Y.
    Sun M.
    Chen Z.
    Neural Computing and Applications, 2024, 36 (21) : 13341 - 13356
  • [35] Reconstruction and evaluation of close air combat decision-making process based on fuzzy clustering
    Zuo, Jialiang
    Yang, Rennong
    Zhang, Ying
    Wu, Meng
    Xiao, Yuze
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2015, 36 (05): : 1650 - 1660
  • [36] Mean policy-based proximal policy optimization for maneuvering decision in multi-UAV air combat
    Zheng, Yifan
    Xin, Bin
    He, Bin
    Ding, Yulong
    Neural Computing and Applications, 2024, 36 (31) : 19667 - 19690
  • [37] The decision method research on air combat game based on uncertain interval information
    Chen, Xia
    Zhao, Mingming
    2012 FIFTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2012), VOL 1, 2012, : 456 - 459
  • [38] Value-filter based air-combat maneuvering optimization
    Fu Y.
    Deng X.
    Zhu Z.
    Zhang L.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2023, 44 (22):
  • [39] An Intelligent TCP Congestion Control Method Based on Deep Q Network
    Wang, Yinfeng
    Wang, Longxiang
    Dong, Xiaoshe
    FUTURE INTERNET, 2021, 13 (10):
  • [40] Research on Combat Deduction Platform Technology for Intelligent Operational Decision
    Liao, Xin
    Sun, Zheng-hao
    PROCEEDINGS OF 2019 CHINESE INTELLIGENT AUTOMATION CONFERENCE, 2020, 586 : 1 - 13