Research on Intelligent Maneuvering Decision in Close Air Combat Based on Deep Q Network

被引:0
|
作者
Zhangl, Tingyu [1 ]
Zheng, Chen [2 ]
Sun, Mingwei [1 ]
Wang, Yongshuai [1 ]
Chen, Zengqiang [1 ]
机构
[1] Nankai Univ, Coll Artificial Intelligence, Tianjin 300350, Peoples R China
[2] Beijing Inst Astronaut Syst Engn, Beijing 100076, Peoples R China
基金
中国国家自然科学基金;
关键词
air combat; autonomous maneuvering decision; deep reinforcement learning; DQN; reward function;
D O I
10.1109/DDCLS58216.2023.10166948
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For the Unmanned Combat Aerial Vehicle(UCAV)maneuvering decision in close air combat, the design of reinforcement learning(RL) reward function and the selection of hyperparameters are studied based on the deep Q network algorithm. Considering the angle, range, altitude, and speed factors, an auxiliary reward function is proposed to solve the sparse reward problem of RL. Meanwhile, aiming at the issue of hyperparameter selection in RL, the influence of learning rate, the number of network nodes, and layers on the decision-making system is explored, and a suitable range of parameters is given, which provides a reference for the subsequent research on parameter selection. In addition, the simulation results show that the trained agent can obtain the optimal maneuver strategy in different air combat situations, but it is sensitive to RL hyperparameters.
引用
收藏
页码:1044 / 1049
页数:6
相关论文
共 50 条
  • [21] Air combat intelligent decision-making method based on self-play and deep reinforcement learning
    Shan, Shengzhe
    Zhang, Weiwei
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2024, 45 (04):
  • [22] Research on Maneuvering Decisions for Multi-UAVs Air Combat
    Xie Rong-zeng
    Li Jie-ying
    Luo De-lin
    11TH IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2014, : 767 - 772
  • [23] Deep Reinforcement Learning-Based Decision Making for Six Degree of Freedom UCAV Close Range Air Combat
    Zhou, Pan
    Li, Ni
    Huang, Jiangtao
    Zhang, Sheng
    Zhou, Xiaoyu
    Liu, Gang
    2023 ASIA-PACIFIC INTERNATIONAL SYMPOSIUM ON AEROSPACE TECHNOLOGY, VOL II, APISAT 2023, 2024, 1051 : 320 - 334
  • [24] A Deep Q-Network Based Intelligent Decision-Making Approach for Cognitive Radar
    Tian, Yong
    Wang, Peng
    Hou, Xinyue
    Yu, Junpeng
    Peng, Xiaoyan
    Liao, Hongshu
    Gao, Lin
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2022, E105A (04) : 719 - 726
  • [25] Intelligent Aircraft Maneuvering Decision Based on CNN
    Li, Bo
    Liang, Shiyang
    Tian, Linyu
    Chen, Daqing
    PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2019), 2019,
  • [26] Research on Autonomous Maneuvering Decision of UCAV Based on Deep Reinforcement Learning
    Zhang, Yesheng
    Hi, Wei
    Gao, Yang
    Chang, Hongxing
    PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 230 - 235
  • [27] Perception Error-resistant Air Combat Maneuvering Decisions Based on Deep Reinforcement Learning
    Tian, Chengbin
    Li, Hui
    Chen, Xiliang
    Wu, Fengguo
    Gongcheng Kexue Yu Jishu/Advanced Engineering Sciences, 56 (06): : 270 - 282
  • [28] Maneuver decision of UCAV in air combat based on deep reinforcement learning
    Li, Yongfeng
    Shi, Jingping
    Zhang, Weiguo
    Jiang, Wei
    Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2021, 53 (12): : 33 - 41
  • [29] Maneuvering strategy generation algorithm for multi-UAV in close-range air combat based on deep reinforcement learning and self-play
    Kong W.-R.
    Zhou D.-Y.
    Zhao Y.-Y.
    Yang W.-S.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2022, 39 (02): : 352 - 362
  • [30] Research on UAV Air Combat Maneuver Decision Based on Decision Tree CART Algorithm
    Liu, Haotian
    Jin, Jiangfeng
    Liu, Kun
    Zhang, Jiaping
    Niu, Yanan
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 2638 - 2650