Air combat maneuver decision based on deep reinforcement learning with auxiliary reward

被引:1
|
作者
Zhang, Tingyu [1 ]
Wang, Yongshuai [1 ,2 ]
Sun, Mingwei [1 ,2 ]
Chen, Zengqiang [1 ,2 ]
机构
[1] College of Artificial Intelligence, Nankai University, Tianjin,300350, China
[2] Key Laboratory of Intelligent Robotics of Tianjin, Tianjin,300350, China
关键词
Deep learning - Network layers;
D O I
10.1007/s00521-024-09720-z
中图分类号
学科分类号
摘要
For air combat maneuvering decision, the sparse reward during the application of deep reinforcement learning limits the exploration efficiency of the agents. To address this challenge, we propose an auxiliary reward function considering the impact of angle, range, and altitude. Furthermore, we investigate the influences of the network nodes, layers, and the learning rate on decision system, and reasonable parameter ranges are provided, which can serve as a guideline. Finally, four typical air combat scenarios demonstrate good adaptability and effectiveness of the proposed scheme, and the auxiliary reward significantly improves the learning ability of deep Q network (DQN) by leading the agents to explore more intently. Compared with the original deep deterministic policy gradient and soft actor critic algorithm, the proposed method exhibits superior exploration capability with higher reward, indicating that the trained agent can adapt to different air combats with good performance. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
引用
收藏
页码:13341 / 13356
页数:15
相关论文
共 50 条
  • [1] Maneuver decision of UCAV in air combat based on deep reinforcement learning
    Li, Yongfeng
    Shi, Jingping
    Zhang, Weiguo
    Jiang, Wei
    [J]. Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2021, 53 (12): : 33 - 41
  • [2] Air Combat Maneuver Decision Based on Deep Reinforcement Learning and Game Theory
    Yin, Shuhui
    Kang, Yu
    Zhao, Yunbo
    Xue, Jian
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6939 - 6943
  • [3] Air combat maneuver decision-making test based on deep reinforcement learning
    Zhang, Sheng
    Zhou, Pan
    He, Yang
    Huang, Jiangtao
    Liu, Gang
    Tang, Jigang
    Jia, Huaizhi
    Du, Xin
    [J]. Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2023, 44 (10):
  • [4] Air Combat Maneuver Decision Method Based on A3C Deep Reinforcement Learning
    Fan, Zihao
    Xu, Yang
    Kang, Yuhang
    Luo, Delin
    [J]. MACHINES, 2022, 10 (11)
  • [5] Maneuver Decision of UAV in Short-Range Air Combat Based on Deep Reinforcement Learning
    Yang, Qiming
    Zhang, Jiandong
    Shi, Guoqing
    Hu, Jinwen
    Wu, Yong
    [J]. IEEE ACCESS, 2020, 8 : 363 - 378
  • [6] Autonomous Maneuver Decision Making of Dual-UAV Cooperative Air Combat Based on Deep Reinforcement Learning
    Hu, Jinwen
    Wang, Luhe
    Hu, Tianmi
    Guo, Chubing
    Wang, Yanxiong
    [J]. ELECTRONICS, 2022, 11 (03)
  • [7] Air Combat Maneuver Decision Based on Reinforcement Genetic Algorithm
    Xie, Jianfeng
    Yang, Qiming
    Dai, Shuling
    Wang, Wanyang
    Zhang, Jiandong
    [J]. Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2020, 38 (06): : 1330 - 1338
  • [8] Decision-making method for air combat maneuver based on explainable reinforcement learning
    Yang, Shuheng
    Zhang, Dong
    Xiong, Wei
    Ren, Zhi
    Tang, Shuo
    [J]. Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2024, 45 (18):
  • [9] Deep Reinforcement Learning-Based Air-to-Air Combat Maneuver Generation in a Realistic Environment
    Bae, Jung Ho
    Jung, Hoseong
    Kim, Seogbong
    Kim, Sungho
    Kim, Yong-Duk
    [J]. IEEE ACCESS, 2023, 11 : 26427 - 26440
  • [10] UAV cooperative air combat maneuver decision based on multi-agent reinforcement learning
    ZHANG Jiandong
    YANG Qiming
    SHI Guoqing
    LU Yi
    WU Yong
    [J]. Journal of Systems Engineering and Electronics, 2021, 32 (06) : 1421 - 1438