A Deep Reinforcement Learning-Based Intelligent Maneuvering Strategy for the High-Speed UAV Pursuit-Evasion Game

被引:0
|
作者
Yan, Tian [1 ,2 ,3 ]
Liu, Can [1 ]
Gao, Mengjing [1 ]
Jiang, Zijian [1 ]
Li, Tong [1 ]
机构
[1] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Natl Key Lab Unmanned Aerial Vehicle Technol, Xian 710072, Peoples R China
[3] Northwestern Polytech Univ, Integrated Res & Dev Platform Unmanned Aerial Vehi, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
pursuit-evasion game; line-of-sight angle rate; high-speed UAV; deep reinforcement learning; PROPORTIONAL NAVIGATION;
D O I
10.3390/drones8070309
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Given the rapid advancements in kinetic pursuit technology, this paper introduces an innovative maneuvering strategy, denoted as LSRC-TD3, which integrates line-of-sight (LOS) angle rate correction with deep reinforcement learning (DRL) for high-speed unmanned aerial vehicle (UAV) pursuit-evasion (PE) game scenarios, with the aim of effectively evading high-speed and high-dynamic pursuers. In the challenging situations of the game, where both speed and maximum available overload are at a disadvantage, the playing field of UAVs is severely compressed, and the difficulty of evasion is significantly increased, placing higher demands on the strategy and timing of maneuvering to change orbit. While considering evasion, trajectory constraint, and energy consumption, we formulated the reward function by combining "terminal" and "process" rewards, as well as "strong" and "weak" incentive guidance to reduce pre-exploration difficulty and accelerate convergence of the game network. Additionally, this paper presents a correction factor for LOS angle rate into the double-delay deterministic gradient strategy (TD3), thereby enhancing the sensitivity of high-speed UAVs to changes in LOS rate, as well as the accuracy of evasion timing, which improves the effectiveness and adaptive capability of the intelligent maneuvering strategy. The Monte Carlo simulation results demonstrate that the proposed method achieves a high level of evasion performance-integrating energy optimization with the requisite miss distance for high-speed UAVs-and accomplishes efficient evasion under highly challenging PE game scenarios.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Intelligent Maneuver Strategy for a Hypersonic Pursuit-Evasion Game Based on Deep Reinforcement Learning
    Guo, Yunhe
    Jiang, Zijian
    Huang, Hanqiao
    Fan, Hongjia
    Weng, Weiye
    [J]. AEROSPACE, 2023, 10 (09)
  • [2] Intelligent Pursuit-Evasion Game Based on Deep Reinforcement Learning for Hypersonic Vehicles
    Gao, Mengjing
    Yan, Tian
    Li, Quancheng
    Fu, Wenxing
    Zhang, Jin
    [J]. AEROSPACE, 2023, 10 (01)
  • [3] A simplified pursuit-evasion game with reinforcement learning
    Paczolay, Gabor
    Harmati, Istvan
    [J]. Periodica polytechnica Electrical engineering and computer science, 2021, 65 (02): : 160 - 166
  • [4] Game of Drones: Multi-UAV Pursuit-Evasion Game With Online Motion Planning by Deep Reinforcement Learning
    Zhang, Ruilong
    Zong, Qun
    Zhang, Xiuyun
    Dou, Liqian
    Tian, Bailing
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 7900 - 7909
  • [5] Pursuit and Evasion Strategy of a Differential Game Based on Deep Reinforcement Learning
    Xu, Can
    Zhang, Yin
    Wang, Weigang
    Dong, Ligang
    [J]. FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 10
  • [6] A UAV Pursuit-Evasion Strategy Based on DDPG and Imitation Learning
    Fu, Xiaowei
    Zhu, Jindong
    Wei, Zhaoying
    Wang, Hui
    Li, Sili
    [J]. INTERNATIONAL JOURNAL OF AEROSPACE ENGINEERING, 2022, 2022
  • [7] Pursuit-evasion game strategy of USV based on deep reinforcement learning in complex multi-obstacle environment
    Qu, Xiuqing
    Gan, Wenhao
    Song, Dalei
    Zhou, Liqin
    [J]. OCEAN ENGINEERING, 2023, 273
  • [8] Strategy solution of non-cooperative target pursuit-evasion game based on branching deep reinforcement learning
    Liu, Bingyan
    Ye, Xiongbing
    Gao, Yong
    Wang, Xinbo
    Ni, Lei
    [J]. Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2020, 41 (10):
  • [9] Reinforcement learning-based decision-making for spacecraft pursuit-evasion game in elliptical orbits
    Yu, Weizhuo
    Liu, Chuang
    Yue, Xiaokui
    [J]. CONTROL ENGINEERING PRACTICE, 2024, 153
  • [10] An Application of Continuous Deep Reinforcement Learning Approach to Pursuit-Evasion Differential Game
    Wang, Maolin
    Wang, Lixin
    Yue, Ting
    [J]. PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 1150 - 1155