Impulsive maneuver strategy for multi-agent orbital pursuit-evasion game under sparse rewards

Cited by: 1
Authors
Wang, Hongbo [1]
Zhang, Yao [1]
Affiliations
[1] Beijing Inst Technol, Sch Aerosp Engn, Beijing 100081, Peoples R China
Keywords
Orbital pursuit-evasion game; Impulsive thrust; Reinforcement learning; Hierarchical network; Hindsight experience
DOI
10.1016/j.ast.2024.109618
Chinese Library Classification number
V [Aviation, Aerospace]
Discipline classification code
08; 0825
Abstract
To address the subjectivity of dense reward design in orbital pursuit-evasion games with multiple optimization objectives, this paper proposes a reinforcement learning method with a hierarchical network structure that guides game strategies under sparse rewards. First, to overcome the convergence difficulties of reinforcement learning training under sparse rewards, a hierarchical network structure based on hindsight experience replay is proposed. Second, because orbital dynamics impose strict constraints on the spacecraft state space, the reachable domain method is introduced to refine the subgoal space of the hierarchical network, making subgoals easier to achieve. Finally, by adopting a centralized training-layered execution approach, a complete multi-agent reinforcement learning method with the hierarchical network structure is established, enabling the networks at each level to learn effectively in parallel in sparse reward environments. Numerical simulations indicate that, under the single-agent reinforcement learning framework, the proposed method is more stable in the late training stage and improves early-stage exploration efficiency by 38.89% to 55.56% relative to the baseline method. Under the multi-agent reinforcement learning framework, as the relative distance decreases, the subgoals generated by the hierarchical network shift from long-term to short-term, consistent with human behavioral logic.
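The abstract names hindsight experience replay as the basis for learning under sparse rewards. The sketch below illustrates the generic "future" relabeling idea only, not the paper's hierarchical or reachable-domain method. All names (`Transition`, `sparse_reward`, `relabel_with_hindsight`), the 0/-1 reward convention, and the tolerance are assumptions made for illustration.

```python
import random
from typing import List, NamedTuple, Optional, Tuple

State = Tuple[float, ...]


class Transition(NamedTuple):
    """One step of a goal-conditioned episode (hypothetical layout)."""
    state: State
    action: State
    next_state: State
    goal: State
    reward: float


def sparse_reward(achieved: State, goal: State, tol: float = 1e-3) -> float:
    """Assumed sparse reward: 0 if within tol of the goal, otherwise -1."""
    dist = sum((a - g) ** 2 for a, g in zip(achieved, goal)) ** 0.5
    return 0.0 if dist <= tol else -1.0


def relabel_with_hindsight(
    episode: List[Transition],
    k: int = 4,
    rng: Optional[random.Random] = None,
) -> List[Transition]:
    """Return the episode plus k relabeled copies of every transition.

    Each copy replaces the original goal with a state actually reached at
    the same or a later step, so some stored rewards become non-negative
    even if the original goal was never achieved.
    """
    rng = rng or random.Random(0)
    out: List[Transition] = []
    for t, tr in enumerate(episode):
        out.append(tr)  # keep the original transition
        for _ in range(k):
            # Pick a step from t to the end of the episode (inclusive).
            future = episode[rng.randint(t, len(episode) - 1)]
            new_goal = future.next_state
            out.append(
                tr._replace(
                    goal=new_goal,
                    reward=sparse_reward(tr.next_state, new_goal),
                )
            )
    return out
```

In an episode that never reaches its original goal, every original reward is -1, but the relabeled copies whose substituted goal equals their own `next_state` receive reward 0, which gives the learner a non-trivial signal.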
Pages: 16