Reinforcement learning-based decision-making for spacecraft pursuit-evasion game in elliptical orbits

被引:0
|
作者
Yu, Weizhuo [1 ,2 ]
Liu, Chuang [1 ,2 ]
Yue, Xiaokui [1 ,2 ]
机构
[1] Northwestern Polytech Univ, Sch Astronaut, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ Shenzhen, Res & Dev Inst, Shenzhen 518057, Peoples R China
基金
中国国家自然科学基金;
关键词
Pursuit-evasion game; Decision making; Deep deterministic policy gradient; Impulsive maneuver; Elliptical orbit; DYNAMICS; DOCKING;
D O I
10.1016/j.conengprac.2024.106072
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The orbital game theory is a fundamental technology for the cleanup of space debris to improve the safety of useful spacecraft in future, thus, this work develops a decision-making method by reinforcement learning technology to implement the pursuit-evasion game in elliptical orbits. The linearized Tschauner-Hempel equation describes the spacecraft's motion and the problem is formulated by game theory. Subsequently, an impulsive maneuvering model in a complete three-dimensional elliptical orbit is established. Then an algorithm based on deep deterministic policy gradient is designed to solve the optimal strategy for the pursuit-evasion game. For the successful decision of the pursuer, an extensive reward function is designed and improved considering the shortest time, optimal fuel, and collision avoidance. Finally, numerical simulations of a pursuit-evasion mission are performed to demonstrate the effectiveness and superiority of the proposed decision-making algorithm. The game success rate of the algorithm against targets with different maneuvering abilities is verified, which implies that the algorithm can be applied in extended scenarios.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] SPACECRAFT DECISION-MAKING AUTONOMY USING DEEP REINFORCEMENT LEARNING
    Harris, Andrew
    Teil, Thibaud
    Schaub, Hanspeter
    [J]. SPACEFLIGHT MECHANICS 2019, VOL 168, PTS I-IV, 2019, 168 : 1757 - 1775
  • [42] A DIMENSION-REDUCTION METHOD FOR THE FINITE-HORIZON SPACECRAFT PURSUIT-EVASION GAME
    Qi-Shuai Wang
    Li, Pei
    Lei, Ting
    Xiao-Feng Liu
    Guo-Ping Cai
    [J]. JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2023, 19 (03) : 1983 - 1998
  • [43] Escape Strategy Based on Apollonius Circles in the Pursuit-Evasion Game
    Huang, Yuting
    Luo, Yifan
    Nie, Yuhan
    Hou, Tianle
    Fu, Xiaowei
    [J]. PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 2143 - 2153
  • [44] A Visibility-Based Pursuit-Evasion Game with a Circular Obstacle
    Sourabh Bhattacharya
    Tamer Başar
    Naira Hovakimyan
    [J]. Journal of Optimization Theory and Applications, 2016, 171 : 1071 - 1082
  • [45] The Research Of Aircraft Pursuit-Evasion Game Based on Improved DQN
    Cui, Yameng
    Zheng, Chunsheng
    Liu, Jiarun
    Wang, Huixia
    Hu, Ruiguang
    Wang, Zhaolei
    [J]. PROCEEDINGS OF 2020 3RD INTERNATIONAL CONFERENCE ON UNMANNED SYSTEMS (ICUS), 2020, : 857 - 862
  • [46] Application of the hp-adaptive pseudospectral method in spacecraft orbit pursuit-evasion game
    Zhang, Zhongtao
    Zhang, Yakun
    Wang, Bin
    [J]. ADVANCES IN SPACE RESEARCH, 2024, 73 (03) : 1597 - 1610
  • [47] A Fuzzy Reinforcement Learning Algorithm Using a Predictor for Pursuit-Evasion Games
    Awheda, Mostafa D.
    Schwartz, Howard M.
    [J]. 2016 ANNUAL IEEE SYSTEMS CONFERENCE (SYSCON), 2016, : 186 - 193
  • [48] Apollonius Partitions Based Pursuit-evasion Game Strategies by Q-Learning Approach
    Wang, Qing
    Wu, KaiQi
    Ye, JianFeng
    Wu, YongBao
    Xue, Lei
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 4843 - 4848
  • [49] A Two Stage Learning Technique for Dual Learning in the Pursuit-Evasion Differential Game
    Al-Talabi, Ahmad A.
    Schwartz, Howard M.
    [J]. 2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2014, : 243 - 250
  • [50] Missile guidance laws based on pursuit-evasion game formulations
    Shinar, J
    Turetsky, V
    [J]. AUTOMATIC CONTROL IN AEROSPACE 2001, 2002, : 393 - 398