Orbital Interception Pursuit Strategy for Random Evasion Using Deep Reinforcement Learning

被引:9
|
作者
Jiang, Rui [1 ]
Ye, Dong [1 ]
Xiao, Yan [1 ]
Sun, Zhaowei [1 ]
Zhang, Zeming [1 ,2 ]
机构
[1] Harbin Inst Technol, Sch Astronaut, Res Ctr Satellite Technol, Harbin, Peoples R China
[2] Politecn Milan, Dept Aerosp Sci & Technol, Space Missions Engn Lab, Milan, Italy
来源
基金
中国国家自然科学基金;
关键词
Reinforcement learning;
D O I
10.34133/space.0086
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Aiming at the interception problem of noncooperative evader spacecraft adopting random maneuver strategy in one-to-one orbital pursuit-evasion problem, an interception strategy with decision-making training mechanism for the pursuer based on deep reinforcement learning is proposed. Its core purpose is to improve the success rate of interception in the environment with high uncertainty. First of all, a multi-impulse orbit transfer model of pursuer and evader is established, and a modular deep reinforcement learning training method is built. Second, an effective reward mechanism is proposed to train the pursuer to choose the impulse direction and impulse interval of the orbit transfer and to learn the successful interception strategy with the optimal fuel and time. Finally, with the evader taking a random maneuver decision in each episode of training, the trained decision-making strategy is applied to the pursuer, the corresponding interception success rate of which is further analyzed. The results show that the pursuer trained can obtain universal and variable interception strategy. In each round of pursuit-evasion, with random maneuver strategy of the evader, the pursuer can adopt similar optimal decisions to deal with high-dimensional environments and thoroughly random state space, maintaining high interception success rate.
引用
下载
收藏
页数:14
相关论文
共 50 条
  • [21] A Fuzzy Reinforcement Learning Algorithm Using a Predictor for Pursuit-Evasion Games
    Awheda, Mostafa D.
    Schwartz, Howard M.
    2016 ANNUAL IEEE SYSTEMS CONFERENCE (SYSCON), 2016, : 186 - 193
  • [22] Terminal-guidance Based Reinforcement-learning for Orbital Pursuit-evasion Game of the Spacecraft
    Geng Y.-Z.
    Yuan L.
    Huang H.
    Tang L.
    Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (05): : 974 - 984
  • [23] Web Bot Detection Evasion Using Deep Reinforcement Learning
    Iliou, Christos
    Kostoulas, Theodoros
    Tsikrika, Theodora
    Katos, Vasilis
    Vrochidis, Stefanos
    Kompatsiaris, Ioannis
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY AND SECURITY, ARES 2022, 2022,
  • [24] Autonomous drone interception with Deep Reinforcement Learning
    Bertoin, David
    Gauffriau, Adrien
    Grasset, Damien
    Gupta, Jayant Sen
    CEUR Workshop Proceedings, 2022, 3173
  • [25] An escape strategy in orbital pursuit-evasion games with incomplete information
    LI ZhenYu
    ZHU Hai
    LUO YaZhong
    Science China Technological Sciences, 2021, 64 (03) : 559 - 570
  • [26] An escape strategy in orbital pursuit-evasion games with incomplete information
    LI ZhenYu
    ZHU Hai
    LUO YaZhong
    Science China(Technological Sciences), 2021, (03) : 559 - 570
  • [27] A Pursuit-Evasion Algorithm Based on Hierarchical Reinforcement Learning
    Liu, Shuhua
    Liu, Jie
    Cheng, Yu
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2010, 13 (03): : 635 - 645
  • [28] A Guidance Strategy for Multi-player Pursuit and Evasion Game in Maneuvering Target Interception
    Wang, Ting-Kuo
    Fu, Li-Chen
    2013 9TH ASIAN CONTROL CONFERENCE (ASCC), 2013,
  • [29] Reinforcement learning for pursuit and evasion of microswimmers at low Reynolds number
    Borra, Francesco
    Biferale, Luca
    Cencini, Massimo
    Celani, Antonio
    PHYSICAL REVIEW FLUIDS, 2022, 7 (02)
  • [30] A Pursuit-Evasion Algorithm Based on Hierarchical Reinforcement Learning
    Liu, Jie
    Liu, Shuhua
    Wu, Hongyan
    Zhang, Yu
    2009 INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION, VOL II, 2009, : 482 - 486