Q-LEARNING ALGORITHM FOR PATH-PLANNING TO MANEUVER THROUGH A SATELLITE CLUSTER

被引:0
|
作者
Chu, Xiaoyu [1 ]
Alfriend, Kyle T. [2 ]
Zhang, Jingrui [1 ]
Zhang, Yao [1 ]
机构
[1] Beijing Inst Technol, Sch Aerosp Engn, Beijing 100081, Peoples R China
[2] Texas A&M Univ, Dept Aerosp Engn, College Stn, TX 77843 USA
来源
关键词
D O I
暂无
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
In this paper, a path planning method for maneuvering through a satellite cluster using Q-learning is presented. An on-orbit servicing spacecraft is supposed to rendezvous with the failed central satellite of a formation and avoid collisions with the other satellites. The dynamic model of the satellite cluster is first established by Lawden equations. Then the theory of Q-learning is introduced and the reward shaping is specified to guide the learning system quickly to success. Furthermore, combining Q-leaming with deep neural networks, deep Q-network (DQN) is employed when the dimension of the problem is enormous. Finally, the rendezvous mission is simulated in 2D and 3D scenarios separately to demonstrate the effectiveness of the proposed method.
引用
收藏
页码:2063 / 2082
页数:20
相关论文
共 50 条
  • [41] Simulation for Path Planning of Autonomous Underwater Vehicle Using Flower Pollination Algorithm, Genetic Algorithm and Q-Learning
    Gautam, Utkarsh
    Malmathanraj, R.
    Srivastav, Chhavi
    [J]. 2015 INTERNATIONAL CONFERENCE ON COGNITIVE COMPUTING AND INFORMATION PROCESSING (CCIP), 2015,
  • [42] Path-planning algorithm for transportation of molecules through protein tunnel bottlenecks
    Byska, Jan
    Kolingerova, Ivana
    Kozlikova, Barbora
    Sochor, Jiri
    [J]. PROCEEDINGS SCCG: 2015 31ST SPRING CONFERENCE ON COMPUTER GRAPHICS, 2015, : 80 - 87
  • [43] Real-Time Path Planning Through Q-learning's Exploration Strategy Adjustment
    Kim, Howon
    Lee, WonChang
    [J]. 2021 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2021,
  • [44] Backward Q-learning: The combination of Sarsa algorithm and Q-learning
    Wang, Yin-Hao
    Li, Tzuu-Hseng S.
    Lin, Chih-Jui
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (09) : 2184 - 2193
  • [45] An immune plasma algorithm with Q-learning based pandemic management for path planning of unmanned aerial vehicles
    Aslan, Selcuk
    Demirci, Sercan
    [J]. EGYPTIAN INFORMATICS JOURNAL, 2024, 26
  • [46] A Cooperative Q-Learning Path Planning Algorithm for Origin-Destination Pairs in Urban Road Networks
    Zhang, Xiaoyong
    Li, Heng
    Peng, Jun
    Liu, Weirong
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [47] Implementation of a path-planning algorithm for a robot arm
    Rohrmoser, B
    Parlitz, C
    [J]. ROBOTIK 2002, 2002, 1679 : 59 - 64
  • [48] An Adaptive Conversion Speed Q-Learning Algorithm for Search and Rescue UAV Path Planning in Unknown Environments
    Wu, Jiehong
    Sun, Ya'nan
    Li, Danyang
    Shi, Junling
    Li, Xianwei
    Gao, Lijun
    Yu, Lei
    Han, Guangjie
    Wu, Jinsong
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (12) : 15391 - 15404
  • [49] A 3D Q-Learning Algorithm for Offline UAV Path Planning with Priority Shifting Rewards
    de Carvalho, Kevin Braathen
    Batista, Hiago B.
    de Oliveira, Iure L.
    Brandao, Alexandre S.
    [J]. 2022 LATIN AMERICAN ROBOTICS SYMPOSIUM (LARS), 2022 BRAZILIAN SYMPOSIUM ON ROBOTICS (SBR), AND 2022 WORKSHOP ON ROBOTICS IN EDUCATION (WRE), 2022, : 169 - 174
  • [50] The Application of a Hybrid Algorithm to the Submersible Path-Planning
    Lv, Chongyang
    Yu, Fei
    Yang, Na
    Feng, Jin
    Zou, Meikui
    [J]. ADVANCES IN SWARM INTELLIGENCE, ICSI 2012, PT I, 2012, 7331 : 470 - 478