Adaptive guidance and integrated navigation with reinforcement meta-learning

被引:48
|
作者
Gaudet, Brian [1 ]
Linares, Richard [3 ]
Furfaro, Roberto [1 ,2 ]
机构
[1] Univ Arizona, Dept Syst & Ind Engn, 1127 E James & Roger Way, Tucson, AZ 85721 USA
[2] Univ Arizona, Dept Aerosp & Mech Engn, Tucson, AZ 85721 USA
[3] MIT, Dept Aeronaut & Astronaut, Cambridge, MA 02139 USA
关键词
Guidance; Meta learning; Reinforcement learning; Landing guidance;
D O I
10.1016/j.actaastro.2020.01.007
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
This paper proposes a novel adaptive guidance system developed using reinforcement meta-learning with a recurrent policy and value function approximator. The use of recurrent network layers allows the deployed policy to adapt in real time to environmental forces acting on the agent. We compare the performance of the DR/DV guidance law, an RL agent with a non-recurrent policy, and an RL agent with a recurrent policy in four challenging environments with unknown but highly variable dynamics. These tasks include a safe Mars landing with random engine failure and a landing on an asteroid with unknown environmental dynamics. We also demonstrate the ability of a RL meta-learning optimized policy to implement a guidance law using observations consisting of only Doppler radar altimeter readings in a Mars landing environment, and LIDAR altimeter readings in an asteroid landing environment thus integrating guidance and navigation.
引用
下载
收藏
页码:180 / 190
页数:11
相关论文
共 50 条
  • [1] ADAPTIVE GUIDANCE WITH REINFORCEMENT META LEARNING
    Gaudet, Brian
    Linares, Richard
    SPACEFLIGHT MECHANICS 2019, VOL 168, PTS I-IV, 2019, 168 : 4091 - 4109
  • [2] Deep Reinforcement Meta-learning Guidance with Impact Angle Constraint
    Liang C.
    Wang W.-H.
    Lai C.
    Yuhang Xuebao/Journal of Astronautics, 2021, 42 (05): : 611 - 620
  • [3] Meta-learning in Reinforcement Learning
    Schweighofer, N
    Doya, K
    NEURAL NETWORKS, 2003, 16 (01) : 5 - 9
  • [4] Terminal adaptive guidance via reinforcement meta-learning: Applications to autonomous asteroid close-proximity operations
    Gaudet, Brian
    Linares, Richard
    Furfaro, Roberto
    ACTA ASTRONAUTICA, 2020, 171 : 1 - 13
  • [5] Meta-Learning with Adaptive Hyperparameters
    Baik, Sungyong
    Choi, Myungsub
    Choi, Janghoon
    Kim, Heewon
    Lee, Kyoung Mu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [6] Range-Aware Impact Angle Guidance Law With Deep Reinforcement Meta-Learning
    Liang, Chen
    Wang, Weihong
    Liu, Zhenghua
    Lai, Chao
    Wang, Sen
    IEEE ACCESS, 2020, 8 (08): : 152093 - 152104
  • [7] Learning to Learn How to Learn: Self-Adaptive Visual Navigation using Meta-Learning
    Wortsman, Mitchell
    Ehsani, Kiana
    Rastegari, Mohammad
    Farhadi, Ali
    Mottaghi, Roozbeh
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3743 - 6752
  • [8] Exploring the Relationship of Reward and Punishment in Reinforcement Learning Evolving Action Meta-Learning Functions in Goal Navigation
    Lowe, Robert
    Ziemke, Tom
    PROCEEDINGS OF THE 2013 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2013, : 140 - 147
  • [9] Meta-Learning for Multi-objective Reinforcement Learning
    Chen, Xi
    Ghadirzadeh, Ali
    Bjorkman, Marten
    Jensfelt, Patric
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 977 - 983
  • [10] A guidance of data stream characterization for meta-learning
    Debiaso Rossi, Andre Luis
    de Souza, Bruno Feres
    Soares, Carlos
    de Leon Ferreira de Carvalho, Andre Carlos Ponce
    INTELLIGENT DATA ANALYSIS, 2017, 21 (04) : 1015 - 1035