Model-Based Meta-Reinforcement Learning for Flight With Suspended Payloads

被引:50
|
作者
Belkhale, Suneel [1 ]
Li, Rachel [1 ]
Kahn, Gregory [1 ]
McAllister, Rowan [1 ]
Calandra, Roberto [2 ]
Levine, Sergey [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94710 USA
[2] Facebook AI Res, Menlo Pk, CA 94025 USA
来源
基金
美国国家科学基金会;
关键词
Machine learning for robot control; probabilistic inference; reinforcement learning; LEVEL CONTROL;
D O I
10.1109/LRA.2021.3057046
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Transporting suspended payloads is challenging for autonomous aerial vehicles because the payload can cause significant and unpredictable changes to the robot's dynamics. These changes can lead to suboptimal flight performance or even catastrophic failure. Although adaptive control and learning-based methods can in principle adapt to changes in these hybrid robotpayload systems, rapid mid-flight adaptation to payloads that have a priori unknown physical properties remains an open problem. We propose a meta-learning approach that "learns how to learn" models of altered dynamics within seconds of post-connection flight data. Our experiments demonstrate that our online adaptation approach outperforms non-adaptive methods on a series of challenging suspended payload transportation tasks. Videos and other supplemental material are available on our website: https://sites.google.com/view/meta-rl-for-flight
引用
收藏
页码:1471 / 1478
页数:8
相关论文
共 50 条
  • [1] Model-based Adversarial Meta-Reinforcement Learning
    Lin, Zichuan
    Thomas, Garrett
    Yang, Guangwen
    Ma, Tengyu
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [2] Learning Locomotion Skills via Model-based Proximal Meta-Reinforcement Learning
    Xiao, Qing
    Cao, Zhengcai
    Zhou, Mengchu
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 1545 - 1550
  • [3] Industrial Insert Robotic Assembly Based on Model-based Meta-Reinforcement Learning
    Liu, Dong
    Zhang, Xiamin
    Du, Yu
    Gao, Dan
    Wang, Minghao
    Cong, Ming
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE-ROBIO 2021), 2021, : 1508 - 1512
  • [4] A Federated Meta-Reinforcement Learning Algorithm Based on Gradient Correction
    Qin, Zerui
    Yue, Sheng
    [J]. PROCEEDINGS OF THE ACM TURING AWARD CELEBRATION CONFERENCE-CHINA 2024, ACM-TURC 2024, 2024, : 220 - 221
  • [5] Meta-Reinforcement Learning Algorithm Based on Reward and Dynamic Inference
    Chen, Jinhao
    Zhang, Chunhong
    Hu, Zheng
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT III, PAKDD 2024, 2024, 14647 : 223 - 234
  • [6] Prefrontal cortex as a meta-reinforcement learning system
    Jane X. Wang
    Zeb Kurth-Nelson
    Dharshan Kumaran
    Dhruva Tirumala
    Hubert Soyer
    Joel Z. Leibo
    Demis Hassabis
    Matthew Botvinick
    [J]. Nature Neuroscience, 2018, 21 : 860 - 868
  • [7] Offline Meta-Reinforcement Learning for Industrial Insertion
    Zhao, Tony Z.
    Luo, Jianlan
    Sushkov, Oleg
    Pevceviciute, Rugile
    Heess, Nicolas
    Scholz, Jon
    Schaal, Stefan
    Levine, Sergey
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 6386 - 6393
  • [8] On the Convergence Theory of Debiased Model-Agnostic Meta-Reinforcement Learning
    Fallah, Alireza
    Georgiev, Kristian
    Mokhtari, Aryan
    Ozdaglar, Asuman
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [9] Offline Meta-Reinforcement Learning with Advantage Weighting
    Mitchell, Eric
    Rafailov, Rafael
    Peng, Xue Bin
    Levine, Sergey
    Finn, Chelsea
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [10] Meta-Reinforcement Learning in Nonstationary and Nonparametric Environments
    Bing, Zhenshan
    Knak, Lukas
    Cheng, Long
    Morin, Fabrice O.
    Huang, Kai
    Knoll, Alois
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 15