The Art of Imitation: Learning Long-Horizon Manipulation Tasks From Few Demonstrations

被引:1
|
作者
von Hartz, Jan Ole [1 ]
Welschehold, Tim [1 ]
Valada, Abhinav [1 ]
Boedecker, Joschka [1 ]
机构
[1] Univ Freiburg, Dept Comp Sci, D-79085 Freiburg, Germany
来源
关键词
Hidden Markov models; Trajectory; Visualization; Adaptation models; Robot kinematics; Gaussian mixture model; End effectors; Robot sensing systems; Motion segmentation; Data models; Imitation learning; learning from demonstration; sensorimotor learning; MOVEMENT; MIXTURE;
D O I
10.1109/LRA.2024.3487506
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Task Parametrized Gaussian Mixture Models (TP-GMM) are a sample-efficient method for learning object-centric robot manipulation tasks. However, there are several open challenges to applying TP-GMMs in the wild. In this work, we tackle three crucial challenges synergistically. First, end-effector velocities are non-Euclidean and thus hard to model using standard GMMs. We thus propose to factorize the robot's end-effector velocity into its direction and magnitude, and model them using Riemannian GMMs. Second, we leverage the factorized velocities to segment and sequence skills from complex demonstration trajectories. Through the segmentation, we further align skill trajectories and hence leverage time as a powerful inductive bias. Third, we present a method to automatically detect relevant task parameters per skill from visual observations. Our approach enables learning complex manipulation tasks from just five demonstrations while using only RGB-D observations. Extensive experimental evaluations on RLBench demonstrate that our approach achieves state-of-the-art performance with 20-fold improved sample efficiency. Our policies generalize across different environments, object instances, and object positions, while the learned skills are reusable.
引用
收藏
页码:11369 / 11376
页数:8
相关论文
共 50 条
  • [1] Hierarchical Learning from Demonstrations for Long-Horizon Tasks
    Li, Boyao
    Li, Jiayi
    Lu, Tao
    Cai, Yinghao
    Wang, Shuo
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 4545 - 4551
  • [2] Modular Reinforcement Learning In Long-Horizon Manipulation Tasks
    Vavrecka, Michal
    Kriz, Jonas
    Sokovnin, Nikita
    Sejnova, Gabriela
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT X, 2024, 15025 : 299 - 312
  • [3] GTI: Learning to Generalize Across Long-Horizon Tasks from Human Demonstrations
    Mandlekar, Ajay
    Xu, Danfei
    Martin-Martin, Roberto
    Savarese, Silvio
    Li Fei-Fei
    ROBOTICS: SCIENCE AND SYSTEMS XVI, 2020,
  • [4] Imitation Learning of Long-Horizon Manipulation Tasks Through Temporal Sub-action Sequencing
    Singh, Niharika
    Dutta, Samrat
    Jain, Aditya
    Prakash, Ravi
    Majumder, Anima
    Sinha, Rajesh
    Behera, Laxmidhar
    Sandhan, Tushar
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT II, 2024, 2010 : 347 - 361
  • [5] Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning
    Gupta, Abhishek
    Kumar, Vikash
    Lynch, Corey
    Levine, Sergey
    Hausman, Karol
    CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
  • [6] Learning a Skill-sequence-dependent Policy for Long-horizon Manipulation Tasks
    Li, Zhihao
    Sun, Zhenglong
    Su, Jionglong
    Zhang, Jiaming
    2021 IEEE 17TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2021, : 1229 - 1234
  • [7] Disturbance Injection Under Partial Automation: Robust Imitation Learning for Long-Horizon Tasks
    Tahara, Hirotaka
    Sasaki, Hikaru
    Oh, Hanbit
    Anarossi, Edgar
    Matsubara, Takamitsu
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (05) : 2724 - 2731
  • [8] SQUIRL: Robust and Efficient Learning from Video Demonstration of Long-Horizon Robotic Manipulation Tasks
    Wu, Bohan
    Xu, Feng
    He, Zhanpeng
    Gupta, Abhi
    Allen, Peter K.
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 9720 - 9727
  • [9] Skill Learning for Long-Horizon Sequential Tasks
    Alves, Joao
    Lau, Nuno
    Silva, Filipe
    PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2022, 2022, 13566 : 713 - 724
  • [10] Enhancing construction robot learning for collaborative and long-horizon tasks using generative adversarial imitation learning
    Li, Rui
    Zou, Zhengbo
    ADVANCED ENGINEERING INFORMATICS, 2023, 58