Sequential robot imitation learning from observations

被引:7
|
作者
Tanwani, Ajay Kumar [1 ]
Yan, Andy [1 ]
Lee, Jonathan [1 ]
Calinon, Sylvain [2 ]
Goldberg, Ken [1 ]
机构
[1] Univ Calif Berkeley, 2111 Etcheverry Hall,2505 Hearst Ave, Berkeley, CA 94709 USA
[2] Idiap Res Inst, Valais, Switzerland
来源
基金
欧盟地平线“2020”;
关键词
Hidden semi-Markov model; robot learning; imitation learning; learning and adaptive systems; HIDDEN MARKOV-MODELS; MANIPULATION TASKS; MIXTURES; TUTORIAL;
D O I
10.1177/02783649211032721
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
This paper presents a framework to learn the sequential structure in the demonstrations for robot imitation learning. We first present a family of task-parameterized hidden semi-Markov models that extracts invariant segments (also called sub-goals or options) from demonstrated trajectories, and optimally follows the sampled sequence of states from the model with a linear quadratic tracking controller. We then extend the concept to learning invariant segments from visual observations that are sequenced together for robot imitation. We present Motion2Vec that learns a deep embedding space by minimizing a metric learning loss in a Siamese network: images from the same action segment are pulled together while being pushed away from randomly sampled images of other segments, and a time contrastive loss is used to preserve the temporal ordering of the images. The trained embeddings are segmented with a recurrent neural network, and subsequently used for decoding the end-effector pose of the robot. We first show its application to a pick-and-place task with the Baxter robot while avoiding a moving obstacle from four kinesthetic demonstrations only, followed by suturing task imitation from publicly available suturing videos of the JIGSAWS dataset with state-of-the-art 85 . 5 % segmentation accuracy and 0 . 94 cm error in position per observation on the test set.
引用
收藏
页码:1306 / 1325
页数:20
相关论文
共 50 条
  • [21] HYDRA: Hybrid Robot Actions for Imitation Learning
    Belkhale, Suneel
    Cui, Yuchen
    Sadigh, Dorsa
    [J]. CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [22] A Bayesian approach to imitation learning for robot navigation
    Ollis, Mark
    Huang, Wesley H.
    Happold, Michael
    [J]. 2007 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-9, 2007, : 715 - 720
  • [23] Sensing Jamming Strategy From Limited Observations: An Imitation Learning Perspective
    Fan, Youlin
    Jiu, Bo
    Pu, Wenqiang
    Li, Ziniu
    Li, Kang
    Liu, Hongwei
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 4098 - 4114
  • [24] Robot Manipulation Learning Using Generative Adversarial Imitation Learning
    Jabri, Mohamed Khalil
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4893 - 4894
  • [25] Learning the sequential coordinated behavior of teams from observations
    Kaminka, GA
    Fidanboylu, M
    Chang, A
    Veloso, MM
    [J]. ROBOCUP 2002: ROBOT SOCCER WORLD CUP VI, 2003, 2752 : 111 - 125
  • [26] Insertion of Pause in Drawing from Babbling for Robot's Developmental Imitation Learning
    Nishide, Shun
    Mochizuki, Keita
    Okuno, Hiroshi G.
    Ogata, Tetsuya
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 4785 - 4791
  • [27] Coarse-to-Fine Imitation Learning: Robot Manipulation from a Single Demonstration
    Johns, Edward
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 4613 - 4619
  • [28] From biologically realistic imitation to robot teaching via human motor learning
    Oztop, Erhan
    Babic, Jan
    Hale, Joshua
    Cheng, Gordon
    Kawato, Mitsuo
    [J]. NEURAL INFORMATION PROCESSING, PART II, 2008, 4985 : 214 - +
  • [29] Restored Action Generative Adversarial Imitation Learning from observation for robot manipulator
    Park, Jongcheon
    Han, Seungyong
    Lee, S. M.
    [J]. ISA TRANSACTIONS, 2022, 129 : 684 - 690
  • [30] Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction
    Sun, Wen
    Venkatraman, Arun
    Gordon, Geoffrey J.
    Boots, Byron
    Bagnell, J. Andrew
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70