Sequential robot imitation learning from observations

被引:7
|
作者
Tanwani, Ajay Kumar [1 ]
Yan, Andy [1 ]
Lee, Jonathan [1 ]
Calinon, Sylvain [2 ]
Goldberg, Ken [1 ]
机构
[1] Univ Calif Berkeley, 2111 Etcheverry Hall,2505 Hearst Ave, Berkeley, CA 94709 USA
[2] Idiap Res Inst, Valais, Switzerland
来源
基金
欧盟地平线“2020”;
关键词
Hidden semi-Markov model; robot learning; imitation learning; learning and adaptive systems; HIDDEN MARKOV-MODELS; MANIPULATION TASKS; MIXTURES; TUTORIAL;
D O I
10.1177/02783649211032721
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
This paper presents a framework to learn the sequential structure in the demonstrations for robot imitation learning. We first present a family of task-parameterized hidden semi-Markov models that extracts invariant segments (also called sub-goals or options) from demonstrated trajectories, and optimally follows the sampled sequence of states from the model with a linear quadratic tracking controller. We then extend the concept to learning invariant segments from visual observations that are sequenced together for robot imitation. We present Motion2Vec that learns a deep embedding space by minimizing a metric learning loss in a Siamese network: images from the same action segment are pulled together while being pushed away from randomly sampled images of other segments, and a time contrastive loss is used to preserve the temporal ordering of the images. The trained embeddings are segmented with a recurrent neural network, and subsequently used for decoding the end-effector pose of the robot. We first show its application to a pick-and-place task with the Baxter robot while avoiding a moving obstacle from four kinesthetic demonstrations only, followed by suturing task imitation from publicly available suturing videos of the JIGSAWS dataset with state-of-the-art 85 . 5 % segmentation accuracy and 0 . 94 cm error in position per observation on the test set.
引用
收藏
页码:1306 / 1325
页数:20
相关论文
共 50 条
  • [1] Off-Policy Imitation Learning from Observations
    Zhu, Zhuangdi
    Lin, Kaixiang
    Dai, Bo
    Zhou, Jiayu
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [2] To Follow or not to Follow: Selective Imitation Learning from Observations
    Lee, Youngwoon
    Hu, Edward S.
    Yang, Zhengyu
    Lim, Joseph J.
    [J]. CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
  • [3] On-Policy Robot Imitation Learning from a Converging Supervisor
    Balakrishna, Ashwin
    Thananjeyan, Brijen
    Lee, Jonathan
    Li, Felix
    Zahed, Arsh
    Gonzalez, Joseph E.
    Goldberg, Ken
    [J]. CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
  • [4] Architecture for a Robot Learning by Imitation System
    Bandera, J. P.
    Molina-Tanco, L.
    Rodriguez, J. A.
    Bandera, A.
    [J]. MELECON 2010: THE 15TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, 2010, : 87 - 92
  • [5] Efficient Robot Skill Learning: Grounded Simulation Learning and Imitation Learning from Observation
    Stone, Peter
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC), 2021, : 3 - 3
  • [6] Robot learning-Beyond imitation
    Yang, Guang-Zhong
    [J]. SCIENCE ROBOTICS, 2019, 4 (26)
  • [7] Learning Responsive Robot Behavior by Imitation
    Ben Amor, Heni
    Vogt, David
    Ewerton, Marco
    Berger, Erik
    Jung, Bernhard
    Peters, Jan
    [J]. 2013 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2013, : 3257 - 3264
  • [8] Imitation Learning from Observations by Minimizing Inverse Dynamics Disagreement
    Yang, Chao
    Ma, Xiaojian
    Huang, Wenbing
    Sun, Fuchun
    Liu, Huaping
    Huang, Junzhou
    Gan, Chuang
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [9] A posteriori control densities: Imitation learning from partial observations
    Lefebvre, Tom
    Crevecoeur, Guillaume
    [J]. PATTERN RECOGNITION LETTERS, 2023, 169 : 87 - 94
  • [10] Deep Imitation Learning of Sequential Fabric Smoothing From an Algorithmic Supervisor
    Seita, Daniel
    Ganapathi, Aditya
    Hoque, Ryan
    Hwang, Minho
    Cen, Edward
    Tanwani, Ajay Kumar
    Balakrishna, Ashwin
    Thananjeyan, Brijen
    Ichnowski, Jeffrey
    Jamali, Nawid
    Yamane, Katsu
    Iba, Soshi
    Canny, John
    Goldberg, Ken
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 9651 - 9658