Sequential robot imitation learning from observations

被引：7

作者：

Tanwani, Ajay Kumar ^{[1
]}

Yan, Andy ^{[1
]}

Lee, Jonathan ^{[1
]}

Calinon, Sylvain ^{[2
]}

Goldberg, Ken ^{[1
]}

机构：

[1] Univ Calif Berkeley, 2111 Etcheverry Hall,2505 Hearst Ave, Berkeley, CA 94709 USA

[2] Idiap Res Inst, Valais, Switzerland

来源：

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH | 2021年 / 40卷 / 10-11期

基金：

欧盟地平线“2020”;

关键词：

Hidden semi-Markov model; robot learning; imitation learning; learning and adaptive systems; HIDDEN MARKOV-MODELS; MANIPULATION TASKS; MIXTURES; TUTORIAL;

D O I：

10.1177/02783649211032721

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

This paper presents a framework to learn the sequential structure in the demonstrations for robot imitation learning. We first present a family of task-parameterized hidden semi-Markov models that extracts invariant segments (also called sub-goals or options) from demonstrated trajectories, and optimally follows the sampled sequence of states from the model with a linear quadratic tracking controller. We then extend the concept to learning invariant segments from visual observations that are sequenced together for robot imitation. We present Motion2Vec that learns a deep embedding space by minimizing a metric learning loss in a Siamese network: images from the same action segment are pulled together while being pushed away from randomly sampled images of other segments, and a time contrastive loss is used to preserve the temporal ordering of the images. The trained embeddings are segmented with a recurrent neural network, and subsequently used for decoding the end-effector pose of the robot. We first show its application to a pick-and-place task with the Baxter robot while avoiding a moving obstacle from four kinesthetic demonstrations only, followed by suturing task imitation from publicly available suturing videos of the JIGSAWS dataset with state-of-the-art 85 . 5 % segmentation accuracy and 0 . 94 cm error in position per observation on the test set.

引用

页码：1306 / 1325

页数：20

共 50 条

[1] Off-Policy Imitation Learning from Observations
Zhu, Zhuangdi
Lin, Kaixiang
Dai, Bo
Zhou, Jiayu
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[2] To Follow or not to Follow: Selective Imitation Learning from Observations
Lee, Youngwoon
Hu, Edward S.
Yang, Zhengyu
Lim, Joseph J.
[J]. CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
[3] On-Policy Robot Imitation Learning from a Converging Supervisor
Balakrishna, Ashwin
Thananjeyan, Brijen
Lee, Jonathan
Li, Felix
Zahed, Arsh
Gonzalez, Joseph E.
Goldberg, Ken
[J]. CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
[4] Architecture for a Robot Learning by Imitation System
Bandera, J. P.
Molina-Tanco, L.
Rodriguez, J. A.
Bandera, A.
[J]. MELECON 2010: THE 15TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, 2010, : 87 - 92
[5] Efficient Robot Skill Learning: Grounded Simulation Learning and Imitation Learning from Observation
Stone, Peter
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC), 2021, : 3 - 3
[6] Robot learning-Beyond imitation
Yang, Guang-Zhong
[J]. SCIENCE ROBOTICS, 2019, 4 (26)
[7] Learning Responsive Robot Behavior by Imitation
Ben Amor, Heni
Vogt, David
Ewerton, Marco
Berger, Erik
Jung, Bernhard
Peters, Jan
[J]. 2013 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2013, : 3257 - 3264
[8] Imitation Learning from Observations by Minimizing Inverse Dynamics Disagreement
Yang, Chao
Ma, Xiaojian
Huang, Wenbing
Sun, Fuchun
Liu, Huaping
Huang, Junzhou
Gan, Chuang
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[9] A posteriori control densities: Imitation learning from partial observations
Lefebvre, Tom
Crevecoeur, Guillaume
[J]. PATTERN RECOGNITION LETTERS, 2023, 169 : 87 - 94
[10] Deep Imitation Learning of Sequential Fabric Smoothing From an Algorithmic Supervisor
Seita, Daniel
Ganapathi, Aditya
Hoque, Ryan
Hwang, Minho
Cen, Edward
Tanwani, Ajay Kumar
Balakrishna, Ashwin
Thananjeyan, Brijen
Ichnowski, Jeffrey
Jamali, Nawid
Yamane, Katsu
Iba, Soshi
Canny, John
Goldberg, Ken
[J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 9651 - 9658

← 1 2 3 4 5 →