Sequential robot imitation learning from observations

被引：7

作者：

Tanwani, Ajay Kumar ^{[1
]}

Yan, Andy ^{[1
]}

Lee, Jonathan ^{[1
]}

Calinon, Sylvain ^{[2
]}

Goldberg, Ken ^{[1
]}

机构：

[1] Univ Calif Berkeley, 2111 Etcheverry Hall,2505 Hearst Ave, Berkeley, CA 94709 USA

[2] Idiap Res Inst, Valais, Switzerland

来源：

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH | 2021年 / 40卷 / 10-11期

基金：

欧盟地平线“2020”;

关键词：

Hidden semi-Markov model; robot learning; imitation learning; learning and adaptive systems; HIDDEN MARKOV-MODELS; MANIPULATION TASKS; MIXTURES; TUTORIAL;

D O I：

10.1177/02783649211032721

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

This paper presents a framework to learn the sequential structure in the demonstrations for robot imitation learning. We first present a family of task-parameterized hidden semi-Markov models that extracts invariant segments (also called sub-goals or options) from demonstrated trajectories, and optimally follows the sampled sequence of states from the model with a linear quadratic tracking controller. We then extend the concept to learning invariant segments from visual observations that are sequenced together for robot imitation. We present Motion2Vec that learns a deep embedding space by minimizing a metric learning loss in a Siamese network: images from the same action segment are pulled together while being pushed away from randomly sampled images of other segments, and a time contrastive loss is used to preserve the temporal ordering of the images. The trained embeddings are segmented with a recurrent neural network, and subsequently used for decoding the end-effector pose of the robot. We first show its application to a pick-and-place task with the Baxter robot while avoiding a moving obstacle from four kinesthetic demonstrations only, followed by suturing task imitation from publicly available suturing videos of the JIGSAWS dataset with state-of-the-art 85 . 5 % segmentation accuracy and 0 . 94 cm error in position per observation on the test set.

引用

页码：1306 / 1325

页数：20

共 50 条

[21] HYDRA: Hybrid Robot Actions for Imitation Learning
Belkhale, Suneel
Cui, Yuchen
Sadigh, Dorsa
[J]. CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
[22] A Bayesian approach to imitation learning for robot navigation
Ollis, Mark
Huang, Wesley H.
Happold, Michael
[J]. 2007 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-9, 2007, : 715 - 720
[23] Sensing Jamming Strategy From Limited Observations: An Imitation Learning Perspective
Fan, Youlin
Jiu, Bo
Pu, Wenqiang
Li, Ziniu
Li, Kang
Liu, Hongwei
[J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 4098 - 4114
[24] Robot Manipulation Learning Using Generative Adversarial Imitation Learning
Jabri, Mohamed Khalil
[J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4893 - 4894
[25] Learning the sequential coordinated behavior of teams from observations
Kaminka, GA
Fidanboylu, M
Chang, A
Veloso, MM
[J]. ROBOCUP 2002: ROBOT SOCCER WORLD CUP VI, 2003, 2752 : 111 - 125
[26] Insertion of Pause in Drawing from Babbling for Robot's Developmental Imitation Learning
Nishide, Shun
Mochizuki, Keita
Okuno, Hiroshi G.
Ogata, Tetsuya
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 4785 - 4791
[27] Coarse-to-Fine Imitation Learning: Robot Manipulation from a Single Demonstration
Johns, Edward
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 4613 - 4619
[28] From biologically realistic imitation to robot teaching via human motor learning
Oztop, Erhan
Babic, Jan
Hale, Joshua
Cheng, Gordon
Kawato, Mitsuo
[J]. NEURAL INFORMATION PROCESSING, PART II, 2008, 4985 : 214 - +
[29] Restored Action Generative Adversarial Imitation Learning from observation for robot manipulator
Park, Jongcheon
Han, Seungyong
Lee, S. M.
[J]. ISA TRANSACTIONS, 2022, 129 : 684 - 690
[30] Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction
Sun, Wen
Venkatraman, Arun
Gordon, Geoffrey J.
Boots, Byron
Bagnell, J. Andrew
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70

← 1 2 3 4 5 →