What Matters in Language Conditioned Robotic Imitation Learning Over Unstructured Data

被引:19
|
作者
Mees, Oier [1 ]
Hermann, Lukas [1 ]
Burgard, Wolfram [2 ]
机构
[1] Univ Freiburg, D-79085 Freiburg, Germany
[2] Tech Univ Nuremberg, D-90489 Nurnberg, Germany
关键词
Imitation learning; learning categories and concepts; machine learning for robot control;
D O I
10.1109/LRA.2022.3196123
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
A long-standing goal in robotics is to build robots that can perform a wide range of daily tasks from perceptions obtained with their onboard sensors and specified only via natural language. While recently substantial advances have been achieved in language-driven robotics by leveraging end-to-end learning from pixels, there is no clear and well-understood process for making various design choices due to the underlying variation in setups. In this letter, we conduct an extensive study of the most critical challenges in learning language conditioned policies from offline free-form imitation datasets. We further identify architectural and algorithmic techniques that improve performance, such as a hierarchical decomposition of the robot control learning, a multimodal transformer encoder, discrete latent plans and a self-supervised contrastive loss that aligns video and language representations. By combining the results of our investigation with our improved model components, we are able to present a novel approach that significantly outperforms the state of the art on the challenging language conditioned long-horizon robot manipulation CALVIN benchmark. We have open-sourced our implementation to facilitate future research in learning to perform many complex manipulation skills in a row specified with natural language.
引用
收藏
页码:11205 / 11212
页数:8
相关论文
共 24 条
  • [1] Language Conditioned Imitation Learning over Unstructured Data
    Lynch, Corey
    Sermanet, Pierre
    [J]. ROBOTICS: SCIENCE AND SYSTEM XVII, 2021,
  • [2] Language-Conditioned Imitation Learning With Base Skill Priors Under Unstructured Data
    Zhou, Hongkuan
    Bing, Zhenshan
    Yao, Xiangtong
    Su, Xiaojie
    Yang, Chenguang
    Huang, Kai
    Knoll, Alois
    [J]. IEEE Robotics and Automation Letters, 2024, 9 (11) : 9805 - 9812
  • [3] What Matters for Adversarial Imitation Learning?
    Orsini, Manu
    Raichuk, Anton
    Hussenot, Leonard
    Vincent, Damien
    Dadashi, Robert
    Girgin, Sertan
    Geist, Matthieu
    Bachem, Olivier
    Pietquin, Olivier
    Andrychowicz, Marcin
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [4] Language-Conditioned Imitation Learning for Robot Manipulation Tasks
    Stepputtis, Simon
    Campbell, Joseph
    Phielipp, Mariano
    Lee, Stefan
    Baral, Chitta
    Ben Amor, Heni
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [5] Imitation Learning based on Data Augmentation for Robotic Reaching
    Hoshino, Satoshi
    Hisada, Tomoki
    Oikawa, Ryota
    [J]. 2021 60TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2021, : 417 - 424
  • [6] Multimodal Data Matters: Language Model Pre-Training Over Structured and Unstructured Electronic Health Records
    Liu, Sicen
    Wang, Xiaolong
    Hou, Yongshuai
    Li, Ge
    Wang, Hui
    Xu, Hui
    Xiang, Yang
    Tang, Buzhou
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (01) : 504 - 514
  • [7] Grounding Language with Visual Affordances over Unstructured Data
    Mees, Oier
    Borja-Diaz, Jessica
    Burgard, Wolfram
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 11576 - 11582
  • [8] y Sampling Approach Matters: Active Learning for Robotic Language Acquisition
    Pillai, Nisha
    Raff, Edward
    Ferraro, Francis
    Matuszek, Cynthia
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 5191 - 5200
  • [9] Why Exposure Bias Matters: An Imitation Learning Perspective of Error Accumulation in Language Generation
    Arora, Kushal
    El Asri, Layla
    Bahuleyan, Hareesh
    Cheung, Jackie Chi Kit
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 700 - 710
  • [10] An imitation learning approach for the control of a low-cost low-accuracy robotic arm for unstructured environments
    Fabio Bonsignorio
    Cristiano Cervellera
    Danilo Macciò
    Enrica Zereik
    [J]. International Journal of Intelligent Robotics and Applications, 2023, 7 : 13 - 30