Self-Supervised Correspondence in Visuomotor Policy Learning

被引:44
|
作者
Florence, Peter [1 ]
Manuelli, Lucas [1 ]
Tedrake, Russ [1 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
基金
美国国家科学基金会;
关键词
Deep learning in robotics and automation; perception for grasping and manipulation; visual learning; MANIPULATION;
D O I
10.1109/LRA.2019.2956365
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In this letter, we explore using self-supervised correspondence for improving the generalization performance and sample efficiency of visuomotor policy learning. Prior work has primarily used approaches such as autoencoding, pose-based losses, and end-to-end policy optimization in order to train the visual portion of visuomotor policies. We instead propose an approach using self-supervised dense visual correspondence training and show that this enables visuomotor policy learning with surprisingly high generalization performance with modest amounts of data. Using imitation learning, we demonstrate extensive hardware validation on challenging manipulation tasks with as few as 50 demonstrations. Our learned policies can generalize across classes of objects, react to deformable object configurations, and manipulate textureless symmetrical objects in a variety of backgrounds, all with closed-loop, real-time vision-based policies. Simulated imitation learning experiments suggest that correspondence training offers sample complexity and generalization benefits compared to autoencoding and end-to-end training.
引用
下载
收藏
页码:492 / 499
页数:8
相关论文
共 50 条
  • [1] Contrastive Transformation for Self-supervised Correspondence Learning
    Wang, Ning
    Zhou, Wengang
    Li, Hougiang
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 10174 - 10182
  • [2] Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning
    Li, Xiang (xiangli8@cs.stonybrook.edu), 1600, Institute of Electrical and Electronics Engineers Inc.
  • [3] Self-supervised learning in cooperative stereo vision correspondence
    Decoux, B
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 1997, 8 (01) : 101 - 111
  • [4] Self-Supervised Visual Descriptor Learning for Dense Correspondence
    Schmidt, Tanner
    Newcombe, Richard
    Fox, Dieter
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2017, 2 (02): : 420 - 427
  • [5] Discriminative Spatiotemporal Alignment for Self-Supervised Video Correspondence Learning
    Wei, Qiaoqiao
    Zhang, Hui
    Yong, Jun-Hai
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1841 - 1846
  • [6] Spatial-then-Temporal Self-Supervised Learning for Video Correspondence
    Li, Rui
    Liu, Dong
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2279 - 2288
  • [7] Joint-task Self-supervised Learning for Temporal Correspondence
    Li, Xueting
    Liu, Sifei
    De Mello, Shalini
    Wang, Xiaolong
    Kautz, Jan
    Yang, Ming-Hsuan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [8] Unified Mask Embedding and Correspondence Learning for Self-Supervised Video Segmentation
    Li, Liulei
    Wang, Wenguan
    Zhou, Tianfei
    Li, Jianwu
    Yang, Yi
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18706 - 18716
  • [9] SELF-SUPERVISED AUDIO SPATIALIZATION WITH CORRESPONDENCE CLASSIFIER
    Lu, Yu-Ding
    Lee, Hsin-Ying
    Tseng, Hung-Yu
    Yang, Ming-Hsuan
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3347 - 3351
  • [10] Reformulating Graph Kernels for Self-Supervised Space-Time Correspondence Learning
    Qin, Zheyun
    Lu, Xiankai
    Liu, Dongfang
    Nie, Xiushan
    Yin, Yilong
    Shen, Jianbing
    Loui, Alexander C.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 6543 - 6557