Implicit Kinematic Policies: Unifying Joint and Cartesian Action Spaces in End-to-End Robot Learning

被引:4
|
作者
Ganapathi, Aditya [1 ,2 ]
Florence, Pete [1 ]
Varley, Jake [1 ]
Burns, Kaylee [1 ,3 ]
Goldberg, Ken [2 ]
Zeng, Andy [1 ]
机构
[1] Google, Robot, Mountain View, CA 94043 USA
[2] Univ Calif Berkeley, Berkeley, CA 90095 USA
[3] Stanford Univ, Stanford, CA USA
来源
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022) | 2022年
关键词
D O I
10.1109/ICRA46639.2022.9812165
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Action representation is an important yet often overlooked aspect in end-to-end robot learning with deep networks. Choosing one action space over another (e.g. target joint positions, or Cartesian end-effector poses) can result in surprisingly stark performance differences between various downstream tasks - and as a result, considerable research has been devoted to finding the right action space for a given application. However, in this work, we instead investigate how our models can discover and learn for themselves which action space to use. Leveraging recent work on implicit behavioral cloning, which takes both observations and actions as input, we demonstrate that it is possible to present the same action in multiple different spaces to the same policy - allowing it to learn inductive patterns from each space. Specifically, we study the benefits of combining Cartesian and joint action spaces in the context of learning manipulation skills. To this end, we present Implicit Kinematic Policies (IKP), which incorporates the kinematic chain as a differentiable module within the deep network. Quantitative experiments across several simulated continuous control tasks-from scooping piles of small objects, to lifting boxes with elbows, to precise block insertion with miscalibrated robots-suggest IKP not only learns complex prehensile and non-prehensile manipulation from pixels better than baseline alternatives, but also can learn to compensate for small joint encoder offset errors. Finally, we also run qualitative experiments on a real UR5e to demonstrate the feasibility of our algorithm on a physical robotic system with real data. See https://tinyurl.com/4wz3nf86 for code and supplementary material.
引用
收藏
页码:2656 / 2662
页数:7
相关论文
共 50 条
  • [21] End-to-end Video-level Representation Learning for Action Recognition
    Zhu, Jiagang
    Zhu, Zheng
    Zou, Wei
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 645 - 650
  • [22] Comparative Investigation of Deep Learning Components for End-to-end Implicit Discourse Relationship Parser
    Li, Dejian
    Lan, Man
    Wu, Yuanbin
    CHINESE COMPUTATIONAL LINGUISTICS, CCL 2019, 2019, 11856 : 143 - 155
  • [23] Understanding End-to-End Model-Based Reinforcement Learning Methods as Implicit Parameterization
    Gehring, Clement
    Kawaguchi, Kenji
    Huang, Jiaoyang
    Kaelbling, Leslie Pack
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [24] A Virtual End-to-End Learning System for Robot Navigation Based on Temporal Dependencies
    Zhang, Yanqiu
    Ge, Ruiquan
    Lyu, Lei
    Zhang, Jinling
    Lyu, Chen
    Yang, Xiaojuan
    IEEE ACCESS, 2020, 8 (08): : 134111 - 134123
  • [25] Towards End-to-End Control of a Robot Prosthetic Hand via Reinforcement Learning
    Sharif, Mohammadreza
    Erdogmus, Deniz
    Amato, Christopher
    Padir, Taskin
    2020 8TH IEEE RAS/EMBS INTERNATIONAL CONFERENCE FOR BIOMEDICAL ROBOTICS AND BIOMECHATRONICS (BIOROB), 2020, : 641 - 647
  • [26] An End-to-end Approach for Learning and Generating Complex Robot Motions from Demonstration
    Kordia, Ali H.
    Melo, Francisco S.
    16TH IEEE INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2020), 2020, : 1008 - 1014
  • [27] END-TO-END CROWD COUNTING VIA JOINT LEARNING LOCAL AND GLOBAL COUNT
    Shang, Chong
    Ai, Haizhou
    Bai, Bo
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 1215 - 1219
  • [28] End-to-end learning for joint depth and image reconstruction from diffracted rotation
    Mel, Mazen
    Siddiqui, Muhammad
    Zanuttigh, Pietro
    VISUAL COMPUTER, 2024, 40 (09): : 5961 - 5977
  • [29] End-to-End Learning for Joint Image Demosaicing, Denoising and Super-Resolution
    Xing, Wenzhu
    Egiazarian, Karen
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 3506 - 3515
  • [30] Image Representation of a City and Its Taxi Fleet for End-To-End Learning of Rebalancing Policies
    Gachter, Joel
    Zanardi, Alessandro
    Ruch, Claudio
    Frazzoli, Emilio
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 8076 - 8082