Training Robots Without Robots: Deep Imitation Learning for Master-to-Robot Policy Transfer

被引:7
|
作者
Kim, Heecheol [1 ]
Ohmura, Yoshiyuki [1 ]
Nagakubo, Akihiko [2 ]
Kuniyoshi, Yasuo [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Lab Intelligent Syst & Informat, Bunkyo ku, Tokyo 1130023, Japan
[2] Natl Inst Adv Ind Sci & Technol, Artificial Intelligence Res Ctr, Tsukuba, Ibaraki 3058568, Japan
关键词
Imitation learning; deep learning in grasping and manipulation; dual arm manipulation; force and tactile sensing; MOVEMENTS; TASK; EYE;
D O I
10.1109/LRA.2023.3262423
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Deep imitation learning is promising for robot manipulation because it only requires demonstration samples. In this study, deep imitation learning is applied to tasks that require force feedback. However, existing demonstration methods have deficiencies; bilateral teleoperation requires a complex control scheme and is expensive, and kinesthetic teaching suffers from visual distractions from human intervention. This research proposes a new master-to-robot (M2R) policy transfer system that does not require robots for teaching force feedback-based manipulation tasks. The human directly demonstrates a task using a controller. This controller resembles the kinematic parameters of the robot arm and uses the same end-effector with force/torque (F/T) sensors to measure the force feedback. Using this controller, the operator can feel force feedback without a bilateral system. The proposed method can overcome domain gaps between the master and robot using gaze-based imitation learning and a simple calibration method. Furthermore, a Transformer is applied to infer policy from F/T sensory input. The proposed system was evaluated on a bottle-cap-opening task that requires force feedback.
引用
收藏
页码:2906 / 2913
页数:8
相关论文
共 50 条
  • [31] Imitation and mirror systems in robots through Deep Modality Blending Networks
    Seker, M. Yunus
    Ahmetoglu, Alper
    Nagai, Yukie
    Asada, Minoru
    Oztop, Erhan
    Ugur, Emre
    NEURAL NETWORKS, 2022, 146 : 22 - 35
  • [32] Deep Transfer Learning for Wall Bulge Endpoints Regression for Autonomous Decoration Robots
    Eldosoky, Mahmoud A.
    Zeng, Fanyu
    Jiang, Xin
    Ge, Shuzhi Sam
    IEEE ACCESS, 2022, 10 : 73945 - 73955
  • [33] Cleaning Tasks Knowledge Transfer Between Heterogeneous Robots: a Deep Learning Approach
    Kim, Jaeseok
    Cauli, Nino
    Vicente, Pedro
    Damas, Bruno
    Bernardino, Alexandre
    Santos-Victor, Jose
    Cavallo, Filippo
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2020, 98 (01) : 191 - 205
  • [34] Cleaning Tasks Knowledge Transfer Between Heterogeneous Robots: a Deep Learning Approach
    Jaeseok Kim
    Nino Cauli
    Pedro Vicente
    Bruno Damas
    Alexandre Bernardino
    José Santos-Victor
    Filippo Cavallo
    Journal of Intelligent & Robotic Systems, 2020, 98 : 191 - 205
  • [35] Measurement of robot similarity to determine the best demonstrator for imitation in a group of heterogeneous robots
    Golombek, Raphael
    Richert, Willi
    Kleinjohann, Bernd
    Adelt, Philipp
    BIOLOGICALLY-INSPIRED COLLABORATIVE COMPUTING, 2008, 268 : 105 - 114
  • [36] Enhancing human-robot communication with a comprehensive language-conditioned imitation policy for embodied robots in smart cities
    Ju, Zhaoxun
    Wang, Hongbo
    Luo, Jingjing
    Sun, Fuchun
    COMPUTER COMMUNICATIONS, 2024, 22 : 177 - 187
  • [37] Policy gradient learning for quadruped soccer robots
    Cherubini, A.
    Giannone, F.
    Iocchi, L.
    Nardi, D.
    Palamara, P. F.
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2010, 58 (07) : 872 - 878
  • [38] Torque-Based Deep Reinforcement Learning for Task-and-Robot Agnostic Learning on Bipedal Robots Using Sim-to-Real Transfer
    Kim, Donghyeon
    Berseth, Glen
    Schwartz, Mathew
    Park, Jaeheung
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (10) : 6251 - 6258
  • [39] Reinforcement Imitation Learning Method Based on Collision Prediction for Robots Navigation
    Wang, Haojie
    Tao, Ye
    Lu, Chaofeng
    Computer Engineering and Applications, 60 (10): : 341 - 352
  • [40] IMITATION LEARNING OF DUAL-ARM MANIPULATION TASKS IN HUMANOID ROBOTS
    Asfour, T.
    Azad, P.
    Gyarfas, F.
    Dillmann, R.
    INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2008, 5 (02) : 183 - 202