TeachMe: Three-phase learning framework for robotic motion imitation based on interactive teaching and reinforcement learning

被引:1
|
作者
Kim, Taewoo [1 ]
Lee, Joo-Haeng [2 ]
机构
[1] Korea Univ Sci & Technol, Dept Comp Software & Engn, Daejeon, South Korea
[2] Elect & Telecommun Res Inst, Human Robot Interact Res Grp, Daejeon, South Korea
来源
2019 28TH IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN) | 2019年
关键词
D O I
10.1109/ro-man46459.2019.8956326
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Motion imitation is a fundamental communication skill for a robot; especially, as a nonverbal interaction with a human. Owing to kinematic configuration differences between the human and the robot, it is challenging to determine the appropriate mapping between the two pose domains. Moreover, technical limitations while extracting 3D motion details, such as wrist joint movements from human motion videos, results in significant challenges in motion retargeting. Explicit mapping over different motion domains indicates a considerably inefficient solution. To solve these problems, we propose a three-phase reinforcement learning scheme to enable a NAO robot to learn motions from human pose skeletons extracted from video inputs. Our learning scheme consists of three phases: (i) phase one for learning preparation, (ii) phase two for a simulation-based reinforcement learning, and (iii) phase three for a human-in-the-loop-based reinforcement learning. In phase one, embeddings of the motions of a human skeleton and robot are learned by an autoencoder. In phase two, the NAO robot learns a rough imitation skill using reinforcement learning that translates the learned embeddings. In the last phase, the robot learns motion details that were not considered in the previous phases by interactively setting rewards based on direct teaching instead of the method used in the previous phase. Especially, it is to be noted that a relatively smaller number of interactive inputs are required for motion details in phase three when compared to the large volume of training sets required for overall imitation in phase two. The experimental results demonstrate that the proposed method improves the imitation skills efficiently for hand waving and saluting motions obtained from NTU-DB.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Teaching Humanoid Robot Reaching Motion by Imitation and Reinforcement Learning
    Savevska, Kristina
    Ude, Ales
    ADVANCES IN SERVICE AND INDUSTRIAL ROBOTICS, RAAD 2023, 2023, 135 : 53 - 61
  • [2] Reinforcement learning in robotic motion planning by combined-based and self-imitation
    Luo, Sha
    Schomaker, Lambert
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2023, 170
  • [3] Robotic Manipulation with Reinforcement Learning, State Representation Learning, and Imitation Learning
    Chen, Hanxiao
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15769 - 15770
  • [4] Integration of Evolutionary Computing and Reinforcement Learning for Robotic Imitation Learning
    Tan, Huan
    Balajee, Kannan
    Lynn, DeRose
    2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 407 - 412
  • [5] Obstacle-Avoidable Robotic Motion Planning Framework Based on Deep Reinforcement Learning
    Liu, Huashan
    Ying, Fengkang
    Jiang, Rongxin
    Shan, Yinghao
    Shen, Bo
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024, 29 (06) : 4377 - 4388
  • [6] A framework for robotic clothing assistance by imitation learning
    Joshi, R. P.
    Koganti, N.
    Shibata, T.
    ADVANCED ROBOTICS, 2019, 33 (22) : 1156 - 1174
  • [7] Physics-based Motion Capture Imitation with Deep Reinforcement Learning
    Chentanez, Nuttapong
    Muller, Matthias
    Macklin, Miles
    Makoviychuk, Viktor
    Jeschke, Stefan
    ACM SIGGRAPH CONFERENCE ON MOTION, INTERACTION, AND GAMES (MIG 2018), 2018,
  • [8] AMI: Adaptive Motion Imitation Algorithm Based on Deep Reinforcement Learning
    Taghavi, Nazita
    Alqatamin, Moath H. A.
    Popa, Dan O.
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022,
  • [9] An Intelligent Framework for English Teaching through Deep Learning and Reinforcement Learning with Interactive Mobile Technology
    Hu J.
    Jin G.
    Int. J. Interact. Mob. Technol., 2024, 9 (74-87): : 74 - 87
  • [10] Robotic Arm Motion Planning Based on Residual Reinforcement Learning
    Zhou, Dongxu
    Jia, Ruiqing
    Yao, Haifeng
    Xie, Mingzuo
    2021 THE 13TH INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2021), 2021, : 89 - 94