TeachMe: Three-phase learning framework for robotic motion imitation based on interactive teaching and reinforcement learning

被引：1

作者：

Kim, Taewoo ^{[1
]}

Lee, Joo-Haeng ^{[2
]}

机构：

[1] Korea Univ Sci & Technol, Dept Comp Software & Engn, Daejeon, South Korea

[2] Elect & Telecommun Res Inst, Human Robot Interact Res Grp, Daejeon, South Korea

来源：

2019 28TH IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN) | 2019年

关键词：

D O I：

10.1109/ro-man46459.2019.8956326

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Motion imitation is a fundamental communication skill for a robot; especially, as a nonverbal interaction with a human. Owing to kinematic configuration differences between the human and the robot, it is challenging to determine the appropriate mapping between the two pose domains. Moreover, technical limitations while extracting 3D motion details, such as wrist joint movements from human motion videos, results in significant challenges in motion retargeting. Explicit mapping over different motion domains indicates a considerably inefficient solution. To solve these problems, we propose a three-phase reinforcement learning scheme to enable a NAO robot to learn motions from human pose skeletons extracted from video inputs. Our learning scheme consists of three phases: (i) phase one for learning preparation, (ii) phase two for a simulation-based reinforcement learning, and (iii) phase three for a human-in-the-loop-based reinforcement learning. In phase one, embeddings of the motions of a human skeleton and robot are learned by an autoencoder. In phase two, the NAO robot learns a rough imitation skill using reinforcement learning that translates the learned embeddings. In the last phase, the robot learns motion details that were not considered in the previous phases by interactively setting rewards based on direct teaching instead of the method used in the previous phase. Especially, it is to be noted that a relatively smaller number of interactive inputs are required for motion details in phase three when compared to the large volume of training sets required for overall imitation in phase two. The experimental results demonstrate that the proposed method improves the imitation skills efficiently for hand waving and saluting motions obtained from NTU-DB.

引用

页数：8

共 50 条

[1] Teaching Humanoid Robot Reaching Motion by Imitation and Reinforcement Learning
Savevska, Kristina
Ude, Ales
ADVANCES IN SERVICE AND INDUSTRIAL ROBOTICS, RAAD 2023, 2023, 135 : 53 - 61
[2] Reinforcement learning in robotic motion planning by combined-based and self-imitation
Luo, Sha
Schomaker, Lambert
ROBOTICS AND AUTONOMOUS SYSTEMS, 2023, 170
[3] Robotic Manipulation with Reinforcement Learning, State Representation Learning, and Imitation Learning
Chen, Hanxiao
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15769 - 15770
[4] Integration of Evolutionary Computing and Reinforcement Learning for Robotic Imitation Learning
Tan, Huan
Balajee, Kannan
Lynn, DeRose
2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 407 - 412
[5] Obstacle-Avoidable Robotic Motion Planning Framework Based on Deep Reinforcement Learning
Liu, Huashan
Ying, Fengkang
Jiang, Rongxin
Shan, Yinghao
Shen, Bo
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024, 29 (06) : 4377 - 4388
[6] A framework for robotic clothing assistance by imitation learning
Joshi, R. P.
Koganti, N.
Shibata, T.
ADVANCED ROBOTICS, 2019, 33 (22) : 1156 - 1174
[7] Physics-based Motion Capture Imitation with Deep Reinforcement Learning
Chentanez, Nuttapong
Muller, Matthias
Macklin, Miles
Makoviychuk, Viktor
Jeschke, Stefan
ACM SIGGRAPH CONFERENCE ON MOTION, INTERACTION, AND GAMES (MIG 2018), 2018,
[8] AMI: Adaptive Motion Imitation Algorithm Based on Deep Reinforcement Learning
Taghavi, Nazita
Alqatamin, Moath H. A.
Popa, Dan O.
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022,
[9] An Intelligent Framework for English Teaching through Deep Learning and Reinforcement Learning with Interactive Mobile Technology
Hu J.
Jin G.
Int. J. Interact. Mob. Technol., 2024, 9 (74-87): : 74 - 87
[10] Robotic Arm Motion Planning Based on Residual Reinforcement Learning
Zhou, Dongxu
Jia, Ruiqing
Yao, Haifeng
Xie, Mingzuo
2021 THE 13TH INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2021), 2021, : 89 - 94

← 1 2 3 4 5 →