HERD: Continuous Human-to-Robot Evolution for Learning from Human Demonstration

被引:0
|
作者
Liu, Xingyu [1 ]
Pathak, Deepak [1 ]
Kitani, Kris M. [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
来源
关键词
IMITATION; MUJOCO;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The ability to learn from human demonstration endows robots with the ability to automate various tasks. However, directly learning from human demonstration is challenging since the structure of the human hand can be very different from the desired robot gripper. In this work, we show that manipulation skills can be transferred from a human to a robot through the use of micro-evolutionary reinforcement learning, where a five-finger human dexterous hand robot gradually evolves into a commercial robot, while repeated interacting in a physics simulator to continuously update the policy that is first learned from human demonstration. To deal with the high dimensions of robot parameters, we propose an algorithm for multi-dimensional evolution path searching that allows joint optimization of both the robot evolution path and the policy. Through experiments on human object manipulation datasets, we show that our framework can efficiently transfer the expert human agent policy trained from human demonstrations in diverse modalities to target commercial robots.
引用
收藏
页码:447 / 458
页数:12
相关论文
共 50 条
  • [31] Exploring Implicit Human Responses to Robot Mistakes in a Learning from Demonstration Task
    Hayes, Cory J.
    Moosaei, Maryam
    Riek, Laurel D.
    2016 25TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2016, : 246 - 252
  • [32] ROBOT LEARNING FROM HUMAN DEMONSTRATION OF ACTIVITIES OF DAILY LIVING (ADL) TASKS
    Trivedi, Urvish
    Alqasemi, Redwan
    Dubey, Rajiv
    PROCEEDINGS OF ASME 2021 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION (IMECE2021), VOL 6, 2021,
  • [33] Learning from Demonstration Facilitates Human-Robot Collaborative Task Execution
    Koskinopoulou, Maria
    Piperakis, Stylimos
    Frahanias, Panos
    ELEVENTH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN ROBOT INTERACTION (HRI'16), 2016, : 59 - 66
  • [34] Communication and knowledge sharing in human-robot interaction and learning from demonstration
    Koenig, Nathan
    Takayama, Leila
    Mataric, Maja
    NEURAL NETWORKS, 2010, 23 (8-9) : 1104 - 1112
  • [35] The Effects of a Robot's Performance on Human Teachers for Learning from Demonstration Tasks
    Hedlund, Erin
    Johnson, Michael
    Gombolay, Matthew
    2021 16TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, HRI, 2021, : 207 - 215
  • [36] LEARNING CONTRACTING NONLINEAR DYNAMICS FROM HUMAN DEMONSTRATION FOR ROBOT MOTION PLANNING
    Ravichandar, Harish
    Dani, Ashwin
    PROCEEDINGS OF THE ASME 8TH ANNUAL DYNAMIC SYSTEMS AND CONTROL CONFERENCE, 2015, VOL 2, 2016,
  • [37] Robot Performing Peg-in-Hole Operations by Learning from Human Demonstration
    Zhu, Zuyuan
    Hu, Huosheng
    Gu, Dongbing
    2018 10TH COMPUTER SCIENCE AND ELECTRONIC ENGINEERING CONFERENCE (CEEC), 2018, : 30 - 35
  • [38] Reduced-dimension representations of human performance data for human-to-robot skill transfer
    Lee, C
    Xu, YS
    1998 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS - PROCEEDINGS, VOLS 1-3: INNOVATIONS IN THEORY, PRACTICE AND APPLICATIONS, 1998, : 1956 - 1961
  • [39] Benchmark for Human-to-Robot Handovers of Unseen Containers With Unknown Filling
    Sanchez-Matilla, Ricardo
    Chatzilygeroudis, Konstantinos
    Modas, Apostolos
    Duarte, Nuno Ferreira
    Xompero, Alessio
    Frossard, Pascal
    Billard, Aude
    Cavallaro, Andrea
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02): : 1642 - 1649
  • [40] HandoverSim: A Simulation Framework and Benchmark for Human-to-Robot Object Handovers
    Chao, Yu-Wei
    Paxton, Chris
    Xiang, Yu
    Yang, Wei
    Sundaralingam, Balakumar
    Chen, Tao
    Murali, Adithyavairavan
    Cakmak, Maya
    Fox, Dieter
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 6941 - 6947