Efficient Model Learning from Joint-Action Demonstrations for Human-Robot Collaborative Tasks

Cited by: 126
Authors
Nikolaidis, Stefanos [1]
Ramakrishnan, Ramya [1]
Gu, Keren [1]
Shah, Julie [1]
Affiliations
[1] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
DOI
10.1145/2696454.2696455
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a framework for automatically learning human user models from joint-action demonstrations that enables a robot to compute a robust policy for a collaborative task with a human. First, the demonstrated action sequences are clustered into different human types using an unsupervised learning algorithm. A reward function is then learned for each type through the employment of an inverse reinforcement learning algorithm. The learned model is then incorporated into a mixed-observability Markov decision process (MOMDP) formulation, wherein the human type is a partially observable variable. With this framework, we can infer online the human type of a new user that was not included in the training set, and can compute a policy for the robot that will be aligned to the preference of this user. In a human subject experiment (n = 30), participants agreed more strongly that the robot anticipated their actions when working with a robot incorporating the proposed framework (p < 0.01), compared to manually annotating robot actions. In trials where participants faced difficulty annotating the robot actions to complete the task, the proposed framework significantly improved team efficiency (p < 0.01). The robot incorporating the framework was also found to be more responsive to human actions compared to policies computed using a hand-coded reward function by a domain expert (p < 0.01). These results indicate that learning human user models from joint-action demonstrations and encoding them in a MOMDP formalism can support effective teaming in human-robot collaborative tasks.
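The online inference step described in the abstract, where the human type is a partially observable variable, can be illustrated with a simple Bayesian belief update. This is a minimal sketch rather than the authors' implementation: it assumes each human type has a fixed, state-independent likelihood over observed human actions, whereas a full MOMDP would also condition on the task state and the robot action. The type names, action names, and probabilities below are hypothetical.

```python
from typing import Dict, List

Belief = Dict[str, float]
Likelihoods = Dict[str, Dict[str, float]]


def update_type_belief(belief: Belief, likelihoods: Likelihoods, action: str) -> Belief:
    """One Bayes step: posterior(type) is proportional to prior(type) * P(action | type)."""
    unnormalized = {
        human_type: prior * likelihoods[human_type].get(action, 0.0)
        for human_type, prior in belief.items()
    }
    total = sum(unnormalized.values())
    if total == 0.0:
        # The action is impossible under every type: keep the previous belief.
        return dict(belief)
    return {human_type: p / total for human_type, p in unnormalized.items()}


def infer_type(prior: Belief, likelihoods: Likelihoods, actions: List[str]) -> Belief:
    """Fold a sequence of observed human actions into the belief over types."""
    belief = dict(prior)
    for action in actions:
        belief = update_type_belief(belief, likelihoods, action)
    return belief


# Hypothetical example: two learned human types and two possible human actions.
PRIOR: Belief = {"type_a": 0.5, "type_b": 0.5}
LIKELIHOODS: Likelihoods = {
    "type_a": {"left": 0.8, "right": 0.2},
    "type_b": {"left": 0.3, "right": 0.7},
}

posterior = infer_type(PRIOR, LIKELIHOODS, ["left", "left", "right"])
most_likely = max(posterior, key=posterior.get)
```

In this sketch the robot would then act on `posterior`, for example by following the policy associated with `most_likely` or by planning over the full belief, which is closer to what a MOMDP solver does.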
Pages: 189-196
Number of pages: 8