Efficient Model Learning from Joint-Action Demonstrations for Human-Robot Collaborative Tasks

被引:126
|
作者
Nikolaidis, Stefanos [1 ]
Ramakrishnan, Ramya [1 ]
Gu, Keren [1 ]
Shah, Julie [1 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
关键词
D O I
10.1145/2696454.2696455
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a framework for automatically learning human user models from joint-action demonstrations that enables a robot to compute a robust policy for a collaborative task with a human. First, the demonstrated action sequences are clustered into different human types using an unsupervised learning algorithm. A reward function is then learned for each type through the employment of an inverse reinforcement learning algorithm. The learned model is then incorporated into a mixed-observability Markov decision process (MOMDP) formulation, wherein the human type is a partially observable variable. With this framework, we can infer online the human type of a new user that was not included in the training set, and can compute a policy for the robot that will be aligned to the preference of this user. In a human subject experiment (n = 30), participants agreed more strongly that the robot anticipated their actions when working with a robot incorporating the proposed framework (p < 0.01), compared to manually annotating robot actions. In trials where participants faced difficulty annotating the robot actions to complete the task, the proposed framework significantly improved team efficiency (p < 0.01). The robot incorporating the framework was also found to be more responsive to human actions compared to policies computed using a hand-coded reward function by a domain expert (p < 0.01). These results indicate that learning human user models from joint-action demonstrations and encoding them in a MOMDP formalism can support effective teaming in human-robot collaborative tasks.
引用
下载
收藏
页码:189 / 196
页数:8
相关论文
共 50 条
  • [21] Methods for Providing Indications of Robot Intent in Collaborative Human-Robot Tasks
    Bejerano, Gal
    LeMasurier, Gregory
    Yanco, Holly A.
    COMPANION OF THE 2018 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION (HRI'18), 2018, : 65 - 66
  • [22] Human Intention-Driven Learning Control for Trajectory Synchronization in Human-Robot Collaborative Tasks
    Ravichandar, Harish Chaandar
    Trombetta, Daniel
    Dani, Ashwin P.
    IFAC PAPERSONLINE, 2019, 51 (34): : 1 - 7
  • [23] Perception-Intention-Action Cycle as a Human Acceptable Way for Improving Human-Robot Collaborative Tasks
    Dominguez-Vidal, J. E.
    Rodriguez, Nicolas
    Sanfeliu, Alberto
    COMPANION OF THE ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, HRI 2023, 2023, : 567 - 571
  • [24] MoVEInt: Mixture of Variational Experts for Learning Human-Robot Interactions From Demonstrations
    Prasad, Vignesh
    Kshirsagar, Alap
    Koert, Dorothea
    Stock-Homburg, Ruth
    Peters, Jan
    Chalvatzaki, Georgia
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (07): : 6043 - 6050
  • [25] Decisional Issues for Human-Robot Joint Action
    Alami, Rachid
    PROCEEDINGS OF ROBOPHILOSOPHY - SOCIAL ROBOTS IN SOCIAL INSTITUTIONS, 2022, 366 : 23 - 23
  • [26] Key Elements for Human-Robot Joint Action
    Clodic, Aurelie
    Alami, Rachid
    Chatila, Raja
    SOCIABLE ROBOTS AND THE FUTURE OF SOCIAL RELATIONS, 2014, 273 : 23 - 33
  • [27] Analyzing Human Visual Attention in Human-Robot Collaborative Construction Tasks
    Liang, Xiaoyun
    Cai, Jiannan
    Hu, Yuqing
    CONSTRUCTION RESEARCH CONGRESS 2024: ADVANCED TECHNOLOGIES, AUTOMATION, AND COMPUTER APPLICATIONS IN CONSTRUCTION, 2024, : 856 - 865
  • [28] A Programming by Demonstration System for Human-Robot Collaborative Assembly Tasks
    Hamabe, Takuma
    Goto, Hiraki
    Miura, Jun
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2015, : 1195 - 1201
  • [29] A Human-Robot Collaborative Reinforcement Learning Algorithm
    Uri Kartoun
    Helman Stern
    Yael Edan
    Journal of Intelligent & Robotic Systems, 2010, 60 : 217 - 239
  • [30] Prediction of Human Activity Patterns for Human-Robot Collaborative Assembly Tasks
    Zanchettin, Andrea Maria
    Casalino, Andrea
    Piroddi, Luigi
    Rocco, Paolo
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 15 (07) : 3934 - 3942