Objective learning from human demonstrations

Cited by: 6
Authors
Lin, Jonathan Feng-Shun [1]
Carreno-Medrano, Pamela [2]
Parsapour, Mahsa [3]
Sakr, Maram [2, 4]
Kulic, Dana [2]
Affiliations
[1] Univ Waterloo, Syst Design Engn, Waterloo, ON, Canada
[2] Monash Univ, Fac Engn, Clayton, Vic, Australia
[3] Univ Waterloo, Elect & Comp Engn, Waterloo, ON, Canada
[4] Univ British Columbia, Mech Engn, Vancouver, BC, Canada
Funding
Natural Sciences and Engineering Research Council of Canada;
Keywords
Reward learning; Inverse optimal control; Inverse reinforcement learning; INVERSE OPTIMAL-CONTROL; COST-FUNCTIONS; GENERATION; ROBOT;
DOI
10.1016/j.arcontrol.2021.04.003
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Researchers in biomechanics, neuroscience, human-machine interaction and other fields are interested in inferring human intentions and objectives from observed actions. The problem of inferring objectives from observations has received extensive theoretical and methodological development from both the controls and machine learning communities. In this paper, we provide an integrating view of objective learning from human demonstration data. We differentiate algorithms based on the assumptions made about the objective function structure, how the similarity between the inferred objectives and the observed demonstrations is assessed, the assumptions made about the agent and environment model, and the properties of the observed human demonstrations. We review the application domains and validation approaches of existing works and identify the key open challenges and limitations. The paper concludes with an identification of promising directions for future work.
Pages: 111-129
Number of pages: 19
Related papers
50 in total
  • [1] Learning Periodic Tasks from Human Demonstrations
    Yang, Jingyun
    Zhang, Junwu
    Settle, Connor
    Rai, Akshara
    Antonova, Rika
    Bohg, Jeannette
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 8658 - 8665
  • [2] Learning Adaptive Grasping From Human Demonstrations
    Wang, Shuaijun
    Hu, Wenbin
    Sun, Lining
    Wang, Xin
    Li, Zhibin
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2022, 27 (05) : 3865 - 3873
  • [3] Learning Manipulation Actions from Human Demonstrations
    Welschehold, Tim
    Dornhege, Christian
    Burgard, Wolfram
    2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 3772 - 3777
  • [4] Inferring preferences from demonstrations in multi-objective reinforcement learning
    Lu, Junlin
    Mannion, Patrick
    Mason, Karl
    Neural Computing and Applications, 2024, 36 (36) : 22845 - 22865
  • [5] Robot learning from human demonstrations with inconsistent contexts
    Qian, Zhifeng
    You, Mingyu
    Zhou, Hongjun
    Xu, Xuanhui
    He, Bin
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2023, 166
  • [6] Learning Symbolic Representations of Actions from Human Demonstrations
    Ahmadzadeh, Seyed Reza
    Paikan, Ali
    Mastrogiovanni, Fulvio
    Natale, Lorenzo
    Kormushev, Petar
    Caldwell, Darwin G.
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 3801 - 3808
  • [7] Learning ultrasound scanning skills from human demonstrations
    Xutian DENG
    Ziwei LEI
    Yi WANG
    Wen CHENG
    Zhao GUO
    Chenguang YANG
    Miao LI
    Science China (Information Sciences), 2022, 65 (08) : 275 - 276
  • [8] Learning Motion and Impedance Behaviors from Human Demonstrations
    Saveriano, Matteo
    Lee, Dongheui
    2014 11TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2014, : 368 - 373
  • [9] Learning ultrasound scanning skills from human demonstrations
    Deng, Xutian
    Lei, Ziwei
    Wang, Yi
    Cheng, Wen
    Guo, Zhao
    Yang, Chenguang
    Li, Miao
    SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (08)
  • [10] Learning ultrasound scanning skills from human demonstrations
    Xutian Deng
    Ziwei Lei
    Yi Wang
    Wen Cheng
    Zhao Guo
    Chenguang Yang
    Miao Li
    Science China Information Sciences, 2022, 65