Learning Performance Graphs From Demonstrations via Task-Based Evaluations

被引:2
|
作者
Puranic, Aniruddh G. [1 ]
Deshmukh, Jyotirmoy V. [1 ]
Nikolaidis, Stefanos [1 ]
机构
[1] Univ Southern Calif, Comp Sci Dept, Los Angeles, CA 90089 USA
关键词
Formal methods in robotics and automation; learning from demonstration; reinforcement learning; SIGNAL TEMPORAL LOGIC;
D O I
10.1109/LRA.2022.3226072
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In the paradigm of robot learning-from-demonstra tions (LfD), understanding and evaluating the demonstrated behaviors plays a critical role in extracting control policies for robots. Without this knowledge, a robot may infer incorrect reward functions that lead to undesirable or unsafe control policies. Prior work has used temporal logic specifications, manually ranked by human experts based on their importance, to learn reward functions from imperfect/suboptimal demonstrations. To overcome reliance on expert rankings, we propose a novel algorithm that learns from demonstrations, a partial ordering of provided specifications in the form of a performance graph. Through various experiments, including simulation of industrial mobile robots, we show that extracting reward functions with the learned graph results in robot policies similar to those generated with the manually specified orderings. We also show in a user study that the learned orderings match the orderings or rankings by participants for demonstrations in a simulated driving domain. These results show that we can accurately evaluate demonstrations with respect to provided task specifications from a small set of imperfect data with minimal expert input.
引用
收藏
页码:336 / 343
页数:8
相关论文
共 50 条
  • [1] Task-based learning
    Race, P
    [J]. MEDICAL EDUCATION, 2000, 34 (05) : 335 - 336
  • [2] Task-based learning for pronunciation
    Lee, K
    [J]. INTERNATIONAL CONFERENCE ON COMPUTERS IN EDUCATION, VOLS I AND II, PROCEEDINGS, 2002, : 1500 - 1501
  • [3] TASK-BASED LEARNING IN EDUCATION
    Naznean, Andreea
    [J]. PROCEEDINGS OF THE EUROPEAN INTEGRATION: BETWEEN TRADITION AND MODERNITY, VOL 3, 2009, : 749 - 755
  • [4] A framework for task-based learning
    Yuan, FY
    [J]. TESOL QUARTERLY, 1999, 33 (01) : 157 - 158
  • [5] Verifying task-based specifications in conceptual graphs
    Lee, J
    Lai, LF
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 1998, 39 (14-15) : 913 - 923
  • [6] Task-based specifications through conceptual graphs
    Lee, J
    Lai, LF
    Huang, WT
    [J]. IEEE EXPERT-INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1996, 11 (04): : 60 - 70
  • [7] Learning Task Priorities From Demonstrations
    Silverio, Joao
    Calinon, Sylvain
    Rozo, Leonel
    Caldwell, Darwin G.
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2019, 35 (01) : 78 - 94
  • [8] Task-Based Learning via Task-Oriented Prediction Network with Applications in Finance
    Chen, Di
    Zhu, Yada
    Cui, Xiaodong
    Gomes, Carla P.
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 4476 - 4482
  • [9] Learning Task Specifications from Demonstrations
    Vazquez-Chanlatte, Marcell
    Jha, Susmit
    Tiwari, Ashish
    Ho, Mark K.
    Seshia, Sanjit A.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [10] Task-based language learning and teaching
    Apelgren, BM
    [J]. MODERNA SPRAK, 2004, 98 (01): : 115 - 117