Learning Performance Graphs From Demonstrations via Task-Based Evaluations

被引:2
|
作者
Puranic, Aniruddh G. [1 ]
Deshmukh, Jyotirmoy V. [1 ]
Nikolaidis, Stefanos [1 ]
机构
[1] Univ Southern Calif, Comp Sci Dept, Los Angeles, CA 90089 USA
关键词
Formal methods in robotics and automation; learning from demonstration; reinforcement learning; SIGNAL TEMPORAL LOGIC;
D O I
10.1109/LRA.2022.3226072
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In the paradigm of robot learning-from-demonstra tions (LfD), understanding and evaluating the demonstrated behaviors plays a critical role in extracting control policies for robots. Without this knowledge, a robot may infer incorrect reward functions that lead to undesirable or unsafe control policies. Prior work has used temporal logic specifications, manually ranked by human experts based on their importance, to learn reward functions from imperfect/suboptimal demonstrations. To overcome reliance on expert rankings, we propose a novel algorithm that learns from demonstrations, a partial ordering of provided specifications in the form of a performance graph. Through various experiments, including simulation of industrial mobile robots, we show that extracting reward functions with the learned graph results in robot policies similar to those generated with the manually specified orderings. We also show in a user study that the learned orderings match the orderings or rankings by participants for demonstrations in a simulated driving domain. These results show that we can accurately evaluate demonstrations with respect to provided task specifications from a small set of imperfect data with minimal expert input.
引用
收藏
页码:336 / 343
页数:8
相关论文
共 50 条
  • [21] Task-Based Language Learning: A Review of Issues
    Robinson, Peter
    [J]. LANGUAGE LEARNING, 2011, 61 : 1 - 36
  • [22] Task-based learning environments in a virtual university
    Whittington, D
    Campbell, L
    [J]. COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7): : 707 - 709
  • [23] Task-based language learning and teaching.
    Mackey, A
    [J]. STUDIES IN SECOND LANGUAGE ACQUISITION, 2004, 26 (03) : 480 - 482
  • [24] Task-Based Neuromodulation Architecture for Lifelong Learning
    Daram, Anurag Reddy
    Kudithipudi, Dhireesha
    Yanguas-Gil, Angel
    [J]. PROCEEDINGS OF THE 2019 20TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED), 2019, : 191 - 197
  • [25] ForeignLanguage Teaching Methods in Task-Based Learning
    Shukurova, Farida
    [J]. ARAB WORLD ENGLISH JOURNAL, 2024, 15 (01) : 44 - 55
  • [26] Robot learning of industrial assembly task via human demonstrations
    Maria Kyrarini
    Muhammad Abdul Haseeb
    Danijela Ristić-Durrant
    Axel Gräser
    [J]. Autonomous Robots, 2019, 43 : 239 - 257
  • [27] Dynamic Guidance for Task-Based Exploratory Learning
    Thomas, James M.
    Young, R. Michael
    [J]. ARTIFICIAL INTELLIGENCE IN EDUCATION, 2011, 6738 : 369 - 376
  • [28] Task-Based Neuromodulation Architecture for Lifelong Learning
    Daram, Anurag Reddy
    Kudithipudi, Dhireesha
    Yanguas-Gil, Angel
    [J]. 2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,
  • [29] Task-based language learning and teaching with technology
    Dettori, Giuliana
    [J]. BRITISH JOURNAL OF EDUCATIONAL TECHNOLOGY, 2011, 42 (05) : E114 - E115
  • [30] Task-Based Language Learning and Teaching with Technology
    Ziegler, Nicole
    Rock, Kristin
    [J]. KOREAN LANGUAGE IN AMERICA, 2016, 20 (01): : 95 - 97