Extending Policy from One-Shot Learning through Coaching

被引:1
|
作者
Balakuntala, Mythra V. [1 ]
Venkatesh, Vishnunandan L. N. [1 ]
Bindu, Jyothsna Padmakumar [1 ]
Voyles, Richard M. [1 ]
Wachs, Juan [2 ]
机构
[1] Purdue Univ, Sch Engn Technol, W Lafayette, IN 47907 USA
[2] Purdue Univ, Sch Ind Engn, W Lafayette, IN 47907 USA
关键词
D O I
10.1109/ro-man46459.2019.8956364
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Humans generally teach their fellow collaborators to perform tasks through a small number of demonstrations, often followed by episodes of coaching that tune and refine the execution during practice. Adopting a similar framework for teaching robots through demonstrations makes teaching tasks highly intuitive and imitating the refinement of complex tasks through coaching improves the efficacy. Unlike traditional Learning from Demonstration (LfD) approaches which rely on multiple demonstrations to train a task, we present a novel one-shot learning from demonstration approach, augmented by coaching, to transfer the task from task expert to robot. The demonstration is automatically segmented into a sequence of a priori skills (the task policy) parametrized to match task goals. During practice, the robotic skills self-evaluate their performances and refine the task policy to locally optimize cumulative performance. Then, human coaching further refines the task policy to explore and globally optimize the net performance. Both the self-evaluation and coaching are implemented using reinforcement learning (RL) methods. The proposed approach is evaluated using the task of scooping and unscooping granular media. The self-evaluator of the scooping skill uses the real-time force signature and resistive force theory to minimize scooping resistance similar to how humans scoop. Coaching feedback focuses modifications to sub-domains of the action space, using RL to converge to desired performance. Thus, the proposed method provides a framework for learning tasks from one demonstration and generalizing it using human feedback through coaching achieving a success rate of approximate to 90%.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Embodied Gesture Learning from One-Shot
    Cabrera, Maria E.
    Wachs, Juan P.
    [J]. 2016 25TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2016, : 1092 - 1097
  • [2] One-Shot Imitation Learning
    Duan, Yan
    Andrychowicz, Marcin
    Stadie, Bradly
    Ho, Jonathan
    Schneider, Jonas
    Sutskeyer, Ilya
    Abbeel, Pieter
    Zaremba, Wojciech
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [3] Autonomous Driving Policy Continual Learning With One-Shot Disengagement Case
    Cao, Zhong
    Li, Xiang
    Jiang, Kun
    Zhou, Weitao
    Liu, Xiaoyu
    Deng, Nanshan
    Yang, Diange
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (02): : 1380 - 1391
  • [4] One-Shot Learning for Landmarks Detection
    Wang, Zihao
    Vandersteen, Clair
    Raffaelli, Charles
    Guevara, Nicolas
    Patou, Francois
    Delingette, Herve
    [J]. DEEP GENERATIVE MODELS, AND DATA AUGMENTATION, LABELLING, AND IMPERFECTIONS, 2021, 13003 : 163 - 172
  • [5] Local Contrast Learning for One-Shot Learning
    Zhang, Yang
    Yuan, Xinghai
    Luo, Ling
    Yang, Yulu
    Zhang, Shihao
    Xu, Chuanyun
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (12):
  • [6] Demystification of Few-shot and One-shot Learning
    Tyukin, Ivan Y.
    Gorban, Alexander N.
    Alkhudaydi, Muhammad H.
    Zhou, Qinghua
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [7] One-Shot Learning on Attributed Sequences
    Zhuang, Zhongfang
    Kong, Xiangnan
    Rundensteiner, Elke
    Arora, Aditya
    Zouaoui, Jihane
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 921 - 930
  • [8] Domain Adaption in One-Shot Learning
    Dong, Nanqing
    Xing, Eric P.
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2018, PT I, 2019, 11051 : 573 - 588
  • [9] The role of one-shot learning in # TheDress
    Daoudi, Leila Drissi
    Doerig, Adrien
    Parkosadze, Khatuna
    Kunchulia, Marina
    Herzog, Michael H.
    [J]. JOURNAL OF VISION, 2017, 17 (03):
  • [10] One-shot learning of object categories
    Li, FF
    Fergus, R
    Perona, P
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (04) : 594 - 611