Learning Sequential Tasks Interactively from Demonstrations and Own Experience

Cited by: 0
Authors
Graeve, Kathrin [1]
Behnke, Sven [1]
Affiliations
[1] Univ Bonn, Dept Comp Sci, Autonomous Intelligent Syst Grp, Bonn, Germany
Keywords: none listed
DOI: not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Deploying robots in our day-to-day lives requires them to have the ability to learn from their environment in order to acquire new task knowledge and to flexibly adapt existing skills to various situations. For typical real-world tasks, it is not sufficient to endow robots with a set of primitive actions. Rather, they need to learn how to sequence these in order to achieve a desired effect on their environment. In this paper, we propose an intuitive learning method for a robot to acquire sequences of motions by combining learning from human demonstrations and reinforcement learning. In every situation, our approach treats both ways of learning as alternative control flows to optimally exploit their strengths without inheriting their shortcomings. Using a Gaussian Process approximation of the state-action sequence value function, our approach generalizes values observed from demonstrated and autonomously generated action sequences to unknown inputs. This approximation is based on a kernel we designed to account for different representations of tasks and action sequences as well as inputs of variable length. From the expected deviation of value estimates, we devise a greedy exploration policy following a Bayesian optimization criterion that quickly converges learning to promising action sequences while protecting the robot from sequences with unpredictable outcomes. We demonstrate the ability of our approach to efficiently learn appropriate action sequences in various situations on a manipulation task involving stacked boxes.
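As a rough illustration of the idea in the abstract (a Gaussian Process over action-sequence values, followed by a Bayesian optimization criterion to pick the next sequence), the sketch below may help. It is not the authors' method: the kernel (`seq_kernel`, a mean of pairwise RBF similarities that accepts sequences of different lengths but ignores their order), the zero-mean prior, the noise level, and the use of expected improvement as the criterion are all assumptions made for this example, and every function name is hypothetical.

```python
import math
import numpy as np

def seq_kernel(a, b, length_scale=1.0):
    """Hypothetical kernel: mean pairwise RBF similarity between two
    sequences of scalar actions. Works for unequal lengths, ignores order."""
    a = np.asarray(a, dtype=float)[:, None]
    b = np.asarray(b, dtype=float)[None, :]
    return float(np.mean(np.exp(-0.5 * ((a - b) / length_scale) ** 2)))

def gram(xs, ys):
    """Kernel matrix between two lists of sequences."""
    return np.array([[seq_kernel(x, y) for y in ys] for x in xs])

def gp_posterior(train_seqs, train_values, query_seqs, noise=1e-3):
    """Zero-mean GP regression: posterior mean and standard deviation."""
    K = gram(train_seqs, train_seqs) + noise * np.eye(len(train_seqs))
    K_s = gram(query_seqs, train_seqs)
    mean = K_s @ np.linalg.solve(K, np.asarray(train_values, dtype=float))
    v = np.linalg.solve(K, K_s.T)
    prior_var = np.array([seq_kernel(q, q) for q in query_seqs])
    var = prior_var - np.sum(K_s * v.T, axis=1)
    return mean, np.sqrt(np.maximum(var, 0.0))

def expected_improvement(mean, std, best):
    """Expected improvement over the best value observed so far
    (a standard criterion, assumed here for illustration)."""
    ei = np.zeros_like(mean)
    for i, (m, s) in enumerate(zip(mean, std)):
        if s > 0:
            z = (m - best) / s
            cdf = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))
            pdf = math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)
            ei[i] = (m - best) * cdf + s * pdf
        else:
            ei[i] = max(m - best, 0.0)
    return ei

def pick_next(train_seqs, train_values, candidates):
    """Greedy choice: the candidate sequence with the largest criterion value."""
    mean, std = gp_posterior(train_seqs, train_values, candidates)
    ei = expected_improvement(mean, std, max(train_values))
    return int(np.argmax(ei))
```

Here `train_seqs` would hold demonstrated and self-generated action sequences with their observed values, and `pick_next` returns the index of the candidate to try next. A criterion that also penalizes high predictive uncertainty would be needed to reproduce the safety behaviour the abstract mentions; expected improvement alone does not do that.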
Pages: 3237-3243
Page count: 7
Related papers
50 in total
  • [31] GTI: Learning to Generalize Across Long-Horizon Tasks from Human Demonstrations
    Mandlekar, Ajay
    Xu, Danfei
    Martin-Martin, Roberto
    Savarese, Silvio
    Li Fei-Fei
    [J]. ROBOTICS: SCIENCE AND SYSTEMS XVI, 2020,
  • [32] Solving Complex Tasks Hierarchically from Demonstrations
    Zheng, Wei
    Wu, Bo
    Lin, Hai
    [J]. 2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 1178 - 1183
  • [33] Forgetful experience replay in hierarchical reinforcement learning from expert demonstrations
    Skrynnik, Alexey
    Staroverov, Aleksey
    Aitygulov, Ermek
    Aksenov, Kirill
    Davydov, Vasilii
    Panov, Aleksandr I.
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 218
  • [34] Self-Adaptive Imitation Learning: Learning Tasks with Delayed Rewards from Sub-optimal Demonstrations
    Zhu, Zhuangdi
    Lin, Kaixiang
    Dai, Bo
    Zhou, Jiayu
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9269 - 9277
  • [35] Explaining Multi-stage Tasks by Learning Temporal Logic Formulas from Suboptimal Demonstrations
    Chou, Glen
    Ozay, Necmiye
    Berenson, Dmitry
    [J]. ROBOTICS: SCIENCE AND SYSTEMS XVI, 2020,
  • [36] Learning From Sparse Demonstrations
    Jin, Wanxin
    Murphey, Todd D.
    Kulic, Dana
    Ezer, Neta
    Mou, Shaoshuai
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2023, 39 (01) : 645 - 664
  • [37] Learning to Generalize from Demonstrations
    Browne, Katie
    Nicolescu, Monica
    [J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2012, 12 (03) : 27 - 38
  • [38] Learning from Corrective Demonstrations
    Gutierrez, Reymundo A.
    Short, Elaine Schaertl
    Niekum, Scott
    Thomaz, Andrea L.
    [J]. HRI '19: 2019 14TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2019, : 712 - 714
  • [39] Autonomous learning of sequential tasks: Experiments and analyses
    Sun, R
    Peterson, T
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1998, 9 (06) : 1217 - 1234
  • [40] Facilitating Human-Robot Collaborative Tasks by Teaching-Learning-Collaboration From Human Demonstrations
    Wang, Weitian
    Li, Rui
    Chen, Yi
    Diekel, Z. Max
    Jia, Yunyi
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2019, 16 (02) : 640 - 653