Uncertainty-Aware Policy Sampling and Mixing for Safe Interactive Imitation Learning

被引:0
|
作者
Diaz, Manfred [1 ]
Fevens, Thomas [2 ]
Paull, Liam [1 ]
机构
[1] Univ Montreal, Mila Dept Comp Sci & Operat Res, Montreal, PQ, Canada
[2] Concordia Univ, Dept Comp Sci & Software Engn, Montreal, PQ, Canada
关键词
D O I
10.1109/CRV52889.2021.00018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Teaching robots how to execute tasks through demonstrations is appealing since it sidesteps the need to explicitly specify a reward function. However, posing imitation learning as a simple supervised learning problem suffers from the well-known problem of distributional shift - the teacher will only demonstrate the optimal trajectory and therefore the learner is unable to recover if it deviates even slightly from this trajectory since it has no training data for this case. This problem has been overcome in the literature by some element of interactivity in the learning process - usually be somehow interleaving the execution of the learner and the teacher so that the teacher can demonstrate to the learner also how to recover from mistakes. In this paper, we consider the cases where the robot has the potential to do harm, and therefore safety must be imposed at every step in the learning process. We show that uncertainty is an appropriate measure of safety and that both the mixing of the policies and the data sampling procedure benefit from considering the uncertainty of both the learner and the teacher. Our method, uncertainty-aware policy sampling and mixing (UPMS), is used to teach an agent to drive down a lane with less safety violations and less queries to the teacher than state-of-the-art methods.
引用
收藏
页码:72 / 78
页数:7
相关论文
共 50 条
  • [1] Uncertainty-Aware Data Aggregation for Deep Imitation Learning
    Cui, Yuchen
    Isele, David
    Niekum, Scott
    Fujimura, Kikuo
    [J]. 2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 761 - 767
  • [2] Uncertainty-Aware Imitation Learning using Kernelized Movement Primitives
    Silverio, Joao
    Huang, Yanlong
    Abu-Dakka, Fares J.
    Rozo, Leonel
    Caldwell, Darwin G.
    [J]. 2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 90 - 97
  • [3] Uncertainty-Aware Interactive LiDAR Sampling for Deep Depth Completion
    Taguchi, Kensuke
    Morita, Shogo
    Hayashi, Yusuke
    Imaeda, Wataru
    Fujiyoshi, Hironobu
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3027 - 3035
  • [4] A Continual Learning Framework for Uncertainty-Aware Interactive Image Segmentation
    Zheng, Ervine
    Yu, Qi
    Li, Rui
    Shi, Pengcheng
    Haake, Anne
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 6030 - 6038
  • [5] Uncertainty-Aware Instance Reweighting for Off-Policy Learning
    Zhang, Xiaoying
    Chen, Junpu
    Wang, Hongning
    Xie, Hong
    Liu, Yang
    Lui, John C. S.
    Li, Hang
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [6] Safe Learning for Uncertainty-Aware Planning via Interval MDP Abstraction
    Jiang, Jesse
    Zhao, Ye
    Coogan, Samuel
    [J]. IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 2641 - 2646
  • [7] Safe Model-Based Reinforcement Learning With an Uncertainty-Aware Reachability Certificate
    Yu, Dongjie
    Zou, Wenjun
    Yang, Yujie
    Ma, Haitong
    Li, Shengbo Eben
    Yin, Yuming
    Chen, Jianyu
    Duan, Jingliang
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 21 (03) : 1 - 14
  • [8] Uncertainty-Aware Contact-Safe Model-Based Reinforcement Learning
    Kuo, Cheng-Yu
    Schaarschmidt, Andreas
    Cui, Yunduan
    Asfour, Tamim
    Matsubara, Takamitsu
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 3918 - 3925
  • [9] Uncertainty-Aware Reinforcement Learning for Safe Control of Autonomous Vehicles in Signalized Intersections
    Emamifar, Mehrnoosh
    Ghoreishi, Seyede Fatemeh
    [J]. 2023 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI, 2023, : 81 - 82
  • [10] Uncertainty-aware automated machine learning toolbox
    Dorst, Tanja
    Schneider, Tizian
    Eichstaedt, Sascha
    Schuetze, Andreas
    [J]. TM-TECHNISCHES MESSEN, 2023, 90 (03) : 141 - 153