Blind Spot Detection for Safe Sim-to-Real Transfer

被引:0
|
作者
Ramakrishnan, Ramya [1 ]
Kamar, Ece [2 ]
Dey, Debadeepta [2 ]
Horvitz, Eric [2 ]
Shah, Julie [1 ]
机构
[1] MIT, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Microsoft Res, 14865 NE 36th St, Redmond, WA 98052 USA
关键词
NOVELTY DETECTION; UNCERTAINTY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Agents trained in simulation may make errors when performing actions in the real world due to mismatches between training and execution environments. These mistakes can be dangerous and difficult for the agent to discover because the agent is unable to predict them a priori. In this work, we propose the use of oracle feedback to learn a predictive model of these blind spots in order to reduce costly errors in real-world applications. We focus on blind spots in reinforcement learning (RL) that occur due to incomplete state representation: when the agent lacks necessary features to represent the true state of the world, and thus cannot distinguish between numerous states. We formalize the problem of discovering blind spots in RL as a noisy supervised learning problem with class imbalance. Our system learns models for predicting blind spots within unseen regions of the state space by combining techniques for label aggregation, calibration, and supervised learning. These models take into consideration noise emerging from different forms of oracle feedback, including demonstrations and corrections. We evaluate our approach across two domains and demonstrate that it achieves higher predictive performance than baseline methods, and also that the learned model can be used to selectively query an oracle at execution time to prevent errors. We also empirically analyze the biases of various feedback types and how these biases influence the discovery of blind spots. Further, we include analyses of our approach that incorporate relaxed initial optimality assumptions. (Interestingly, relaxing the assumptions of an optimal oracle and an optimal simulator policy helped our models to perform better.) We also propose extensions to our method that are intended to improve performance when using corrections and demonstrations data.
引用
收藏
页码:191 / 234
页数:44
相关论文
共 50 条
  • [1] Blind spot detection for safe sim-to-real transfer
    Ramakrishnan, Ramya
    Kamar, Ece
    Dey, Debadeepta
    Horvitz, Eric
    Shah, Julie
    Journal of Artificial Intelligence Research, 2020, 67 : 191 - 234
  • [2] Sim-to-Real Transfer for Biped Locomotion
    Yu, Wenhao
    Kumar, Visak C. V.
    Turk, Greg
    Liu, C. Karen
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 3503 - 3510
  • [3] Sim-to-Real Transfer for Object Detection in Aerial Inspections of Transmission Towers
    Peterlevitz, Augusto J.
    Chinelatto, Mateus A.
    Menezes, Angelo G.
    Motta, Cezanne A. M.
    Pereira, Guilherme A. B.
    Lopes, Gustavo L.
    Souza, Gustavo De M.
    Rodrigues, Juan
    Godoy, Lilian C.
    Koller, Mario A. F. F.
    Cabral, Mateus O.
    Alves, Nicole E.
    Silva, Paulo H.
    Cherobin, Ricardo
    Yamamoto, Roberto A. O.
    Da Silva, Ricardo D.
    IEEE ACCESS, 2023, 11 : 110312 - 110327
  • [4] Auto-Tuned Sim-to-Real Transfer
    Du, Yuqing
    Watkins, Olivia
    Darrell, Trevor
    Abbeel, Pieter
    Pathak, Deepak
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 1290 - 1296
  • [5] Sim-to-Real Transfer for Optical Tactile Sensing
    Ding, Zihan
    Lepora, Nathan F.
    Johns, Edward
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 1639 - 1645
  • [6] DROPO: Sim-to-real transfer with offline domain randomization
    Tiboni, Gabriele
    Arndt, Karol
    Kyrki, Ville
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2023, 166
  • [7] Sim-to-Real Transfer of Bolting Tasks with Tight Tolerance
    Son, Dongwon
    Yang, Hyunsoo
    Lee, Dongjun
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 9056 - 9063
  • [8] Benchmarking Domain Randomisation for Visual Sim-to-Real Transfer
    Alghonaim, Raghad
    Johns, Edward
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 12802 - 12808
  • [9] Reinforced Grounded Action Transformation for Sim-to-Real Transfer
    Karnan, Haresh
    Desai, Siddharth
    Hanna, Josiah P.
    Warnell, Garrett
    Stone, Peter
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 4397 - 4402
  • [10] Robust visual sim-to-real transfer for robotic manipulation
    Garcia, Ricardo
    Strudel, Robin
    Chen, Shizhe
    Arlaud, Etienne
    Laptev, Ivan
    Schmid, Cordelia
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 992 - 999