Blind Spot Detection for Safe Sim-to-Real Transfer

Authors
Ramakrishnan, Ramya [1]
Kamar, Ece [2]
Dey, Debadeepta [2]
Horvitz, Eric [2]
Shah, Julie [1]
Affiliations
[1] MIT, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Microsoft Res, 14865 NE 36th St, Redmond, WA 98052 USA
Keywords
NOVELTY DETECTION; UNCERTAINTY
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Agents trained in simulation may make errors when performing actions in the real world due to mismatches between training and execution environments. These mistakes can be dangerous and difficult for the agent to discover because the agent is unable to predict them a priori. In this work, we propose the use of oracle feedback to learn a predictive model of these blind spots in order to reduce costly errors in real-world applications. We focus on blind spots in reinforcement learning (RL) that occur due to incomplete state representation: when the agent lacks necessary features to represent the true state of the world, and thus cannot distinguish between numerous states. We formalize the problem of discovering blind spots in RL as a noisy supervised learning problem with class imbalance. Our system learns models for predicting blind spots within unseen regions of the state space by combining techniques for label aggregation, calibration, and supervised learning. These models take into consideration noise emerging from different forms of oracle feedback, including demonstrations and corrections. We evaluate our approach across two domains and demonstrate that it achieves higher predictive performance than baseline methods, and also that the learned model can be used to selectively query an oracle at execution time to prevent errors. We also empirically analyze the biases of various feedback types and how these biases influence the discovery of blind spots. Further, we include analyses of our approach that incorporate relaxed initial optimality assumptions. (Interestingly, relaxing the assumptions of an optimal oracle and an optimal simulator policy helped our models to perform better.) We also propose extensions to our method that are intended to improve performance when using corrections and demonstrations data.
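The abstract's idea of aggregating noisy oracle labels and then selectively querying the oracle at execution time can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the authors' implementation: the function names, the state identifiers, the 0.5 threshold, and the use of plain label averaging in place of the paper's label-aggregation step. The calibration and supervised-learning stages that let the model generalize to unseen states are omitted.

```python
from collections import defaultdict


def aggregate_labels(observations):
    """Return, per state, the fraction of oracle signals that flagged it.

    observations: iterable of (state, label) pairs, where label 1 means the
    oracle feedback suggested a possible blind spot and 0 means it did not.
    Plain averaging is an illustrative stand-in for label aggregation.
    """
    counts = defaultdict(lambda: [0, 0])  # state -> [flagged, total]
    for state, label in observations:
        counts[state][0] += label
        counts[state][1] += 1
    return {s: flagged / total for s, (flagged, total) in counts.items()}


def should_query_oracle(p_blind_spot, threshold=0.5):
    """Ask the oracle for help only when the estimated risk is high enough."""
    return p_blind_spot >= threshold


# Hypothetical noisy feedback collected for three states.
observations = [
    ("s1", 1), ("s1", 1), ("s1", 0),
    ("s2", 0), ("s2", 0),
    ("s3", 1),
]
scores = aggregate_labels(observations)
to_query = sorted(s for s, p in scores.items() if should_query_oracle(p))
print(to_query)
```

In this toy setup the agent would defer to the oracle in the states whose aggregated score reaches the threshold and act on its simulator-trained policy elsewhere; raising the threshold trades fewer oracle queries for a higher risk of missed blind spots.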
Pages: 191-234
Page count: 44