Blind Spot Detection for Safe Sim-to-Real Transfer

被引:0
|
作者
Ramakrishnan, Ramya [1 ]
Kamar, Ece [2 ]
Dey, Debadeepta [2 ]
Horvitz, Eric [2 ]
Shah, Julie [1 ]
机构
[1] MIT, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Microsoft Res, 14865 NE 36th St, Redmond, WA 98052 USA
关键词
NOVELTY DETECTION; UNCERTAINTY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Agents trained in simulation may make errors when performing actions in the real world due to mismatches between training and execution environments. These mistakes can be dangerous and difficult for the agent to discover because the agent is unable to predict them a priori. In this work, we propose the use of oracle feedback to learn a predictive model of these blind spots in order to reduce costly errors in real-world applications. We focus on blind spots in reinforcement learning (RL) that occur due to incomplete state representation: when the agent lacks necessary features to represent the true state of the world, and thus cannot distinguish between numerous states. We formalize the problem of discovering blind spots in RL as a noisy supervised learning problem with class imbalance. Our system learns models for predicting blind spots within unseen regions of the state space by combining techniques for label aggregation, calibration, and supervised learning. These models take into consideration noise emerging from different forms of oracle feedback, including demonstrations and corrections. We evaluate our approach across two domains and demonstrate that it achieves higher predictive performance than baseline methods, and also that the learned model can be used to selectively query an oracle at execution time to prevent errors. We also empirically analyze the biases of various feedback types and how these biases influence the discovery of blind spots. Further, we include analyses of our approach that incorporate relaxed initial optimality assumptions. (Interestingly, relaxing the assumptions of an optimal oracle and an optimal simulator policy helped our models to perform better.) We also propose extensions to our method that are intended to improve performance when using corrections and demonstrations data.
引用
收藏
页码:191 / 234
页数:44
相关论文
共 50 条
  • [21] Sim-to-Real Transfer for Quadrupedal Locomotion via Terrain Transformer
    Lai, Hang
    Zhang, Weinan
    He, Xialin
    Yu, Chen
    Tian, Zheng
    Yu, Yong
    Wang, Jun
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 5141 - 5147
  • [22] Survey on Sim-to-real Transfer Reinforcement Learning in Robot Systems
    Lin Q.
    Yu C.
    Wu X.-W.
    Dong Y.-Z.
    Xu X.
    Zhang Q.
    Guo X.
    Ruan Jian Xue Bao/Journal of Software, 2024, 35 (02): : 711 - 738
  • [23] Learning Soft Millirobot Multimodal Locomotion with Sim-to-Real Transfer
    Demir, Sinan Ozgun
    Tiryaki, Mehmet Efe
    Karacakol, Alp Can
    Sitti, Metin
    ADVANCED SCIENCE, 2024, 11 (30)
  • [24] Bidirectional Sim-to-Real Transfer for GelSight Tactile Sensors With CycleGAN
    Chen, Weihang
    Xu, Yuan
    Chen, Zhenyang
    Zeng, Peiyu
    Dang, Renjun
    Chen, Rui
    Xu, Jing
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 6187 - 6194
  • [25] Prompt to Transfer: Sim-to-Real Transfer for Traffic Signal Control with Prompt Learning
    Da, Longchao
    Gao, Minquan
    Mei, Hao
    Wei, Hua
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 82 - 90
  • [26] On the Role of the Action Space in Robot Manipulation Learning and Sim-to-Real Transfer
    Aljalbout, Elie
    Frank, Felix
    Karl, Maximilian
    van der Smagt, Patrick
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (06): : 5895 - 5902
  • [27] Sim-to-real transfer of co-optimized soft robot crawlers
    Charles Schaff
    Audrey Sedal
    Shiyao Ni
    Matthew R. Walter
    Autonomous Robots, 2023, 47 : 1195 - 1211
  • [28] Sim-to-Real Policy and Reward Transfer with Adaptive Forward Dynamics Model
    Juan, Rongshun
    Ju, Hao
    Huang, Jie
    Gomez, Randy
    Nakamura, Keisuke
    Li, Guangliang
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 7212 - 7218
  • [29] Adversarial discriminative sim-to-real transfer of visuo-motor policies
    Zhang, Fangyi
    Leitner, Jurgen
    Ge, Zongyuan
    Milford, Michael
    Corke, Peter
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2019, 38 (10-11): : 1229 - 1245
  • [30] AdaptSim: Task-Driven Simulation Adaptation for Sim-to-Real Transfer
    Ren, Allen Z.
    Dai, Hongkai
    Burchfiel, Benjamin
    Majumdar, Anirudha
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229