Blind Spot Detection for Safe Sim-to-Real Transfer

Authors
Ramakrishnan, Ramya [1]
Kamar, Ece [2]
Dey, Debadeepta [2]
Horvitz, Eric [2]
Shah, Julie [1]
Affiliations
[1] MIT, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Microsoft Research, 14865 NE 36th St, Redmond, WA 98052 USA
Keywords
NOVELTY DETECTION; UNCERTAINTY
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Agents trained in simulation may make errors when performing actions in the real world due to mismatches between training and execution environments. These mistakes can be dangerous and difficult for the agent to discover because the agent is unable to predict them a priori. In this work, we propose the use of oracle feedback to learn a predictive model of these blind spots in order to reduce costly errors in real-world applications. We focus on blind spots in reinforcement learning (RL) that occur due to incomplete state representation: when the agent lacks necessary features to represent the true state of the world, and thus cannot distinguish between numerous states. We formalize the problem of discovering blind spots in RL as a noisy supervised learning problem with class imbalance. Our system learns models for predicting blind spots within unseen regions of the state space by combining techniques for label aggregation, calibration, and supervised learning. These models take into consideration noise emerging from different forms of oracle feedback, including demonstrations and corrections. We evaluate our approach across two domains and demonstrate that it achieves higher predictive performance than baseline methods, and also that the learned model can be used to selectively query an oracle at execution time to prevent errors. We also empirically analyze the biases of various feedback types and how these biases influence the discovery of blind spots. Further, we include analyses of our approach that incorporate relaxed initial optimality assumptions. (Interestingly, relaxing the assumptions of an optimal oracle and an optimal simulator policy helped our models to perform better.) We also propose extensions to our method that are intended to improve performance when using corrections and demonstrations data.
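The abstract mentions aggregating noisy oracle labels and using the learned model to decide when to query an oracle at execution time. The Python sketch below is only a rough illustration of that idea, not the authors' method: it swaps in a plain per-state average for label aggregation and a fixed threshold for the query decision, and it treats unseen states conservatively instead of predicting them with a learned classifier. The names `aggregate_labels`, `should_query_oracle`, `threshold`, and `default` are hypothetical.

```python
from collections import defaultdict


def aggregate_labels(observations):
    """Average noisy binary labels per state (1 = feedback suggests a blind spot).

    Assumption: a simple mean stands in for the label-aggregation step.
    """
    sums = defaultdict(int)
    counts = defaultdict(int)
    for state, label in observations:
        sums[state] += label
        counts[state] += 1
    return {state: sums[state] / counts[state] for state in counts}


def should_query_oracle(state, blind_spot_prob, threshold=0.5, default=1.0):
    """Query the oracle when the estimated blind-spot probability reaches the threshold.

    Assumption: states with no feedback get `default`, i.e. they are treated as risky.
    """
    return blind_spot_prob.get(state, default) >= threshold


# Hypothetical (state, noisy label) pairs derived from oracle feedback.
observations = [("s1", 0), ("s1", 0), ("s1", 1), ("s2", 1), ("s2", 1), ("s3", 0)]
probs = aggregate_labels(observations)

for s in ("s1", "s2", "s3", "s4"):
    print(s, should_query_oracle(s, probs))
```

In this toy setup the threshold trades off oracle queries against the risk of acting in a blind spot; a calibrated classifier over state features would replace the dictionary lookup in a fuller version.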
Pages: 191-234
Page count: 44