Safe Reinforcement Learning via Confidence-Based Filters

被引:1
|
作者
Curi, Sebastian [1 ]
Lederer, Armin [2 ]
Hirche, Sandra [2 ]
Krause, Andreas [1 ]
机构
[1] Swiss Fed Inst Technol, Dept Comp Sci, ILAS Grp, Zurich, Switzerland
[2] Tech Univ Munich, Dept Elect & Comp Engn, Munich, Germany
基金
欧洲研究理事会; 瑞士国家科学基金会;
关键词
MODEL PREDICTIVE CONTROL;
D O I
10.1109/CDC51059.2022.9992470
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ensuring safety is a crucial challenge when deploying reinforcement learning (RL) to real-world systems. We develop confidence-based safety filters, a control-theoretic approach for certifying state safety constraints for nominal policies learnt via standard RL techniques, based on probabilistic dynamics models. Our approach is based on a reformulation of state constraints in terms of cost functions, reducing safety verification to a standard RL task. By exploiting the concept of hallucinating inputs, we extend this formulation to determine a "backup" policy which is safe for the unknown system with high probability. The nominal policy is minimally adjusted at every time step during a roll-out towards the backup policy, such that safe recovery can be guaranteed afterwards. We provide formal safety guarantees, and empirically demonstrate the effectiveness of our approach.
引用
收藏
页码:3409 / 3415
页数:7
相关论文
共 50 条
  • [1] Improving Reinforcement Learning with Confidence-Based Demonstrations
    Wang, Zhaodong
    Taylor, Matthew E.
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3027 - 3033
  • [2] Confidence-based active learning
    Li, Mingkun
    Sethi, Ishwar K.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (08) : 1251 - 1261
  • [3] Confidence-Based Robot Navigation Under Sensor Occlusion with Deep Reinforcement Learning
    Ryu, Hyeongyeol
    Yoon, Minsung
    Park, Daehyung
    Yoon, Sung-eui
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 8231 - 8237
  • [4] Confidence-Based Learning in Investment Analysis
    Serradell-Lopez, Enric
    Lara-Navarra, Pablo
    Castillo-Merino, David
    Gonzalez-Gonzalez, Ines
    [J]. TECHNOLOGY ENHANCED LEARNING: QUALITY OF TEACHING AND EDUCATIONAL REFORM, 2010, 73 : 28 - +
  • [5] CobLE : Confidence-based Learning Ensembles
    Buthpitiya, Senaka
    Dey, Anind K.
    Griss, Martin
    [J]. 2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), VOL 1, 2014, : 386 - 391
  • [6] Confidence-based Assessment for Learning in ePortfolio Environment
    Lap Trung Nguyen
    [J]. PROCEEDINGS 2016 5TH IIAI INTERNATIONAL CONGRESS ON ADVANCED APPLIED INFORMATICS IIAI-AAI 2016, 2016, : 328 - 331
  • [7] Probabilistically Safe Robot Planning with Confidence-Based Human Predictions
    Fisac, Jaime E.
    Bajcsy, Andrea
    Herbert, Sylvia L.
    Fridovich-Keil, David
    Wang, Steven
    Tomlin, Claire J.
    Dragan, Anca D.
    [J]. ROBOTICS: SCIENCE AND SYSTEMS XIV, 2018,
  • [8] On Normative Reinforcement Learning via Safe Reinforcement Learning
    Neufeld, Emery A.
    Bartocci, Ezio
    Ciabattoni, Agata
    [J]. PRIMA 2022: PRINCIPLES AND PRACTICE OF MULTI-AGENT SYSTEMS, 2023, 13753 : 72 - 89
  • [9] Semantics and Feature Discovery via Confidence-Based Ensemble
    Goh, Kingshy
    Li, Beitao
    Chang, Edward Y.
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2005, 1 (02)
  • [10] Interactive Policy Learning through Confidence-Based Autonomy
    Chernova, Sonia
    Veloso, Manuela
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2009, 34 : 1 - 25