Safety-Aware Apprenticeship Learning

被引:21
|
作者
Zhou, Weichao [1 ]
Li, Wenchao [1 ]
机构
[1] Boston Univ, Dept Elect & Comp Engn, Boston, MA 02215 USA
关键词
D O I
10.1007/978-3-319-96145-3_38
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Apprenticeship learning (AL) is a kind of Learning from Demonstration techniques where the reward function of a Markov Decision Process (MDP) is unknown to the learning agent and the agent has to derive a good policy by observing an expert's demonstrations. In this paper, we study the problem of how to make AL algorithms inherently safe while still meeting its learning objective. We consider a setting where the unknown reward function is assumed to be a linear combination of a set of state features, and the safety property is specified in Probabilistic Computation Tree Logic (PCTL). By embedding probabilistic model checking inside AL, we propose a novel counterexample-guided approach that can ensure safety while retaining performance of the learnt policy. We demonstrate the effectiveness of our approach on several challenging AL scenarios where safety is essential.
引用
收藏
页码:662 / 680
页数:19
相关论文
共 50 条
  • [1] Safety-Aware Preference-Based Learning for Safety-Critical Control
    Cosner, Ryan K.
    Tucker, Maegan
    Taylor, Andrew J.
    Li, Kejun
    Molnar, Tamas G.
    Ubellacker, Wyatt
    Alan, Anil
    Orosz, Gabor
    Yue, Yisong
    Ames, Aaron D.
    [J]. LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 168, 2022, 168
  • [2] Safety-aware Graph-based Semi-Supervised Learning
    Gan, Haitao
    Li, Zhenhua
    Wu, Wei
    Luo, Zhizeng
    Huang, Rui
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 107 : 243 - 254
  • [3] Safety-Aware Metric for People Detection
    Wolf, Mirja
    Douat, Luiz R.
    Erz, Michael
    [J]. 2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2759 - 2765
  • [4] Safety-Aware Adversarial Inverse Reinforcement Learning for Highway Autonomous Driving
    Li, Fangjian
    Wagner, John
    Wang, Yue
    [J]. Journal of Autonomous Vehicles and Systems, 2021, 1 (04):
  • [5] Safety-Aware Unsupervised Skill Discovery
    Kim, Sunin
    Kwon, Jaewoon
    Lee, Taeyoon
    Park, Younghyo
    Perez, Julien
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 894 - 900
  • [6] Safety-Aware Learning-Based Control of Systems with Uncertainty Dependent Constraints
    Chekan, Jafar Abbaszadeh
    Langbort, Cedric
    [J]. 2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 1264 - 1270
  • [7] Safety-Aware Reinforcement Learning Framework with an Actor-Critic-Barrier Structure
    Yang, Yongliang
    Vamvoudakis, Kyriakos G.
    Modares, Hamidreza
    He, Wei
    Yin, Yixin
    Wunsch, Donald C.
    [J]. 2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, : 2352 - 2358
  • [8] Safety-Aware Semi-Supervised Classification
    Wang, Yunyun
    Chen, Songcan
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (11) : 1763 - 1772
  • [9] Safety-Aware Algorithms for Adversarial Contextual Bandit
    Sun, Wen
    Dey, Debadeepta
    Kapoor, Ashish
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [10] Safety-Aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving
    Lin, Haohong
    Ding, Wenhao
    Liu, Zuxin
    Niu, Yaru
    Zhu, Jiacheng
    Niu, Yuming
    Zhao, Ding
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (05) : 4639 - 4646