Learning Control Barrier Functions from Expert Demonstrations

被引:0
|
作者
Robey, Alexander [1 ]
Hu, Haimin [1 ]
Lindemann, Lars [2 ]
Zhang, Hanwen [1 ]
Dimarogonas, Dimos, V [2 ]
Tu, Stephen [3 ]
Matni, Nikolai [1 ]
机构
[1] Univ Penn, Dept Elect & Syst Engn, Philadelphia, PA 19104 USA
[2] KTH Royal Inst Technol, Div Decis & Control Syst, Stockholm, Sweden
[3] Google Brain Robot, New York, NY USA
关键词
SAFETY;
D O I
10.1109/cdc42340.2020.9303785
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Inspired by the success of imitation and inverse reinforcement learning in replicating expert behavior through optimal control, we propose a learning based approach to safe controller synthesis based on control barrier functions (CBFs). We consider the setting of a known nonlinear control affine dynamical system and assume that we have access to safe trajectories generated by an expert - a practical example of such a setting would be a kinematic model of a self-driving vehicle with safe trajectories (e.g., trajectories that avoid collisions with obstacles in the environment) generated by a human driver. We then propose and analyze an optimization based approach to learning a CBF that enjoys provable safety guarantees under suitable Lipschitz smoothness assumptions on the underlying dynamical system. A strength of our approach is that it is agnostic to the parameterization used to represent the CBF, assuming only that the Lipschitz constant of such functions can be efficiently bounded. Furthermore, if the CBF parameterization is convex, then under mild assumptions, so is our learning process. We end with extensive numerical evaluations of our results on both planar and realistic examples, using both random feature and deep neural network parameterizations of the CBF. To the best of our knowledge, these are the first results that learn provably safe control barrier functions from data.
引用
收藏
页码:3717 / 3724
页数:8
相关论文
共 50 条
  • [41] Learning Robust Hybrid Control Barrier Functions for Uncertain Systems
    Robey, Alexander
    Lindemann, Lars
    Tu, Stephen
    Matni, Nikolai
    [J]. IFAC PAPERSONLINE, 2021, 54 (05): : 1 - 6
  • [42] Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions
    Du, Desong
    Han, Shaohang
    Qi, Naiming
    Ammar, Haitham Bou
    Wang, Jun
    Pan, Wei
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9442 - 9448
  • [43] Learning From Sparse Demonstrations
    Jin, Wanxin
    Murphey, Todd D.
    Kulic, Dana
    Ezer, Neta
    Mou, Shaoshuai
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2023, 39 (01) : 645 - 664
  • [44] Learning to Generalize from Demonstrations
    Browne, Katie
    Nicolescu, Monica
    [J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2012, 12 (03) : 27 - 38
  • [45] A robotic shared control teleoperation method based on learning from demonstrations
    Xi, Bao
    Wang, Shuo
    Ye, Xuemei
    Cai, Yinghao
    Lu, Tao
    Wang, Rui
    [J]. INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2019, 16 (04): : 1 - 13
  • [46] A Wheeled Inverted Pendulum Learning Stable and Accurate Control from Demonstrations
    Jin, Shaokun
    Ou, Yongsheng
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (24):
  • [47] One-shot Assistance Estimation from Expert Demonstrations for a Shared Control Wheelchair System
    Kucukyilmaz, Ayse
    Demiris, Yiannis
    [J]. 2015 24TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2015, : 438 - 443
  • [48] Learning from Corrective Demonstrations
    Gutierrez, Reymundo A.
    Short, Elaine Schaertl
    Niekum, Scott
    Thomaz, Andrea L.
    [J]. HRI '19: 2019 14TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2019, : 712 - 714
  • [49] Learning Reward Functions by Integrating Human Demonstrations and Preferences
    Palan, Malayandi
    Shevchuk, Gleb
    Landolfi, Nicholas C.
    Sadigh, Dorsa
    [J]. ROBOTICS: SCIENCE AND SYSTEMS XV, 2019,
  • [50] Embedding expert demonstrations into clustering buffer for effective deep reinforcement learning
    Wang, Shihmin
    Zhao, Binqi
    Zhang, Zhengfeng
    Zhang, Junping
    Pu, Jian
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2023, 24 (11) : 1541 - 1556