Learning Control Barrier Functions from Expert Demonstrations

被引:0
|
作者
Robey, Alexander [1 ]
Hu, Haimin [1 ]
Lindemann, Lars [2 ]
Zhang, Hanwen [1 ]
Dimarogonas, Dimos, V [2 ]
Tu, Stephen [3 ]
Matni, Nikolai [1 ]
机构
[1] Univ Penn, Dept Elect & Syst Engn, Philadelphia, PA 19104 USA
[2] KTH Royal Inst Technol, Div Decis & Control Syst, Stockholm, Sweden
[3] Google Brain Robot, New York, NY USA
关键词
SAFETY;
D O I
10.1109/cdc42340.2020.9303785
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Inspired by the success of imitation and inverse reinforcement learning in replicating expert behavior through optimal control, we propose a learning based approach to safe controller synthesis based on control barrier functions (CBFs). We consider the setting of a known nonlinear control affine dynamical system and assume that we have access to safe trajectories generated by an expert - a practical example of such a setting would be a kinematic model of a self-driving vehicle with safe trajectories (e.g., trajectories that avoid collisions with obstacles in the environment) generated by a human driver. We then propose and analyze an optimization based approach to learning a CBF that enjoys provable safety guarantees under suitable Lipschitz smoothness assumptions on the underlying dynamical system. A strength of our approach is that it is agnostic to the parameterization used to represent the CBF, assuming only that the Lipschitz constant of such functions can be efficiently bounded. Furthermore, if the CBF parameterization is convex, then under mild assumptions, so is our learning process. We end with extensive numerical evaluations of our results on both planar and realistic examples, using both random feature and deep neural network parameterizations of the CBF. To the best of our knowledge, these are the first results that learn provably safe control barrier functions from data.
引用
收藏
页码:3717 / 3724
页数:8
相关论文
共 50 条
  • [1] Redundant robot control with learning from expert demonstrations
    Ramirez, Jorge
    Yu, Wen
    [J]. 2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 715 - 720
  • [2] Learning control lyapunov functions from counterexamples and demonstrations
    Hadi Ravanbakhsh
    Sriram Sankaranarayanan
    [J]. Autonomous Robots, 2019, 43 : 275 - 307
  • [3] Learning control lyapunov functions from counterexamples and demonstrations
    Ravanbakhsh, Hadi
    Sankaranarayanan, Sriram
    [J]. AUTONOMOUS ROBOTS, 2019, 43 (02) : 275 - 307
  • [4] Reinforcement learning from expert demonstrations with application to redundant robot control
    Ramirez, Jorge
    Yu, Wen
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 119
  • [5] Watch and Learn: Learning to control feedback linearizable systems from expert demonstrations
    Sultangazin, Alimzhan
    Pannocchi, Luigi
    Fraile, Lucas
    Tabuada, Paulo
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 8577 - 8583
  • [6] Learning Deformable Object Manipulation From Expert Demonstrations
    Salhotra, Gautam
    Liu, I-Chun Arthur
    Dominguez-Kuhne, Marcus
    Sukhatme, Gaurav S.
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 8775 - 8782
  • [7] Exploiting the experts: Learning to control unknown SISO feedback linearizable systems from expert demonstrations
    Sultangazin, Alimzhan
    Fraile, Lucas
    Tabuada, Paulo
    [J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 5789 - 5794
  • [8] A Multi-expert Agent for Efficient Learning from Demonstrations
    Chen, Yiwen
    Zhang, Zedong
    Liu, Haofeng
    Tan, Jiayi
    Chew, Chee-Meng
    Ang, Marcelo H.
    [J]. INTELLIGENT AUTONOMOUS SYSTEMS 18, VOL 1, IAS18-2023, 2024, 795 : 305 - 316
  • [9] Learning Stable Manoeuvres in Quadruped Robots from Expert Demonstrations
    Tirumala, Sashank
    Gubbi, Sagar
    Paigwar, Kartik
    Sagi, Aditya
    Joglekar, Ashish
    Bhatnagar, Shalabh
    Ghosal, Ashitava
    Amrutur, Bharadwaj
    Kolathaya, Shishir
    [J]. 2020 29TH IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2020, : 1107 - 1112
  • [10] Learning Lyapunov (Potential) Functions from Counterexamples and Demonstrations
    Ravanbakhsh, Hadi
    Sankaranarayanan, Sriram
    [J]. ROBOTICS: SCIENCE AND SYSTEMS XIII, 2017,