Safe and Stable RL (S2RL) Driving Policies Using Control Barrier and Control Lyapunov Functions

被引:6
|
作者
Gangopadhyay, Briti [1 ]
Dasgupta, Pallab [1 ]
Dey, Soumyajit [1 ]
机构
[1] IIT Kharagpur, Dept Comp Sci Engn, Kharagpur, West Bengal, India
来源
关键词
Safety; Training; Stability analysis; Autonomous vehicles; Vehicle dynamics; Task analysis; Lyapunov methods; Reinforcement learning; autonomous vehicles;
D O I
10.1109/TIV.2022.3160202
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Reinforcement Learning (DRL) has been successfully applied to learn policies for safety-critical systems with unknown model dynamics in simulation. DRL controllers though optimal in terms of reward, do not provide any safety and stability guarantees. With reliance on model information, safety conditions can be expressed as Control Barrier Functions (CBF's) and performance objectives can be expressed as Control Lyapunov Functions (CLF's) for real-time optimization-based controllers. In this work, we use an amalgamation of model-free RL and model-based controllers to establish safety and stability. We first design CLF, CBF Quadratic Programs (QP's) for different driving manoeuvres on nominal vehicle dynamics. Reinforcement Learning (RL) agents are trained to learn policies for the actual vehicle with enhanced dynamics. In order to incorporate safety and stability while retaining optimal behaviour we selectively guide the RL agents using CLF, CBF QP's. This results in both safe and stable ((SRL)-R-2) policies. We empirically validate the proposed methodology on different driving manoeuvres.
引用
收藏
页码:1889 / 1899
页数:11
相关论文
共 50 条
  • [31] Safe exploration in model-based reinforcement learning using control barrier functions
    Cohen, Max H.
    Belta, Calin
    [J]. AUTOMATICA, 2023, 147
  • [32] Safe Navigation and Obstacle Avoidance Using Differentiable Optimization Based Control Barrier Functions
    Dai, Bolun
    Khorrambakht, Rooholla
    Krishnamurthy, Prashanth
    Gonçalves, Vinícius
    Tzes, Anthony
    Khorrami, Farshad
    [J]. arXiv, 2023,
  • [33] Safe Navigation and Obstacle Avoidance Using Differentiable Optimization Based Control Barrier Functions
    Dai, Bolun
    Khorrambakht, Rooholla
    Krishnamurthy, Prashanth
    Goncalves, Vinicius
    Tzes, Anthony
    Khorrami, Farshad
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (09) : 5376 - 5383
  • [34] Semi-Global Adaptive Control Using Barrier Lyapunov Functions For A Class of Uncertain Nonlinear Systems
    Chen Pengnian
    Sun Lingling
    [J]. DIGITAL MANUFACTURING & AUTOMATION III, PTS 1 AND 2, 2012, 190-191 : 1053 - 1056
  • [35] Distributed Coordination Control for Multi-Robot Networks Using Lyapunov-Like Barrier Functions
    Panagou, Dimitra
    Stipanovic, Dusan M.
    Voulgaris, Petros G.
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2016, 61 (03) : 617 - 632
  • [36] An improved adaptive online neural control for robot manipulator systems using integral Barrier Lyapunov functions
    Xia, Jun
    Zhang, Yujia
    Yang, Chenguang
    Wang, Min
    Annamalai, Andy
    [J]. INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2019, 50 (03) : 638 - 651
  • [37] Design of stable fuzzy control systems using Lyapunov's method in fuzzy hypercubes
    Chen, CS
    [J]. FUZZY SETS AND SYSTEMS, 2003, 139 (01) : 95 - 110
  • [38] Safe Navigation of Networked Robots under Localization Uncertainty using Robust Control Barrier Functions
    Miksits, Adam
    Barbosa, Fernando S.
    Lindhe, Magnus
    Araujo, Jose
    Johansson, Karl H.
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 6064 - 6071
  • [39] Safe and Efficient Reinforcement Learning Using Disturbance-Observer-Based Control Barrier Functions
    Cheng, Yikun
    Zhao, Pan
    Hovakimyan, Naira
    [J]. LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
  • [40] Reactive Safe Path Following for Differential Drive Mobile Robots Using Control Barrier Functions
    Toulkani, Naeim Ebrahimi
    Abdi, Hossein
    Koskelainen, Olli
    Ghabcheloo, Reza
    [J]. 2022 10TH INTERNATIONAL CONFERENCE ON CONTROL, MECHATRONICS AND AUTOMATION (ICCMA 2022), 2022, : 60 - 65