Safe and Stable RL (S2RL) Driving Policies Using Control Barrier and Control Lyapunov Functions

Cited by: 6
Authors
Gangopadhyay, Briti [1]
Dasgupta, Pallab [1]
Dey, Soumyajit [1]
Affiliations
[1] IIT Kharagpur, Dept Comp Sci Engn, Kharagpur, West Bengal, India
Source
Keywords
Safety; Training; Stability analysis; Autonomous vehicles; Vehicle dynamics; Task analysis; Lyapunov methods; Reinforcement learning
DOI
10.1109/TIV.2022.3160202
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Deep Reinforcement Learning (DRL) has been successfully applied to learn policies for safety-critical systems with unknown model dynamics in simulation. DRL controllers, though optimal in terms of reward, do not provide any safety or stability guarantees. When model information is available, safety conditions can be expressed as Control Barrier Functions (CBFs) and performance objectives as Control Lyapunov Functions (CLFs) for real-time optimization-based controllers. In this work, we combine model-free RL with model-based controllers to establish safety and stability. We first design CLF-CBF Quadratic Programs (QPs) for different driving manoeuvres on nominal vehicle dynamics. Reinforcement Learning (RL) agents are then trained to learn policies for the actual vehicle with enhanced dynamics. To incorporate safety and stability while retaining optimal behaviour, we selectively guide the RL agents using the CLF-CBF QPs. This results in policies that are both safe and stable (S2RL). We empirically validate the proposed methodology on different driving manoeuvres.
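To make the idea of guiding an RL action with a barrier condition concrete, here is a minimal sketch. It is not the authors' formulation: it assumes a simplified car-following model (gap `d`, ego speed `v`, lead vehicle at constant speed `v_lead`, acceleration as the control), a hypothetical time-headway barrier h = d - headway * v, and a CBF-only filter with no CLF term. With a single scalar control and one affine constraint, the QP "minimise (u - u_rl)^2 subject to dh/dt + alpha * h >= 0" has the closed-form solution used below, so no solver is needed.

```python
def cbf_filter(u_rl, d, v, v_lead, headway=1.8, alpha=1.0):
    """Project an RL acceleration command onto the CBF-safe set.

    Assumed model (illustrative only): d' = v_lead - v, v' = u.
    Assumed barrier: h = d - headway * v, so h' = (v_lead - v) - headway * u.
    The CBF condition h' + alpha * h >= 0 is an upper bound on u.
    """
    h = d - headway * v
    # Largest acceleration that still satisfies h' + alpha * h >= 0.
    u_bound = ((v_lead - v) + alpha * h) / headway
    # Closed-form minimiser of (u - u_rl)^2 under the single bound u <= u_bound.
    return min(u_rl, u_bound)


# Large gap: the RL action already satisfies the constraint and is unchanged.
u_far = cbf_filter(u_rl=1.0, d=100.0, v=20.0, v_lead=20.0)

# Small gap with a slower lead vehicle: the filter overrides with braking.
u_near = cbf_filter(u_rl=1.0, d=30.0, v=20.0, v_lead=15.0)
```

The filter only intervenes when the learned action would violate the barrier condition, which is one simple way to read "selectively guiding" a policy; the parameter names and default values here are placeholders chosen for illustration.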
Pages: 1889-1899
Number of pages: 11
Related papers
50 in total
  • [21] Robust control of linear systems under input saturation using Barrier Lyapunov functions
    Mera M.
    Salgado I.
    [J]. International Journal of Dynamics and Control, 2018, 6 (3) : 1231 - 1238
  • [22] Safety-Critical and Constrained Geometric Control Synthesis using Control Lyapunov and Control Barrier Functions for Systems Evolving on Manifolds
    Wu, Guofan
    Sreenath, Koushil
    [J]. 2015 AMERICAN CONTROL CONFERENCE (ACC), 2015, : 2038 - 2044
  • [23] Safe Navigation in Human Occupied Environments Using Sampling and Control Barrier Functions
    Majd, Keyvan
    Yaghoubi, Shakiba
    Yamaguchi, Tomoya
    Hoxha, Bardh
    Prokhorov, Danil
    Fainekos, Georgios
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 5794 - 5800
  • [24] Safe and Robust Observer-Controller Synthesis Using Control Barrier Functions
    Agrawal, Devansh R.
    Panagou, Dimitra
    [J]. IEEE CONTROL SYSTEMS LETTERS, 2022, 7 : 127 - 132
  • [25] Safe-by-construction autonomous vehicle overtaking using control barrier functions and model predictive control
    Yuan, Dingran
    Yu, Xinyi
    Li, Shaoyuan
    Yin, Xiang
    [J]. INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2024, 55 (07) : 1283 - 1303
  • [26] Safe Control for Soft-Rigid Robots with Self-Contact using Control Barrier Functions
    Patterson, Zach J.
    Xiao, Wei
    Sologuren, Emily
    Rus, Daniela
    [J]. 2024 IEEE 7TH INTERNATIONAL CONFERENCE ON SOFT ROBOTICS, ROBOSOFT, 2024, : 151 - 156
  • [27] Safe driving with control barrier functions in mixed autonomy traffic when cut-ins occur
    Gunter, George
    Work, Daniel
    [J]. 2022 EUROPEAN CONTROL CONFERENCE (ECC), 2022, : 411 - 416
  • [28] Formation Control and Obstacle Avoidance for Multi-agent Systems Using Barrier Lyapunov Functions
    Lian, Jie
    Meng, Yang
    Li, Li-li
    [J]. 2018 8TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST 2018), 2018, : 15 - 20
  • [29] Robust Control Barrier Functions for Safe Control Under Uncertainty Using Extended State Observer and Output Measurement
    Chen, Jinfeng
    Gao, Zhiqiang
    Lin, Qin
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 8477 - 8482
  • [30] Safe exploration in model-based reinforcement learning using control barrier functions
    Cohen, Max H.
    Belta, Calin
    [J]. AUTOMATICA, 2023, 147