Safe and Stable RL (S2RL) Driving Policies Using Control Barrier and Control Lyapunov Functions

被引:6
|
作者
Gangopadhyay, Briti [1 ]
Dasgupta, Pallab [1 ]
Dey, Soumyajit [1 ]
机构
[1] IIT Kharagpur, Dept Comp Sci Engn, Kharagpur, West Bengal, India
来源
关键词
Safety; Training; Stability analysis; Autonomous vehicles; Vehicle dynamics; Task analysis; Lyapunov methods; Reinforcement learning; autonomous vehicles;
D O I
10.1109/TIV.2022.3160202
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Reinforcement Learning (DRL) has been successfully applied to learn policies for safety-critical systems with unknown model dynamics in simulation. DRL controllers though optimal in terms of reward, do not provide any safety and stability guarantees. With reliance on model information, safety conditions can be expressed as Control Barrier Functions (CBF's) and performance objectives can be expressed as Control Lyapunov Functions (CLF's) for real-time optimization-based controllers. In this work, we use an amalgamation of model-free RL and model-based controllers to establish safety and stability. We first design CLF, CBF Quadratic Programs (QP's) for different driving manoeuvres on nominal vehicle dynamics. Reinforcement Learning (RL) agents are trained to learn policies for the actual vehicle with enhanced dynamics. In order to incorporate safety and stability while retaining optimal behaviour we selectively guide the RL agents using CLF, CBF QP's. This results in both safe and stable ((SRL)-R-2) policies. We empirically validate the proposed methodology on different driving manoeuvres.
引用
收藏
页码:1889 / 1899
页数:11
相关论文
共 50 条
  • [1] Optimal control barrier functions for RL based safe powertrain control
    Hailemichael, Habtamu
    Ayalew, Beshah
    Ivanco, Andrej
    [J]. IFAC PAPERSONLINE, 2023, 56 (03): : 385 - 390
  • [2] Safe Learning for Control using Control Lyapunov Functions and Control Barrier Functions: A Review
    Anand, Akhil
    Seel, Katrine
    Gjaerum, Vilde
    Hakansson, Anne
    Robinson, Haakon
    Saad, Aya
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021), 2021, 192 : 3987 - 3997
  • [3] Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions
    Du, Desong
    Han, Shaohang
    Qi, Naiming
    Ammar, Haitham Bou
    Wang, Jun
    Pan, Wei
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9442 - 9448
  • [4] Safe Control for Navigation in Cluttered Space Using Multiple Lyapunov-Based Control Barrier Functions
    Jang, Inkyu
    Kim, H. Jin
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (03): : 2056 - 2063
  • [5] VECTOR CONTROL LYAPUNOV AND BARRIER FUNCTIONS FOR SAFE STABILIZATION OF INTERCONNECTED SYSTEMS
    Ren, Wei
    Li, Jingjie
    Xiong, Junlin
    Sun, Xi-ming
    [J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2023, 61 (05) : 3209 - 3233
  • [6] Online Policies for Real-Time Control Using MRAC-RL
    Guha, Anubhav
    Annaswamy, Anuradha M.
    [J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 1808 - 1813
  • [7] Safe control synthesis using environmentally robust control barrier functions
    Hamdipoor, Vahid
    Meskin, Nader
    Cassandras, Christos G.
    [J]. EUROPEAN JOURNAL OF CONTROL, 2023, 74
  • [8] Safe Battery Control Using Cascade-Control-Barrier Functions
    Feng, Shuang
    de Castro, Ricardo
    Ebrahimi, Iman
    [J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2024,
  • [9] HJB Based Optimal Safe Control using Control Barrier Functions
    Almubarak, Hassan
    Theodorou, Evangelos A.
    Sadegh, Nader
    [J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 6829 - 6834
  • [10] Adaptive Robust Quadratic Programs using Control Lyapunov and Barrier Functions
    Zhao, Pan
    Mao, Yanbing
    Tao, Chuyuan
    Hovakimyan, Naira
    Wang, Xiaofeng
    [J]. 2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 3353 - 3358