Safe Reinforcement Learning Using Wasserstein Distributionally Robust MPC and Chance Constraint

Cited by: 5
Authors
Kordabad, Arash Bahari [1]
Wisniewski, Rafael [2]
Gros, Sebastien [1]
Affiliations
[1] Norwegian Univ Sci & Technol NTNU, Dept Engn Cybernet, N-7034 Trondheim, Norway
[2] Aalborg Univ, Dept Elect Syst, DK-9220 Aalborg, Denmark
Keywords
Safe reinforcement learning; model predictive control; distributionally robust optimization; chance constraint; conditional value at risk; Q-learning; optimization
DOI
10.1109/ACCESS.2022.3228922
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we address the chance-constrained safe Reinforcement Learning (RL) problem using function approximators based on Stochastic Model Predictive Control (SMPC) and Distributionally Robust Model Predictive Control (DRMPC). We use Conditional Value at Risk (CVaR) to measure the probability of constraint violation and safety. To provide a policy that is safe by construction, we first propose using a parameterized nonlinear DRMPC at each time step. DRMPC optimizes a finite-horizon cost function subject to the worst-case constraint violation over an ambiguity set. As the ambiguity set, we use a statistical ball around the empirical distribution whose radius is measured by the Wasserstein metric. Unlike SMPC based on sample average approximation, DRMPC provides a probabilistic guarantee on the out-of-sample risk and requires fewer samples of the disturbance. Q-learning is then used to optimize the DRMPC parameters to achieve the best closed-loop performance. Wheeled Mobile Robot (WMR) path planning with obstacle avoidance is considered to illustrate the efficiency of the proposed method.
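As a rough illustration of how CVaR can stand in for a chance constraint, the sketch below computes the CVaR of an empirical distribution of constraint values with the Rockafellar-Uryasev formula, CVaR = min_t { t + E[max(g - t, 0)] / epsilon }. It covers only the sample-average case, not the Wasserstein worst case used by DRMPC, and it is not taken from the paper: the function names, the sample values, and the violation level `epsilon` are all assumptions made for illustration.

```python
def empirical_cvar(samples, epsilon):
    """CVaR at level 1 - epsilon of the empirical distribution of `samples`.

    Rockafellar-Uryasev form: min over t of t + mean(max(g - t, 0)) / epsilon.
    The objective is convex and piecewise linear in t with breakpoints at the
    samples, so it is enough to evaluate it at each sample value.
    """
    if not 0.0 < epsilon <= 1.0:
        raise ValueError("epsilon must lie in (0, 1]")
    if not samples:
        raise ValueError("at least one sample is required")
    n = len(samples)

    def objective(t):
        excess = sum(max(g - t, 0.0) for g in samples)
        return t + excess / (epsilon * n)

    return min(objective(t) for t in samples)


def violation_fraction(samples):
    """Fraction of samples that violate the constraint g <= 0."""
    return sum(1 for g in samples if g > 0.0) / len(samples)


# Hypothetical constraint values g(x, w_i) for ten disturbance samples w_i;
# g <= 0 means the constraint holds for that sample.
g_samples = [-5.0, -4.0, -3.0, -3.0, -2.0, -2.0, -1.0, -1.0, -0.5, 0.2]
epsilon = 0.2  # assumed allowed violation probability

cvar = empirical_cvar(g_samples, epsilon)
print("CVaR:", cvar)
print("violation fraction:", violation_fraction(g_samples))
```

When the empirical CVaR is nonpositive, the empirical violation fraction is at most `epsilon`, which is why a CVaR constraint is a conservative surrogate for the chance constraint. A distributionally robust variant would additionally account for every distribution inside the Wasserstein ball around these samples.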
Pages: 130058-130067
Number of pages: 10