A Safe Reinforcement Learning driven Weights-varying Model Predictive Control for Autonomous Vehicle Motion Control

被引:0
|
作者
Zarrouki, Baha [1 ,2 ]
Spanakakis, Marios
Betz, Johannes
机构
[1] Tech Univ Munich, TUM Sch Engn & Design, Automot Technol, Munich, Germany
[2] Tech Univ Munich, TUM Sch Engn & Design, Autonomous Vehicle Syst, Munich, Germany
关键词
MPC;
D O I
10.1109/IV55156.2024.10588747
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Determining the optimal cost function parameters of Model Predictive Control (MPC) to optimize multiple control objectives is a challenging and time-consuming task. Multi-objective Bayesian Optimization (BO) techniques solve this problem by determining a Pareto optimal parameter set for an MPC with static weights. However, a single parameter set may not deliver the most optimal closed-loop control performance when the context of the MPC operating conditions changes during its operation, urging the need to adapt the cost function weights at runtime. Deep Reinforcement Learning (RL) algorithms can automatically learn context-dependent optimal parameter sets and dynamically adapt for a Weights-varying MPC (WMPC). However, learning cost function weights from scratch in a continuous action space may lead to unsafe operating states. To solve this, we propose a novel approach limiting the RL action space within a safe learning space that we represent by a catalog of pre-optimized feasible BO Pareto-optimal weight sets. We conceive an RL agent not to learn in a continuous space but to select the most optimal discrete actions, each corresponding to a single set of Pareto optimal weights, by proactively anticipating upcoming control tasks in a context-dependent manner. This approach introduces a two-step optimization: (1) safety-critical with BO and (2) performance-driven with RL. Hence, even an untrained RL agent guarantees a safe and optimal performance. Simulation results demonstrate that an untrained RL-WMPC shows Pareto-optimal closed-loop behavior and training the RL-WMPC helps exhibit a performance beyond the Pareto-front. The code used in this research is publicly accessible as open-source software: https://github.com/bzarr/TUM-CONTROL
引用
收藏
页码:1401 / 1408
页数:8
相关论文
共 50 条
  • [1] Weights-varying MPC for Autonomous Vehicle Guidance: a Deep Reinforcement Learning Approach
    Zarrouki, Baha
    Kloes, Verena
    Heppner, Nikolas
    Schwan, Simon
    Ritschel, Robert
    Vosswinkel, Rick
    2021 EUROPEAN CONTROL CONFERENCE (ECC), 2021, : 119 - 125
  • [2] Integration of Robust Control with Reinforcement Learning for Safe Autonomous Vehicle Motion
    Lelko, Attila
    Nemeth, Balazs
    Fenyes, Daniel
    Gaspar, Peter
    IFAC PAPERSONLINE, 2023, 56 (02): : 1101 - 1106
  • [3] Safe Reinforcement Learning using Data-Driven Predictive Control
    Selim, Mahmoud
    Alanwar, Amr
    El-Kharashi, M. Watheq
    Abbas, Hazem M.
    Johansson, Karl H.
    2022 5TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, SIGNAL PROCESSING, AND THEIR APPLICATIONS (ICCSPA), 2022,
  • [4] Prediction Horizon-Varying Model Predictive Control (MPC) for Autonomous Vehicle Control
    Chen, Zhenbin
    Lai, Jiaqin
    Li, Peixin
    Awad, Omar I.
    Zhu, Yubing
    ELECTRONICS, 2024, 13 (08)
  • [5] MACHINE LEARNING AND MODEL PREDICTIVE CONTROL FOR SAFE AUTONOMOUS SYSTEMS
    Findeisen, Rolf
    Matschek, Janine
    Maiworm, Michael
    Savchenko, Anton
    PROCEEDINGS OF THE7TH INTERNATIONAL CONFERENCE ON CONTROL AND OPTIMIZATION WITH INDUSTRIAL APPLICATIONS, VOL. 1, 2020, : 26 - 28
  • [6] Motion Control of Autonomous Vehicle with Domain-Centralized Electronic and Electrical Architecture based on Predictive Reinforcement Learning Control Method
    Du, Guodong
    Zou, Yuan
    Zhang, Xudong
    Zhao, Kaiyu
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 1409 - 1416
  • [7] Robust reinforcement learning with UUB guarantee for safe motion control of autonomous robots
    RuiXian Zhang
    YiNing Han
    Man Su
    ZeFeng Lin
    HaoWei Li
    LiXian Zhang
    Science China Technological Sciences, 2024, 67 : 172 - 182
  • [8] Robust reinforcement learning with UUB guarantee for safe motion control of autonomous robots
    Zhang, RuiXian
    Han, YiNing
    Su, Man
    Lin, ZeFeng
    Li, HaoWei
    Zhang, LiXian
    SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2024, 67 (01) : 172 - 182
  • [9] Robust reinforcement learning with UUB guarantee for safe motion control of autonomous robots
    ZHANG RuiXian
    HAN YiNing
    SU Man
    LIN ZeFeng
    LI HaoWei
    ZHANG LiXian
    Science China(Technological Sciences), 2024, 67 (01) : 172 - 182
  • [10] Model predictive control of an autonomous vehicle
    Kim, B
    Necsulescu, D
    Sasiadek, J
    2001 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS PROCEEDINGS, VOLS I AND II, 2001, : 1279 - 1284