A Safe Reinforcement Learning driven Weights-varying Model Predictive Control for Autonomous Vehicle Motion Control

Cited by: 0
Authors
Zarrouki, Baha [1, 2]
Spanakakis, Marios
Betz, Johannes
Affiliations
[1] Tech Univ Munich, TUM Sch Engn & Design, Automot Technol, Munich, Germany
[2] Tech Univ Munich, TUM Sch Engn & Design, Autonomous Vehicle Syst, Munich, Germany
Keywords
MPC;
DOI
10.1109/IV55156.2024.10588747
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Determining the optimal cost function parameters of Model Predictive Control (MPC) to optimize multiple control objectives is a challenging and time-consuming task. Multi-objective Bayesian Optimization (BO) techniques solve this problem by determining a Pareto optimal parameter set for an MPC with static weights. However, a single parameter set may not deliver the most optimal closed-loop control performance when the context of the MPC operating conditions changes during its operation, urging the need to adapt the cost function weights at runtime. Deep Reinforcement Learning (RL) algorithms can automatically learn context-dependent optimal parameter sets and dynamically adapt for a Weights-varying MPC (WMPC). However, learning cost function weights from scratch in a continuous action space may lead to unsafe operating states. To solve this, we propose a novel approach limiting the RL action space within a safe learning space that we represent by a catalog of pre-optimized feasible BO Pareto-optimal weight sets. We conceive an RL agent not to learn in a continuous space but to select the most optimal discrete actions, each corresponding to a single set of Pareto optimal weights, by proactively anticipating upcoming control tasks in a context-dependent manner. This approach introduces a two-step optimization: (1) safety-critical with BO and (2) performance-driven with RL. Hence, even an untrained RL agent guarantees a safe and optimal performance. Simulation results demonstrate that an untrained RL-WMPC shows Pareto-optimal closed-loop behavior and training the RL-WMPC helps exhibit a performance beyond the Pareto-front. The code used in this research is publicly accessible as open-source software: https://github.com/bzarr/TUM-CONTROL
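The discrete action space described in the abstract can be illustrated with a minimal sketch. It is not taken from the linked repository: the catalog entries, the weight names (`q_lateral`, `q_velocity`, `r_steering`), and the functions `select_weights` and `untrained_policy` are hypothetical placeholders. The only point shown is that when each action is an index into a fixed catalog of pre-optimized weight sets, even a random (untrained) policy can only ever hand the MPC a weight set from that catalog.

```python
import random
from typing import Dict, List

# Hypothetical catalog: each entry stands in for one pre-optimized,
# Pareto-optimal weight set. Names and numbers are placeholders,
# not values from the paper or its code.
WEIGHT_CATALOG: List[Dict[str, float]] = [
    {"q_lateral": 1.0, "q_velocity": 0.5, "r_steering": 0.1},
    {"q_lateral": 2.0, "q_velocity": 0.2, "r_steering": 0.3},
    {"q_lateral": 0.5, "q_velocity": 1.0, "r_steering": 0.2},
]


def select_weights(action: int, catalog: List[Dict[str, float]]) -> Dict[str, float]:
    """Map a discrete action (catalog index) to one weight set."""
    if not 0 <= action < len(catalog):
        raise ValueError(f"action {action} is outside the catalog")
    return catalog[action]


def untrained_policy(n_actions: int, rng: random.Random) -> int:
    """Stand-in for an untrained agent: pick a catalog index uniformly."""
    return rng.randrange(n_actions)


rng = random.Random(0)
# Every weight set chosen over 100 steps comes from the catalog.
chosen = [
    select_weights(untrained_policy(len(WEIGHT_CATALOG), rng), WEIGHT_CATALOG)
    for _ in range(100)
]
```

A trained agent would replace `untrained_policy` with a learned mapping from the driving context to a catalog index; the set of reachable weight sets stays the same.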
Pages: 1401-1408
Page count: 8
Related papers
50 in total
  • [31] Autonomous Underwater Vehicle Motion Planning via Sampling Based Model Predictive Control
    Wang, Lin-Lin
    Wang, Hong-Jian
    Pan, Li-Xin
    APPLIED MECHANICS, MATERIALS AND MANUFACTURING IV, 2014, 670-671 : 1370 - 1377
  • [32] Stability of model predictive control with time-varying weights
    Zheng, A
    COMPUTERS & CHEMICAL ENGINEERING, 1997, 21 (12) : 1389 - 1393
  • [33] Motion control of autonomous underwater vehicle based on physics-informed offline reinforcement learning
    Li, Xinmao
    Geng, Lingbo
    Liu, Kaizhou
    Zhao, Yifeng
    Du, Weifeng
    OCEAN ENGINEERING, 2024, 313
  • [34] Autonomous Vehicle Longitudinal Following Control Based On Model Predictive Control
    Wang Qiu
    Qu Ting
    Yu Shuyou
    Guo Hongyan
    Chen Hong
    2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 8126 - 8131
  • [35] A Model Predictive Control Scheme for Autonomous Underwater Vehicle Formation Control
    Gomes, Rui
    Pereira, Fernando Lobo
    2018 13TH APCA INTERNATIONAL CONFERENCE ON CONTROL AND SOFT COMPUTING (CONTROLO), 2018, : 195 - 200
  • [36] Data-driven Predictive Control for Safe Motion Planning
    Dai, Li
    Huang, Teng
    Gao, Yulong
    Li, Sihang
    Deng, Yunshan
    Xia, Yuanqing
    UNMANNED SYSTEMS, 2025,
  • [37] Proximate Model Predictive Control Strategy for Autonomous Vehicle Lateral Control
    Lee, Seung-Hi
    Lee, Young Ok
    Son, Youngseop
    Chung, Chung Choo
    2011 11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2011, : 590 - 595
  • [38] Proximate Model Predictive Control Strategy for Autonomous Vehicle Lateral Control
    Lee, Seung-Hi
    Lee, Young Ok
    Kim, Bo-Ah
    Chung, Chung Choo
    2012 AMERICAN CONTROL CONFERENCE (ACC), 2012, : 3605 - 3610
  • [39] Intermittent model predictive control of an autonomous underwater vehicle
    Truong, Quan
    Wang, Liuping
    Gawthrop, P.
    2006 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1- 5, 2006, : 1876 - +
  • [40] Model Predictive Control for Full Autonomous Vehicle Overtaking
    Lamouik, Imad
    Yahyaouy, Ali
    Sabri, My Abdelouahed
    TRANSPORTATION RESEARCH RECORD, 2023, 2677 (05) : 1193 - 1207