Inverse Model Predictive Control: Learning Optimal Control Cost Functions for MPC

被引:0
|
作者
Zhang, Fawang [1 ]
Duan, Jingliang [2 ]
Xu, Haoyuan [2 ]
Chen, Hao [3 ]
Liu, Hui [1 ]
Nie, Shida [1 ]
Li, Shengbo Eben [4 ]
机构
[1] Beijing Inst Technol, Sch Mech Engn, Beijing, Peoples R China
[2] Univ Sci & Technol Beijing, Sch Mech Engn, Beijing 100083, Peoples R China
[3] Chongqing Univ, Coll Mech & Vehicle Engn, Chongqing 400044, Peoples R China
[4] Tsinghua Univ, Sch Vehicle & Mobil, State Key Lab Automot Safety & Energy, Beijing 100084, Peoples R China
基金
美国国家科学基金会;
关键词
Trajectory; Costs; Cost function; Task analysis; Optimal control; Predictive control; Tuning; Bilevel optimization; imitation learning; inverse model predictive control (IMPC); ROBOTS;
D O I
10.1109/TII.2024.3424238
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Inverse optimal control (IOC) seeks to infer a control cost function that captures the underlying goals and preferences of expert demonstrations. While significant progress has been made in finite-horizon IOC, which focuses on learning control cost functions based on rollout trajectories rather than actual trajectories, the application of IOC to receding horizon control, also known as model predictive control (MPC), has been overlooked. MPC is more prevalent in practical settings and poses additional challenges for IOC learning since it is complicated to calculate the gradient of actual trajectories with respect to cost parameters. In light of this, we propose the inverse MPC (IMPC) method to identify the optimal cost function that effectively minimizes the discrepancy between the actual trajectory and its associated demonstration. To compute the gradient of actual trajectories with respect to cost parameters, we first establish two differential Pontryagin's maximum principle (PMP) conditions by differentiating the traditional PMP conditions with respect to cost parameters and initial states, respectively. We then formulate two auxiliary optimal control problems based on the derived differentiated PMP conditions, whose solutions can be directly used to determine the gradient for updating cost parameters. We validate the efficacy of the proposed method through experiments involving five simulation tasks and two real-world mobile robot control tasks. The results consistently demonstrate that IMPC outperforms existing finite-horizon IOC methods across all experiments.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Inverse Optimal Control for Multiphase Cost Functions
    Jin, Wanxin
    Kulic, Dana
    Lin, Jonathan Feng-Shun
    Mou, Shaoshuai
    Hirche, Sandra
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2019, 35 (06) : 1387 - 1398
  • [2] Cost-to-travel functions: A new perspective on optimal and model predictive control
    Houska, Boris
    Mueller, Matthias A.
    [J]. SYSTEMS & CONTROL LETTERS, 2017, 106 : 79 - 86
  • [3] Model Predictive Control and the Selection of Cost Functions
    Rivera, Marco
    Rojas, Diego
    Wheeler, Patrick
    [J]. 2021 23RD EUROPEAN CONFERENCE ON POWER ELECTRONICS AND APPLICATIONS (EPE'21 ECCE EUROPE), 2021,
  • [4] Fusion of inverse optimal and model predictive control strategies
    Ulusoy, Luetfi
    Guzelkaya, Mujde
    Eksin, Ibrahim
    [J]. TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2020, 42 (06) : 1122 - 1134
  • [5] Optimal Signal Design in System Identification for Model Predictive Control (MPC)
    Sarker, Md. Tanjil
    Ramasamy, Gobbi
    [J]. IEEE ACCESS, 2023, 11 : 140229 - 140237
  • [6] Capability model of predictive control (MPC)
    Predictive Control Ltd
    [J]. Meas Control, 4 (106-111):
  • [7] The capability model of predictive control (MPC)
    Sandoz, DJ
    [J]. MEASUREMENT & CONTROL, 1996, 29 (04): : 106 - 111
  • [8] Inverse Optimal Control Approach to Model Predictive Control for Linear System Models
    Ulusoy, Lutfi
    Guzelkaya, Mujde
    Eksin, Ibrahim
    [J]. 2017 10TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ELECO), 2017, : 823 - 827
  • [9] The Selection of Cost Functions in Model Predictive Control Applications
    Rivera, Marco
    Rojas, Diego
    Wheeler, Patrick
    [J]. 2021 21ST INTERNATIONAL SYMPOSIUM ON POWER ELECTRONICS (EE 2021), 2021,
  • [10] Inverse optimal control and construction of control Lyapunov functions
    Shahmansoorian A.
    [J]. Journal of Mathematical Sciences, 2009, 161 (2) : 297 - 307