A Finite-Horizon Inverse Linear Quadratic Optimal Control Method for Human-in-the-Loop Behavior Learning

被引:1
|
作者
Wu, Huai-Ning [1 ,2 ]
Li, Wen-Hua [3 ]
Wang, Mi [3 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Sci & Technol Aircraft Control Lab, Beijing 100191, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518000, Peoples R China
[3] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
基金
中国国家自然科学基金;
关键词
Finite-horizon inverse optimal control (FHIOC); human-in-the-loop (HiTL); recursive least squares; time convexity; ADAPTIVE-CONTROL; MODEL;
D O I
10.1109/TSMC.2024.3357973
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The key to enhancing machine intelligence is to make the machine learn how human beings perform tasks. In this article, the issue of finite-horizon inverse linear quadratic (LQ) optimal control is investigated for human behavior learning in a class of human-in-the-loop (HiTL) systems. A novel finite-horizon inverse optimal control (FHIOC) approach is developed by integrating time-varying parameter identification and linear matrix inequality (LMI) optimization techniques. The proposed approach covers three steps: by only using the system state measurement, 1) an offline identification method is developed to provide a batch least-squares estimation for the human time-varying feedback gain matrix; 2) a recursive least-squares adaptive law is proposed to online learn the human time-varying feedback gain in real time; and 3) the weighting matrices of the human cost function are recovered via the time-convexity and LMI optimization techniques with the learned time-varying feedback gain. Finally, the validity of the proposed methods is supported by a supplementary steering system of an intelligent vehicle.
引用
收藏
页码:3461 / 3470
页数:10
相关论文
共 50 条
  • [1] System identification approach for inverse optimal control of finite-horizon linear quadratic regulators
    Yu, Chengpu
    Li, Yao
    Fang, Hao
    Chen, Jie
    [J]. AUTOMATICA, 2021, 129
  • [2] Inverse optimal control for discrete-time finite-horizon Linear Quadratic Regulators
    Zhang, Han
    Umenberger, Jack
    Hu, Xiaoming
    [J]. AUTOMATICA, 2019, 110
  • [3] A unified approach to the finite-horizon linear quadratic optimal control problem
    Ferrante, Augusto
    Ntogramatzidis, Lorenzo
    [J]. EUROPEAN JOURNAL OF CONTROL, 2007, 13 (05) : 473 - 488
  • [4] Inverse Optimal Control for Finite-Horizon Discrete-time Linear Quadratic Regulator Under Noisy Output
    Zhang, Han
    Li, Yibei
    Hu, Xiaoming
    [J]. 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 6663 - 6668
  • [5] Finite-Horizon Indefinite Mean-Field Stochastic Linear-Quadratic Optimal Control
    Ni, Yuan-Hua
    Li, Xun
    Zhang, Ji-Feng
    [J]. IFAC PAPERSONLINE, 2015, 48 (28): : 211 - 216
  • [6] Online Learning Human Behavior for a Class of Human-in-the-Loop Systems via Adaptive Inverse Optimal Control
    Wu, Huai-Ning
    [J]. IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2022, 52 (05) : 1004 - 1014
  • [7] Finite-horizon and infinite-horizon linear quadratic optimal control problems: A data-driven Euler scheme
    Wang, Guangchen
    Zhang, Heng
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2024, 361 (13):
  • [8] Inverse linear-quadratic discrete-time finite-horizon optimal control for indistinguishable homogeneous agents: A convex optimization approach
    Zhang, Han
    Ringh, Axel
    [J]. AUTOMATICA, 2023, 148
  • [9] Finite-Horizon Linear-Quadratic Optimal Control of Discrete-Time Systems with Input Delay
    Ignaciuk, Przemyslaw
    [J]. 2014 18TH INTERNATIONAL CONFERENCE SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2014, : 797 - 802
  • [10] Finite-horizon inverse optimal control for discrete-time nonlinear systems
    Molloy, Timothy L.
    Ford, Jason J.
    Perez, Tristan
    [J]. AUTOMATICA, 2018, 87 : 442 - 446