Online Learning Human Behavior for a Class of Human-in-the-Loop Systems via Adaptive Inverse Optimal Control

被引:17
|
作者
Wu, Huai-Ning [1 ,2 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Sci & Technol Aircraft Control Lab, Beijing 100190, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518066, Peoples R China
基金
中国国家自然科学基金;
关键词
Optimal control; Adaptive systems; Cost function; Task analysis; Linear matrix inequalities; Symmetric matrices; Trajectory; Adaptive estimation; human behavior learning; human-in-the-loop (HiTL); inverse optimal control (IOC); linear matrix inequality (LMI); linear quadratic regulator (LQR);
D O I
10.1109/THMS.2022.3155369
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To enhance the machines' intelligence, it is important for them to learn how humans perform tasks. In this article, the issue of online adaptive learning human behavior is addressed for a class of human-in-the-loop (HiTL) systems using the state measurement only. The hypothesis underlying our study is that human behavior can be described by a linear quadratic optimal control model with an unknown weighting matrix for the quadratic cost function. In this model, the weighting matrix depicts the human tradeoff of various objectives. Our aim is thus to only use the system state measurement for learning the weighting matrix under the condition that human feedback gain matrix is unknown. A novel adaptive inverse optimal control approach to online learning human behavior is proposed for the HiTL system, which integrates adaptive estimation and linear matrix inequality (LMI) optimization techniques. Our approach consists of two steps: First, an adaptive law is developed to learn the human feedback gain matrix online using the system state measurement only, and second, the weighting matrix of human cost function is retrieved by solving an LMI optimization problem with the learned feedback gain matrix. Finally, simulation and experiment results on a steering assist system of intelligent vehicles are presented to illustrate the effectiveness of the proposed method.
引用
收藏
页码:1004 / 1014
页数:11
相关论文
共 50 条
  • [1] Human-in-the-Loop Behavior Modeling via an Integral Concurrent Adaptive Inverse Reinforcement Learning
    Wu, Huai-Ning
    Wang, Mi
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) : 11359 - 11370
  • [2] Composite adaptive online inverse optimal control approach to human behavior learning
    Lin, Jie
    Wang, Mi
    Wu, Huai-Ning
    [J]. INFORMATION SCIENCES, 2023, 638
  • [3] A Finite-Horizon Inverse Linear Quadratic Optimal Control Method for Human-in-the-Loop Behavior Learning
    Wu, Huai-Ning
    Li, Wen-Hua
    Wang, Mi
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (06): : 3461 - 3470
  • [4] Human Behavior Learning for a Class of Nonlinear Human-in-the-Loop Systems via Takagi-Sugeno Fuzzy Model
    Wu, Huai-Ning
    Lin, Jie
    Wang, Mi
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (06) : 3355 - 3367
  • [5] Reachable Set Estimation of a class of Human-in-the-Loop Control Systems
    Zhang, Xiu-Mei
    Wu, Huai-Ning
    [J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 1083 - 1087
  • [6] Distributed Formation Control for a Class of Human-in-the-Loop Multiagent Systems
    Zhang, Xiao-Xiao
    Wu, Huai-Ning
    Wang, Jin-Liang
    [J]. IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2024, 54 (04) : 416 - 426
  • [7] Human models in human-in-the-loop control systems
    Mabrok, Mohamed A.
    Mohamed, Hassan K.
    Abdel-Aty, Abdel-Haleem
    Alzahrani, Ahmed S.
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (03) : 2611 - 2622
  • [8] Inverse Control for Inferring Intent in Novice Human-in-the-loop Iterative Learning
    Warrier, Rahul B.
    Devasia, Santosh
    [J]. 2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 2148 - 2154
  • [9] "Weak" Control for Human-in-the-Loop Systems
    Inoue, Masaki
    Gupta, Vijay
    [J]. IEEE CONTROL SYSTEMS LETTERS, 2019, 3 (02): : 440 - 445
  • [10] Stochastic Stability Analysis and Synthesis of a Class of Human-in-the-Loop Control Systems
    Wu, Huai-Ning
    Zhang, Xiu-Mei
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (02): : 822 - 832