Online Learning Human Behavior for a Class of Human-in-the-Loop Systems via Adaptive Inverse Optimal Control

被引：17

作者：

Wu, Huai-Ning ^{[1
,2
]}

机构：

[1] Beihang Univ, Sch Automat Sci & Elect Engn, Sci & Technol Aircraft Control Lab, Beijing 100190, Peoples R China

[2] Peng Cheng Lab, Shenzhen 518066, Peoples R China

来源：

IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS | 2022年 / 52卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Optimal control; Adaptive systems; Cost function; Task analysis; Linear matrix inequalities; Symmetric matrices; Trajectory; Adaptive estimation; human behavior learning; human-in-the-loop (HiTL); inverse optimal control (IOC); linear matrix inequality (LMI); linear quadratic regulator (LQR);

D O I：

10.1109/THMS.2022.3155369

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

To enhance the machines' intelligence, it is important for them to learn how humans perform tasks. In this article, the issue of online adaptive learning human behavior is addressed for a class of human-in-the-loop (HiTL) systems using the state measurement only. The hypothesis underlying our study is that human behavior can be described by a linear quadratic optimal control model with an unknown weighting matrix for the quadratic cost function. In this model, the weighting matrix depicts the human tradeoff of various objectives. Our aim is thus to only use the system state measurement for learning the weighting matrix under the condition that human feedback gain matrix is unknown. A novel adaptive inverse optimal control approach to online learning human behavior is proposed for the HiTL system, which integrates adaptive estimation and linear matrix inequality (LMI) optimization techniques. Our approach consists of two steps: First, an adaptive law is developed to learn the human feedback gain matrix online using the system state measurement only, and second, the weighting matrix of human cost function is retrieved by solving an LMI optimization problem with the learned feedback gain matrix. Finally, simulation and experiment results on a steering assist system of intelligent vehicles are presented to illustrate the effectiveness of the proposed method.

引用

页码：1004 / 1014

页数：11

共 50 条

[1] Human-in-the-Loop Behavior Modeling via an Integral Concurrent Adaptive Inverse Reinforcement Learning
Wu, Huai-Ning
Wang, Mi
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) : 11359 - 11370
[2] Composite adaptive online inverse optimal control approach to human behavior learning
Lin, Jie
Wang, Mi
Wu, Huai-Ning
[J]. INFORMATION SCIENCES, 2023, 638
[3] A Finite-Horizon Inverse Linear Quadratic Optimal Control Method for Human-in-the-Loop Behavior Learning
Wu, Huai-Ning
Li, Wen-Hua
Wang, Mi
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (06): : 3461 - 3470
[4] Human Behavior Learning for a Class of Nonlinear Human-in-the-Loop Systems via Takagi-Sugeno Fuzzy Model
Wu, Huai-Ning
Lin, Jie
Wang, Mi
[J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (06) : 3355 - 3367
[5] Reachable Set Estimation of a class of Human-in-the-Loop Control Systems
Zhang, Xiu-Mei
Wu, Huai-Ning
[J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 1083 - 1087
[6] Distributed Formation Control for a Class of Human-in-the-Loop Multiagent Systems
Zhang, Xiao-Xiao
Wu, Huai-Ning
Wang, Jin-Liang
[J]. IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2024, 54 (04) : 416 - 426
[7] Human models in human-in-the-loop control systems
Mabrok, Mohamed A.
Mohamed, Hassan K.
Abdel-Aty, Abdel-Haleem
Alzahrani, Ahmed S.
[J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (03) : 2611 - 2622
[8] Inverse Control for Inferring Intent in Novice Human-in-the-loop Iterative Learning
Warrier, Rahul B.
Devasia, Santosh
[J]. 2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 2148 - 2154
[9] "Weak" Control for Human-in-the-Loop Systems
Inoue, Masaki
Gupta, Vijay
[J]. IEEE CONTROL SYSTEMS LETTERS, 2019, 3 (02): : 440 - 445
[10] Stochastic Stability Analysis and Synthesis of a Class of Human-in-the-Loop Control Systems
Wu, Huai-Ning
Zhang, Xiu-Mei
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (02): : 822 - 832

← 1 2 3 4 5 →