Inverse linear-quadratic discrete-time finite-horizon optimal control for indistinguishable homogeneous agents: A convex optimization approach

被引：6

作者：

Zhang, Han ^{[1
,2
,3
]}

Ringh, Axel ^{[4
,5
,6
]}

机构：

[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Dept Automat, Shanghai, Peoples R China

[2] Minist Educ China, Key Lab Syst Control & Informat Proc, Shanghai 200240, Peoples R China

[3] Shanghai Engn Res Ctr Intelligent Control & Manag, Shanghai 200240, Peoples R China

[4] Chalmers Univ Technol, Dept Math Sci, S-41296 Gothenburg, Sweden

[5] Univ Gothenburg, S-41296 Gothenburg, Sweden

[6] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Kowloon, Clear Water Bay, Hong Kong, Peoples R China

来源：

AUTOMATICA | 2023年 / 148卷

基金：

中国国家自然科学基金;

关键词：

Inverse optimal control; Linear quadratic regulator; System identification; Closed-loop identification; Time-varying system matrices; Convex optimization; Semidefinite programming;

D O I：

10.1016/j.automatica.2022.110758

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The inverse linear-quadratic optimal control problem is a system identification problem whose aim is to recover the quadratic cost function and hence the closed-loop system matrices based on observations of optimal trajectories. In this paper, the discrete-time, finite-horizon case is considered, where the agents are also assumed to be homogeneous and indistinguishable. The latter means that the agents all have the same dynamics and objective functions and the observations are in terms of "snap shots" of all agents at different time instants, but what is not known is "which agent moved where" for consecutive observations. This absence of linked optimal trajectories makes the problem challenging. We first show that this problem is globally identifiable. Then, for the case of noiseless observations, we show that the true cost matrix, and hence the closed-loop system matrices, can be recovered as the unique global optimal solution to a convex optimization problem. Next, for the case of noisy observations, we formulate an estimator as the unique global optimal solution to a modified convex optimization problem. Moreover, the statistical consistency of this estimator is shown. Finally, the performance of the proposed method is demonstrated by a number of numerical examples. (c) 2022 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

引用

页数：12

共 50 条

[21] A unified approach to finite-horizon generalized LQ optimal control problems for discrete-time systems
Ferrante, Augusto
Ntogramatzidis, Lorenzo
LINEAR ALGEBRA AND ITS APPLICATIONS, 2007, 425 (2-3) : 242 - 260
[22] LINEAR-QUADRATIC OPTIMAL CONTROL FOR DISCRETE-TIME STOCHASTIC DESCRIPTOR SYSTEMS
Shu, Yadong
Li, Bo
JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2022, 18 (03) : 1583 - 1602
[23] Discrete-Time Stochastic Linear-Quadratic Optimal Control with Time-Inconsistency
Li, Xun
Ni, Yuan-Hua
Zhang, Ji-Feng
IFAC PAPERSONLINE, 2015, 48 (28): : 691 - 696
[24] Adaptive dynamic programming for finite-horizon optimal control of linear time-varying discrete-time systems
Pang, Bo
Bian, Tao
Jiang, Zhong-Ping
CONTROL THEORY AND TECHNOLOGY, 2019, 17 (01) : 73 - 84
[25] Data-driven Finite-horizon Optimal Control for Linear Time-varying Discrete-time Systems
Pang, Bo
Bian, Tao
Jiang, Zhong-Ping
2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 861 - 866
[26] Adaptive dynamic programming for finite-horizon optimal control of linear time-varying discrete-time systems
Bo Pang
Tao Bian
Zhong-Ping Jiang
Control Theory and Technology, 2019, 17 : 73 - 84
[27] A nested computational approach to the discrete-time finite-horizon LQ control problem
Marro, G
Prattichizzo, D
Zattoni, E
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2003, 42 (03) : 1002 - 1012
[28] Algebraic Approach to Nonlinear Finite-Horizon Optimal Control Problems of Discrete-Time Systems with Terminal Constraints
Iori, Tomoyuki
Kawano, Yu
Ohtsuka, Toshiyuki
2017 56TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2017, : 220 - 225
[29] Statistically Consistent Inverse Optimal Control for Linear-Quadratic Tracking with Random Time Horizon
Zhang, Han
Ringh, Axel
Jiang, Weihan
Li, Shaoyuan
Hu, Xiaoming
2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 1515 - 1522
[30] General multiple linear-quadratic control in discrete-time
Lam, SS
Li, D
PROCEEDINGS OF THE 35TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 1996, : 4170 - 4171

← 1 2 3 4 5 →