The Fundamental Limitations of Learning Linear-Quadratic Regulators

被引:1
|
作者
Lee, Bruce D. [1 ]
Ziemann, Ingvar [1 ]
Tsiamis, Anastasios [2 ]
Sandberg, Henrik [3 ]
Matni, Nikolai [1 ]
机构
[1] Univ Penn, Dept Elect & Syst Engn, Philadelphia, PA 19104 USA
[2] Swiss Fed Inst Technol, Automat Control Lab, Zurich, Switzerland
[3] KTH Royal Inst Technol, Div Decis & Control Syst, Stockholm, Sweden
关键词
ADAPTIVE-CONTROL; IDENTIFICATION;
D O I
10.1109/CDC49753.2023.10383608
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a local minimax lower bound on the excess cost of designing a linear-quadratic controller from offline data. The bound is valid for any offline exploration policy that consists of a stabilizing controller and an energy bounded exploratory input. The derivation leverages a relaxation of the minimax estimation problem to Bayesian estimation, and an application of van Trees inequality. We show that the bound aligns with system-theoretic intuition. In particular, we demonstrate that the lower bound increases when the optimal control objective value increases. We also show that the lower bound increases when the system is poorly excitable, as characterized by the spectrum of the controllability gramian of the system mapping the noise to the state and the H-infinity norm of the system mapping the input to the state. We further show that for some classes of systems, the lower bound may be exponential in the state dimension, demonstrating exponential sample complexity for learning the linear-quadratic regulator.
引用
收藏
页码:4053 / 4060
页数:8
相关论文
共 50 条
  • [21] RISK AND LINEAR-QUADRATIC STABILIZATION
    VANDERPLOEG, F
    ECONOMICS LETTERS, 1984, 15 (1-2) : 73 - 78
  • [22] Logarithmic Regret for Learning Linear Quadratic Regulators Efficiently
    Cassel, Asaf
    Cohen, Alon
    Koren, Tomer
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [23] Logarithmic Regret for Learning Linear Quadratic Regulators Efficiently
    Cassel, Asaf
    Cohen, Alon
    Koren, Tomer
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [24] Inverse Stochastic Optimal Control for Linear-Quadratic Gaussian and Linear-Quadratic Sensorimotor Control Models
    Karg, Philipp
    Stoll, Simon
    Rothfuss, Simon
    Hohmann, Soren
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 2801 - 2808
  • [25] Linear-quadratic optimal control with integral quadratic constraints
    Lim, AEB
    Liu, YQ
    Teo, KL
    Moore, JB
    OPTIMAL CONTROL APPLICATIONS & METHODS, 1999, 20 (02): : 79 - 92
  • [26] Exponential stability assignment of neutral delay-differential systems by a class of linear-quadratic regulators
    Kubo, T
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2004, 35 (03) : 159 - 166
  • [27] Linear-Quadratic regulators for internal boundary control of lane-free automated vehicle traffic
    Malekzadeh, Milad
    Papamichail, Ioannis
    Papageorgiou, Markos
    CONTROL ENGINEERING PRACTICE, 2021, 115
  • [28] Linear-quadratic optimal control with integral quadratic constraints
    Department of Systems Engineering, Res. Sch. of Info. Sci. and Eng., Australian National University, Canberra, ACT 0200, Australia
    不详
    Optim Control Appl Methods, 2 (79-92):
  • [29] Optimal Robust Linear-Quadratic Stabilization
    Balandin, D. V.
    Kogan, M. M.
    DIFFERENTIAL EQUATIONS, 2007, 43 (11) : 1611 - 1615
  • [30] Linear-Quadratic Optimal Actuator Location
    Morris, Kirsten
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2011, 56 (01) : 113 - 124