The Fundamental Limitations of Learning Linear-Quadratic Regulators

被引:1
|
作者
Lee, Bruce D. [1 ]
Ziemann, Ingvar [1 ]
Tsiamis, Anastasios [2 ]
Sandberg, Henrik [3 ]
Matni, Nikolai [1 ]
机构
[1] Univ Penn, Dept Elect & Syst Engn, Philadelphia, PA 19104 USA
[2] Swiss Fed Inst Technol, Automat Control Lab, Zurich, Switzerland
[3] KTH Royal Inst Technol, Div Decis & Control Syst, Stockholm, Sweden
关键词
ADAPTIVE-CONTROL; IDENTIFICATION;
D O I
10.1109/CDC49753.2023.10383608
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a local minimax lower bound on the excess cost of designing a linear-quadratic controller from offline data. The bound is valid for any offline exploration policy that consists of a stabilizing controller and an energy bounded exploratory input. The derivation leverages a relaxation of the minimax estimation problem to Bayesian estimation, and an application of van Trees inequality. We show that the bound aligns with system-theoretic intuition. In particular, we demonstrate that the lower bound increases when the optimal control objective value increases. We also show that the lower bound increases when the system is poorly excitable, as characterized by the spectrum of the controllability gramian of the system mapping the noise to the state and the H-infinity norm of the system mapping the input to the state. We further show that for some classes of systems, the lower bound may be exponential in the state dimension, demonstrating exponential sample complexity for learning the linear-quadratic regulator.
引用
收藏
页码:4053 / 4060
页数:8
相关论文
共 50 条
  • [1] On adaptive Linear-Quadratic regulators
    Faradonbeh, Mohamad Kazem Shirani
    Tewari, Ambuj
    Michailidis, George
    AUTOMATICA, 2020, 117
  • [2] Learning Linear-Quadratic Regulators Efficiently with only √T Regret
    Cohen, Alon
    Koren, Tomer
    Mansour, Yishay
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [3] The composition synthesis of linear-quadratic regulators
    Dubovik, Sergey A.
    Journal of Automation and Information Sciences, 1999, 31 (7-9): : 33 - 42
  • [4] A CLASS OF LINEAR-QUADRATIC DISCRETE REGULATORS
    MAHMOUD, MS
    BAHNASAWI, AA
    JOURNAL OF THE UNIVERSITY OF KUWAIT-SCIENCE, 1993, 20 (02): : 227 - 235
  • [5] The composition synthesis of linear-quadratic regulators
    Dubovik, S.A.
    Problemy Upravleniya I Informatiki (Avtomatika), 1999, (02): : 50 - 62
  • [6] Risk-Constrained Linear-Quadratic Regulators
    Tsiamis, Anastasios
    Kalogerias, Dionysios S.
    Chamon, Luiz F. O.
    Ribeiro, Alejandro
    Pappas, George J.
    2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 3040 - 3047
  • [7] Universal regulators in linear-quadratic problem of optimal tracking
    Yakubovich, VA
    DOKLADY AKADEMII NAUK, 1996, 348 (03) : 313 - 317
  • [9] Efficient Optimistic Exploration in Linear-Quadratic Regulators via Lagrangian Relaxation
    Abeille, Marc
    Lazaric, Alessandro
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [10] PROPORTIONAL-INTEGRAL DERIVATIVE REALIZATION OF OPTIMAL LINEAR-QUADRATIC REGULATORS
    CALOVIC, MS
    CUK, NM
    PROCEEDINGS OF THE INSTITUTION OF ELECTRICAL ENGINEERS-LONDON, 1974, 121 (11): : 1441 - 1443