The Fundamental Limitations of Learning Linear-Quadratic Regulators

Cited by: 1
Authors
Lee, Bruce D. [1]
Ziemann, Ingvar [1]
Tsiamis, Anastasios [2]
Sandberg, Henrik [3]
Matni, Nikolai [1]
Affiliations
[1] Univ Penn, Dept Elect & Syst Engn, Philadelphia, PA 19104 USA
[2] Swiss Fed Inst Technol, Automat Control Lab, Zurich, Switzerland
[3] KTH Royal Inst Technol, Div Decis & Control Syst, Stockholm, Sweden
Keywords
ADAPTIVE-CONTROL; IDENTIFICATION;
DOI
10.1109/CDC49753.2023.10383608
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
We present a local minimax lower bound on the excess cost of designing a linear-quadratic controller from offline data. The bound is valid for any offline exploration policy that consists of a stabilizing controller and an energy-bounded exploratory input. The derivation relaxes the minimax estimation problem to a Bayesian estimation problem and then applies the van Trees inequality. We show that the bound aligns with system-theoretic intuition. In particular, we demonstrate that the lower bound increases when the optimal control objective value increases. We also show that the lower bound increases when the system is poorly excitable, as characterized by the spectrum of the controllability Gramian of the system mapping the noise to the state and by the H-infinity norm of the system mapping the input to the state. We further show that for some classes of systems, the lower bound may be exponential in the state dimension, demonstrating exponential sample complexity for learning the linear-quadratic regulator.
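To make the quantities named in the abstract concrete, the following minimal sketch computes the optimal control objective value and the noise-to-state controllability Gramian for a scalar system. It is an illustration only and does not reproduce the paper's lower bound. The system x_{t+1} = a x_t + b u_t + w_t, the cost weights q and r, the unit noise variance, the numerical values, and the function names are all assumptions introduced here.

```python
def scalar_lqr(a, b, q, r, iters=10_000, tol=1e-12):
    """Solve the scalar discrete-time Riccati equation by fixed-point iteration.

    Hypothetical helper: returns the Riccati solution p and the gain k of the
    optimal feedback law u = -k * x for cost sum(q*x^2 + r*u^2).
    """
    p = q
    for _ in range(iters):
        p_next = q + a * a * p - (a * b * p) ** 2 / (r + b * b * p)
        converged = abs(p_next - p) < tol
        p = p_next
        if converged:
            break
    k = a * b * p / (r + b * b * p)
    return p, k


def noise_to_state_gramian(a_cl, noise_var=1.0):
    """Scalar controllability Gramian of the noise-to-state map.

    For x_{t+1} = a_cl * x_t + w_t with |a_cl| < 1 this is the sum over t of
    a_cl^(2t) * noise_var, i.e. the steady-state state variance.
    """
    if abs(a_cl) >= 1.0:
        raise ValueError("closed loop must be stable")
    return noise_var / (1.0 - a_cl * a_cl)


# Assumed example values: an open-loop unstable scalar plant.
a, b, q, r = 1.2, 0.5, 1.0, 1.0
p, k = scalar_lqr(a, b, q, r)
a_cl = a - b * k                      # closed-loop dynamics under u = -k x
gram = noise_to_state_gramian(a_cl)   # excitation of the state by the noise
optimal_cost = p * 1.0                # average LQR cost with unit noise variance

print(f"p={p:.4f}, k={k:.4f}, a_cl={a_cl:.4f}, gramian={gram:.4f}")
```

In the matrix case the same quantities would come from a discrete algebraic Riccati equation and a discrete Lyapunov equation; the scalar form is used here only so the sketch needs no external libraries.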
Pages: 4053-4060
Number of pages: 8