Logarithmic Regret for Learning Linear Quadratic Regulators Efficiently

被引:0
|
作者
Cassel, Asaf [1 ]
Cohen, Alon [2 ]
Koren, Tomer [1 ]
机构
[1] Tel Aviv Univ, Sch Comp Sci, Tel Aviv, Israel
[2] Google Res, Tel Aviv, Israel
关键词
ADAPTIVE-CONTROL; IDENTIFICATION; PARAMETER;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of learning in Linear Quadratic Control systems whose transition parameters are initially unknown. Recent results in this setting have demonstrated efficient learning algorithms with regret growing with the square root of the number of decision steps. We present new efficient algorithms that achieve, perhaps surprisingly, regret that scales only (poly)logarithmically with the number of steps in two scenarios: when only the state transition matrix A is unknown, and when only the state-action transition matrix B is unknown and the optimal policy satisfies a certain non-degeneracy condition. On the other hand, we give a lower bound that shows that when the latter condition is violated, square root regret is unavoidable.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Probabilistic robust design with linear quadratic regulators
    Polyak, BT
    Tempo, R
    PROCEEDINGS OF THE 39TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2000, : 1037 - 1042
  • [32] Remarks on Stability Robustness of Linear Quadratic Regulators
    Kim, Yoonsoo
    2017 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2017, : 452 - 453
  • [33] Dynamic Gain Adaptation in Linear Quadratic Regulators
    Komaee, Arash
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (08) : 5094 - 5108
  • [34] CHEAP AND SINGULAR CONTROLS FOR LINEAR QUADRATIC REGULATORS
    SABERI, A
    SANNUTI, P
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1987, 32 (03) : 208 - 219
  • [35] Role of uncertainty in stochastic linear quadratic regulators
    Zhou, XY
    PROCEEDINGS OF THE 36TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 1997, : 1094 - 1099
  • [36] GENERALIZED ROBUSTNESS OF OPTIMALITY OF LINEAR QUADRATIC REGULATORS
    SUGIMOTO, K
    YAMAMOTO, Y
    INTERNATIONAL JOURNAL OF CONTROL, 1990, 51 (03) : 521 - 533
  • [37] Probabilistic robust design with linear quadratic regulators
    Polyak, BT
    Tempo, R
    SYSTEMS & CONTROL LETTERS, 2001, 43 (05) : 343 - 353
  • [38] Probabilistic robust design with linear quadratic regulators
    Institute for Control Science, Russian Academy of Sciences, Profsojuznaja 65, Moscow 117806
    不详
    Systems and Control Letters, 2001, 43 (05): : 343 - 353
  • [39] DESIGN OF LINEAR QUADRATIC REGULATORS WITH ASSIGNED EIGENSTRUCTURE
    EASTMAN, WL
    BOSSI, JA
    INTERNATIONAL JOURNAL OF CONTROL, 1984, 39 (04) : 731 - 742
  • [40] Regret Bounds for Robust Adaptive Control of the Linear Quadratic Regulator
    Dean, Sarah
    Mania, Horia
    Matni, Nikolai
    Recht, Benjamin
    Tu, Stephen
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31