Logarithmic Regret for Learning Linear Quadratic Regulators Efficiently

被引:0
|
作者
Cassel, Asaf [1 ]
Cohen, Alon [2 ]
Koren, Tomer [1 ]
机构
[1] Tel Aviv Univ, Sch Comp Sci, Tel Aviv, Israel
[2] Google Res, Tel Aviv, Israel
关键词
ADAPTIVE-CONTROL; IDENTIFICATION; PARAMETER;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of learning in Linear Quadratic Control systems whose transition parameters are initially unknown. Recent results in this setting have demonstrated efficient learning algorithms with regret growing with the square root of the number of decision steps. We present new efficient algorithms that achieve, perhaps surprisingly, regret that scales only (poly)logarithmically with the number of steps in two scenarios: when only the state transition matrix A is unknown, and when only the state-action transition matrix B is unknown and the optimal policy satisfies a certain non-degeneracy condition. On the other hand, we give a lower bound that shows that when the latter condition is violated, square root regret is unavoidable.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] On a Phase Transition of Regret in Linear Quadratic Control: The Memoryless Case
    Ziemann, Ingvar
    Sandberg, Henrik
    IEEE CONTROL SYSTEMS LETTERS, 2021, 5 (02): : 695 - 700
  • [42] PERFORMANCE OF DIGITAL LINEAR REGULATORS WHICH USE LOGARITHMIC ARITHMETIC
    LAMAIRE, RO
    LANG, JH
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1986, 31 (05) : 394 - 400
  • [43] SAdaBoundNc: an adaptive subgradient online learning algorithm with logarithmic regret bounds
    Lin Wang
    Xin Wang
    Tao Li
    Ruijuan Zheng
    Junlong Zhu
    Mingchuan Zhang
    Neural Computing and Applications, 2023, 35 : 8051 - 8063
  • [44] SAdaBoundNc: an adaptive subgradient online learning algorithm with logarithmic regret bounds
    Wang, Lin
    Wang, Xin
    Li, Tao
    Zheng, Ruijuan
    Zhu, Junlong
    Zhang, Mingchuan
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (11): : 8051 - 8063
  • [45] Logarithmic Regret for Online Control
    Agarwal, Naman
    Hazan, Elad
    Singh, Karan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [46] Performance Analysis of a Class of Linear Quadratic Regulators for Switched Linear Systems
    Antunes, D.
    Heemels, W. P. M. H.
    2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 5475 - 5480
  • [47] A revisit to the gain and phase margins of linear quadratic regulators
    Holmberg, U
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2001, 46 (09) : 1508 - 1509
  • [48] Risk-Constrained Linear-Quadratic Regulators
    Tsiamis, Anastasios
    Kalogerias, Dionysios S.
    Chamon, Luiz F. O.
    Ribeiro, Alejandro
    Pappas, George J.
    2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 3040 - 3047
  • [49] Decay rate estimations for linear quadratic optimal regulators
    Estevez, Daniel
    Yakubovich, Dmitry V.
    LINEAR ALGEBRA AND ITS APPLICATIONS, 2013, 439 (11) : 3332 - 3358
  • [50] LINEAR QUADRATIC REGULATORS WITH EIGENVALUE PLACEMENT IN A SPECIFIED REGION
    SHIEH, LS
    DIB, HM
    GANESAN, S
    AUTOMATICA, 1988, 24 (06) : 819 - 823