Linear inverse reinforcement learning in continuous time and space

被引:0
|
作者
Kamalapurkar, Rushikesh [1 ]
机构
[1] Oklahoma State Univ, Sch Mech & Aerosp Engn, Stillwater, OK 74078 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper develops a data-driven inverse reinforcement learning technique for a class of linear systems to estimate the cost function of an agent online, using input-output measurements. A simultaneous state and parameter estimator is utilized to facilitate output-feedback inverse reinforcement learning, and cost function estimation is achieved up to multiplication by a constant.
引用
收藏
页码:1683 / 1688
页数:6
相关论文
共 50 条
  • [1] Reinforcement learning in continuous time and space
    Doya, K
    [J]. NEURAL COMPUTATION, 2000, 12 (01) : 219 - 245
  • [2] Inverse Reinforcement Learning in a Continuous State Space with Formal Guarantees
    Dexter, Gregory
    Bello, Kevin
    Honorio, Jean
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [3] Barycentric interpolators for continuous space & time reinforcement learning
    Munos, R
    Moore, A
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 11, 1999, 11 : 1024 - 1030
  • [4] Reinforcement Learning in Continuous Time and Space: A Stochastic Control Approach
    Wang, Haoran
    Zariphopoulou, Thaleia
    Zhou, Xun Yu
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [5] Reinforcement Learning for Linear Continuous-time Systems: an Incremental Learning Approach
    Tao Bian
    Zhong-Ping Jiang
    [J]. IEEE/CAA Journal of Automatica Sinica, 2019, 6 (02) : 433 - 440
  • [6] Reinforcement Learning for Linear Continuous-time Systems: an Incremental Learning Approach
    Bian, Tao
    Jiang, Zhong-Ping
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (02) : 433 - 440
  • [7] Robust Optimal Control of Continuous Time Linear System using Reinforcement Learning
    Sami, Abdul
    Memon, Attaullah Y.
    [J]. 2018 AUSTRALIAN & NEW ZEALAND CONTROL CONFERENCE (ANZCC), 2018, : 154 - 159
  • [8] Budgeted Reinforcement Learning in Continuous State Space
    Carrara, Nicolas
    Leurent, Edouard
    Laroche, Romain
    Urvoy, Tanguy
    Maillard, Odalric-Ambrym
    Pietquin, Olivier
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [9] Switching reinforcement learning for continuous action space
    Nagayoshi, Masato
    Murao, Hajime
    Tamaki, Hisashi
    [J]. ELECTRONICS AND COMMUNICATIONS IN JAPAN, 2012, 95 (03) : 37 - 44
  • [10] On Applications of Bootstrap in Continuous Space Reinforcement Learning
    Faradonbeh, Mohamad Kazem Shirani
    Tewari, Ambuj
    Michailidis, George
    [J]. 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 1977 - 1984