Linear inverse reinforcement learning in continuous time and space

Cited by: 0
Authors
Kamalapurkar, Rushikesh [1]
Affiliations
[1] Oklahoma State Univ, Sch Mech & Aerosp Engn, Stillwater, OK 74078 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper develops a data-driven inverse reinforcement learning technique for a class of linear systems to estimate the cost function of an agent online, using input-output measurements. A simultaneous state and parameter estimator is utilized to facilitate output-feedback inverse reinforcement learning, and cost function estimation is achieved up to multiplication by a constant.
Pages: 1683-1688
Page count: 6