Linear inverse reinforcement learning in continuous time and space

被引：0

作者：

Kamalapurkar, Rushikesh ^{[1
]}

机构：

[1] Oklahoma State Univ, Sch Mech & Aerosp Engn, Stillwater, OK 74078 USA

来源：

2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC) | 2018年

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper develops a data-driven inverse reinforcement learning technique for a class of linear systems to estimate the cost function of an agent online, using input-output measurements. A simultaneous state and parameter estimator is utilized to facilitate output-feedback inverse reinforcement learning, and cost function estimation is achieved up to multiplication by a constant.

引用

页码：1683 / 1688

页数：6

共 50 条

[31] Reinforcement Learning For Sepsis Treatment: A Continuous Action Space Solution
Huang, Yong
Cao, Rui
Rahmani, Amir
[J]. MACHINE LEARNING FOR HEALTHCARE CONFERENCE, VOL 182, 2022, 182 : 631 - 647
[32] Online reinforcement learning for a continuous space system with experimental validation
Dogru, Oguzhan
Wieczorek, Nathan
Velswamy, Kirubakaran
Ibrahim, Fadi
Huang, Biao
[J]. JOURNAL OF PROCESS CONTROL, 2021, 104 : 86 - 100
[33] Relevance Vector Sampling for Reinforcement Learning in Continuous Action Space
Lee, Minwoo
Anderson, Charles W.
[J]. 2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), 2016, : 774 - 779
[34] CHOQUET REGULARIZATION FOR CONTINUOUS-TIME REINFORCEMENT LEARNING
Han, Xia
Wang, Ruodu
Zhou, Xun Yu
[J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2023, 61 (05) : 2777 - 2801
[35] Online Solution to the Linear Quadratic Tracking Problem of Continuous-time Systems using Reinforcement Learning
Modares, Hamidreza
Lewis, Frank L.
[J]. 2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 3851 - 3856
[36] Continuous Deep Maximum Entropy Inverse Reinforcement Learning using online POMDP
Silva, Junior A. R.
Grassi Jr, Valdir
Wolf, Denis Fernando
[J]. 2019 19TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2019, : 382 - 387
[37] Maximum Entropy Inverse Reinforcement Learning in Continuous State Spaces with Path Integrals
Aghasadeghi, Navid
Bretl, Timothy
[J]. 2011 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2011, : 1561 - 1566
[38] Temporal difference learning in continuous time and space
Doya, K
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 8: PROCEEDINGS OF THE 1995 CONFERENCE, 1996, 8 : 1073 - 1079
[39] Repeated Inverse Reinforcement Learning
Amin, Kareem
Jiang, Nan
Singh, Satinder
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[40] Cooperative Inverse Reinforcement Learning
Hadfield-Menell, Dylan
Dragan, Anca
Abbeel, Pieter
Russell, Stuart
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29

← 1 2 3 4 5 →