A Data-Driven Policy Iteration Scheme based on Linear Programming

被引:0
|
作者
Banjac, Goran [1 ]
Lygeros, John [1 ]
机构
[1] Swiss Fed Inst Technol, Automat Control Lab, Zurich, Switzerland
基金
欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of learning discounted cost optimal control policies for unknown deterministic discrete time systems with continuous state and action spaces. We show that a policy evaluation step of the well-known policy iteration (PI) algorithm can be characterized as a solution to an infinite dimensional linear program (LP). However, when approximating such an LP with a finite dimensional program, the PI algorithm loses its nominal properties. We propose a data-driven PI scheme that ensures a certain monotonic behavior and allows for incorporation of expert knowledge on the system. A numerical example illustrates effectiveness of the proposed algorithm.
引用
下载
收藏
页码:816 / 821
页数:6
相关论文
共 50 条
  • [31] Data-Driven Computational Intelligence for Scientific Programming
    Rubio-Largo, Alvaro
    Carlos Preciado, Juan
    Iribarne, Luis
    SCIENTIFIC PROGRAMMING, 2019, 2019
  • [32] Towards the Creation of a Data-Driven Programming Tutor
    Mostafavi, Behrooz
    Barnes, Tiffany
    INTELLIGENT TUTORING SYSTEMS, PART II, 2010, 6095 : 239 - 241
  • [33] Linear Data-Driven Economic MPC with
    Xie, Yifan
    Berberich, Julian
    Allgoewer, Frank
    IFAC PAPERSONLINE, 2023, 56 (02): : 5512 - 5517
  • [34] A data-driven monitoring scheme for multivariate multimodal data
    Wang, Zhiqiong
    Gong, Renping
    Song, Lisha
    He, Shuguang
    Gao, Yuan
    COMPUTERS & INDUSTRIAL ENGINEERING, 2024, 192
  • [35] Distributed Data-Driven Power Iteration for Strongly Connected Networks
    Gusrialdi, Azwirman
    Qu, Zhihua
    2021 EUROPEAN CONTROL CONFERENCE (ECC), 2021, : 87 - 92
  • [36] Data-Driven Policy Making: The Policy Lab Approach
    van Veenstra, Anne Fleur
    Kotterink, Bas
    ELECTRONIC PARTICIPATION (EPART 2017), 2017, 10429 : 100 - 111
  • [37] A Data-Driven Passive Islanding Detection Scheme
    De, Sourav
    Reddy, Motakatla Venkateswara
    Sodhi, Ranjana
    IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2024, 60 (02) : 3698 - 3709
  • [38] A Data-Driven Passive Islanding Detection Scheme
    Reddy, M. Venkateswara
    De, Sourav
    Sodhi, Ranjana
    2022 IEEE 10TH POWER INDIA INTERNATIONAL CONFERENCE, PIICON, 2022,
  • [39] PERFORMANCE OF THE EFFICIENT DATA-DRIVEN EVALUATION SCHEME
    JOHNSON, D
    BERMAN, F
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1993, 18 (03) : 340 - 346
  • [40] A Data-Driven Scheme for Quantitative Analysis of Texture
    Yafei Wang
    Chenfan Yu
    Leilei Xing
    Kailun Li
    Jinhan Chen
    Wei Liu
    Jing Ma
    Zhijian Shen
    Metallurgical and Materials Transactions A, 2020, 51 : 940 - 950