A Data-Driven Policy Iteration Scheme based on Linear Programming

被引:0
|
作者
Banjac, Goran [1 ]
Lygeros, John [1 ]
机构
[1] Swiss Fed Inst Technol, Automat Control Lab, Zurich, Switzerland
基金
欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of learning discounted cost optimal control policies for unknown deterministic discrete time systems with continuous state and action spaces. We show that a policy evaluation step of the well-known policy iteration (PI) algorithm can be characterized as a solution to an infinite dimensional linear program (LP). However, when approximating such an LP with a finite dimensional program, the PI algorithm loses its nominal properties. We propose a data-driven PI scheme that ensures a certain monotonic behavior and allows for incorporation of expert knowledge on the system. A numerical example illustrates effectiveness of the proposed algorithm.
引用
收藏
页码:816 / 821
页数:6
相关论文
共 50 条
  • [1] On-Policy Data-Driven Linear Quadratic Regulator via Combined Policy Iteration and Recursive Least Squares
    Sforni, Lorenzo
    Carnevale, Guido
    Notarnicola, Ivano
    Notarstefano, Giuseppe
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 5047 - 5052
  • [2] Data-driven approximate dynamic programming: A linear programming approach
    Sutter, Tobias
    Kamoutsi, Angeliki
    Esfahani, Peyman Mohajerin
    Lygeros, John
    [J]. 2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
  • [3] Data-Driven Policy Iteration for Nonlinear Optimal Control Problems
    Possieri, Corrado
    Sassano, Mario
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 7365 - 7376
  • [4] Data-Driven Structured Policy Iteration for Homogeneous Distributed Systems
    Alemzadeh, Siavash
    Talebi, Shahriar
    Mesbahi, Mehran
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (09) : 5979 - 5994
  • [5] The role of identification in data-driven policy iteration: A system theoretic study
    Song, Bowen
    Iannelli, Andrea
    [J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024,
  • [6] Data-Driven Control of Positive Linear Systems using Linear Programming
    Miller, Jared
    Dai, Tianyu
    Sznaier, Mario
    Shafai, Bahram
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 1588 - 1594
  • [7] Data-Driven Control of Unknown Systems: A Linear Programming Approach
    Tanzanakis, Alexandros
    Lygeros, John
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 7 - 13
  • [8] Data-driven Linear Quadratic Regulation via Semidefinite Programming
    Rotulo, Monica
    De Persis, Claudio
    Tesi, Pietro
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 3995 - 4000
  • [9] Data-driven policy
    Lansky, David
    [J]. ISSUES IN SCIENCE AND TECHNOLOGY, 2007, 24 (01) : 11 - 14
  • [10] Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design
    Bian, Tao
    Jiang, Zhong-Ping
    [J]. AUTOMATICA, 2016, 71 : 348 - 360