Data-driven approximate dynamic programming: A linear programming approach

被引：0

作者：

Sutter, Tobias ^{[1
]}

Kamoutsi, Angeliki

Esfahani, Peyman Mohajerin

Lygeros, John

机构：

[1] Swiss Fed Inst Technol, Automat Control Lab, Zurich, Switzerland

来源：

2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC) | 2017年

关键词：

MARKOV DECISION-PROCESSES;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article presents an approximation scheme for the infinite-dimensional linear programming formulation of discrete-time Markov control processes via a finite-dimensional convex program, when the dynamics are unknown and learned from data. We derive a probabilistic explicit error bound between the data-driven finite convex program and the original infinite linear program. We further discuss the sample complexity of the error bound which translates to the number of samples required for an a priori approximation accuracy. Our analysis sheds light on the impact of the choice of basis functions for approximating the true value function. Finally, the relevance of the method is illustrated on a truncated LQG problem.

引用

页数：6

共 50 条

[1] The linear programming approach to approximate dynamic programming
De Farias, DP
Van Roy, B
[J]. OPERATIONS RESEARCH, 2003, 51 (06) : 850 - 865
[2] On constraint sampling in the linear programming approach to approximate dynamic programming
de Farias, DP
Van Roy, B
[J]. MATHEMATICS OF OPERATIONS RESEARCH, 2004, 29 (03) : 462 - 478
[3] Data-Driven Control of Unknown Systems: A Linear Programming Approach
Tanzanakis, Alexandros
Lygeros, John
[J]. IFAC PAPERSONLINE, 2020, 53 (02): : 7 - 13
[4] Data-Driven Optimal Tracking with Constrained Approximate Dynamic Programming for Servomotor Systems
Chakrabarty, Ankush
Danielson, Claus
Wang, Yebin
[J]. 2020 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS (CCTA), 2020, : 352 - 357
[5] State Aggregation based Linear Programming approach to Approximate Dynamic Programming
Darbha, S.
Krishnamoorthy, K.
Pachter, M.
Chandler, P.
[J]. 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 935 - 941
[6] Choosing the Cost Vector of the Linear Programming Approach to Approximate Dynamic Programming
de Farias, Daniela Pucci
Weber, Theophane
[J]. 47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 67 - 72
[7] A LINEAR PROGRAMMING METHODOLOGY FOR APPROXIMATE DYNAMIC PROGRAMMING
Diaz, Henry
Sala, Antonio
Armesto, Leopoldo
[J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2020, 30 (02) : 363 - 375
[8] Approximate dynamic programming via linear programming
de Farias, DP
Van Roy, B
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 689 - 695
[9] A data-driven approximate dynamic programming approach based on association rule learning: Spacecraft autonomy as a case study
D'Angelo, Gianni
Tipaldi, Massimo
Palmieri, Francesco
Glielmo, Luigi
[J]. INFORMATION SCIENCES, 2019, 504 : 501 - 519
[10] On constraint sampling in the linear programming approach to approximate linear programming
de Farias, DP
Van Roy, B
[J]. 42ND IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-6, PROCEEDINGS, 2003, : 2441 - 2446

← 1 2 3 4 5 →