Data-driven approximate dynamic programming: A linear programming approach

被引:0
|
作者
Sutter, Tobias [1 ]
Kamoutsi, Angeliki
Esfahani, Peyman Mohajerin
Lygeros, John
机构
[1] Swiss Fed Inst Technol, Automat Control Lab, Zurich, Switzerland
关键词
MARKOV DECISION-PROCESSES;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article presents an approximation scheme for the infinite-dimensional linear programming formulation of discrete-time Markov control processes via a finite-dimensional convex program, when the dynamics are unknown and learned from data. We derive a probabilistic explicit error bound between the data-driven finite convex program and the original infinite linear program. We further discuss the sample complexity of the error bound which translates to the number of samples required for an a priori approximation accuracy. Our analysis sheds light on the impact of the choice of basis functions for approximating the true value function. Finally, the relevance of the method is illustrated on a truncated LQG problem.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] The linear programming approach to approximate dynamic programming
    De Farias, DP
    Van Roy, B
    [J]. OPERATIONS RESEARCH, 2003, 51 (06) : 850 - 865
  • [2] On constraint sampling in the linear programming approach to approximate dynamic programming
    de Farias, DP
    Van Roy, B
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2004, 29 (03) : 462 - 478
  • [3] Data-Driven Control of Unknown Systems: A Linear Programming Approach
    Tanzanakis, Alexandros
    Lygeros, John
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 7 - 13
  • [4] Data-Driven Optimal Tracking with Constrained Approximate Dynamic Programming for Servomotor Systems
    Chakrabarty, Ankush
    Danielson, Claus
    Wang, Yebin
    [J]. 2020 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS (CCTA), 2020, : 352 - 357
  • [5] State Aggregation based Linear Programming approach to Approximate Dynamic Programming
    Darbha, S.
    Krishnamoorthy, K.
    Pachter, M.
    Chandler, P.
    [J]. 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 935 - 941
  • [6] Choosing the Cost Vector of the Linear Programming Approach to Approximate Dynamic Programming
    de Farias, Daniela Pucci
    Weber, Theophane
    [J]. 47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 67 - 72
  • [7] A LINEAR PROGRAMMING METHODOLOGY FOR APPROXIMATE DYNAMIC PROGRAMMING
    Diaz, Henry
    Sala, Antonio
    Armesto, Leopoldo
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2020, 30 (02) : 363 - 375
  • [8] Approximate dynamic programming via linear programming
    de Farias, DP
    Van Roy, B
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 689 - 695
  • [9] A data-driven approximate dynamic programming approach based on association rule learning: Spacecraft autonomy as a case study
    D'Angelo, Gianni
    Tipaldi, Massimo
    Palmieri, Francesco
    Glielmo, Luigi
    [J]. INFORMATION SCIENCES, 2019, 504 : 501 - 519
  • [10] On constraint sampling in the linear programming approach to approximate linear programming
    de Farias, DP
    Van Roy, B
    [J]. 42ND IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-6, PROCEEDINGS, 2003, : 2441 - 2446