Efficient sampling in approximate dynamic programming algorithms

被引:21
|
作者
Cervellera, Cristiano [1 ]
Muselli, Marco [1 ]
机构
[1] Ist Studi Sistemi Intelligenti Lautomaz, Consiglio Nazl Ric, I-16149 Genoa, Italy
关键词
stochastic optimal control problem; dynamic programming; sample complexity; deterministic learning; low-discrepancy sequences;
D O I
10.1007/s10589-007-9054-8
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Dynamic Programming (DP) is known to be a standard optimization tool for solving Stochastic Optimal Control (SOC) problems, either over a finite or an infinite horizon of stages. Under very general assumptions, commonly employed numerical algorithms are based on approximations of the cost-to-go functions, by means of suitable parametric models built from a set of sampling points in the d-dimensional state space. Here the problem of sample complexity, i.e., how "fast" the number of points must grow with the input dimension in order to have an accurate estimate of the cost-to-go functions in typical DP approaches such as value iteration and policy iteration, is discussed. It is shown that a choice of the sampling based on low-discrepancy sequences, commonly used for efficient numerical integration, permits to achieve, under suitable hypotheses, an almost linear sample complexity, thus contributing to mitigate the curse of dimensionality of the approximate DP procedure.
引用
收藏
页码:417 / 443
页数:27
相关论文
共 50 条
  • [21] Efficient Parallelization Using Rank Convergence in Dynamic Programming Algorithms
    Maleki, Saeed
    Musuvathi, Madanlal
    Mytkowicz, Todd
    COMMUNICATIONS OF THE ACM, 2016, 59 (10) : 85 - 92
  • [22] The design of efficient dynamic programming and transfer matrix enumeration algorithms
    Conway, Andrew R.
    JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL, 2017, 50 (35)
  • [23] Perspectives of approximate dynamic programming
    Powell, Warren B.
    ANNALS OF OPERATIONS RESEARCH, 2016, 241 (1-2) : 319 - 356
  • [24] A Survey of Approximate Dynamic Programming
    Wang Lin
    Peng Hui
    Zhu Hua-yong
    Shen Lin-cheng
    2009 INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS, VOL 2, PROCEEDINGS, 2009, : 396 - 399
  • [25] A LINEAR PROGRAMMING METHODOLOGY FOR APPROXIMATE DYNAMIC PROGRAMMING
    Diaz, Henry
    Sala, Antonio
    Armesto, Leopoldo
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2020, 30 (02) : 363 - 375
  • [26] The linear programming approach to approximate dynamic programming
    De Farias, DP
    Van Roy, B
    OPERATIONS RESEARCH, 2003, 51 (06) : 850 - 865
  • [27] Approximate dynamic programming via linear programming
    de Farias, DP
    Van Roy, B
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 689 - 695
  • [28] On constraint sampling in the linear programming approach to approximate linear programming
    de Farias, DP
    Van Roy, B
    42ND IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-6, PROCEEDINGS, 2003, : 2441 - 2446
  • [29] Dynamic Programming Guided Exploration for Sampling-based Motion Planning Algorithms
    Arslan, Oktay
    Tsiotras, Panagiotis
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 4819 - 4826
  • [30] Approximate dynamic programming for an energy-efficient parallel machine scheduling problem
    Heydar, Mojtaba
    Mardaneh, Elham
    Loxton, Ryan
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2022, 302 (01) : 363 - 380