Perspectives of approximate dynamic programming

Cited by: 53
Author
Powell, Warren B. [1]
Affiliation
[1] Princeton Univ, Dept Operat Res & Financial Engn, Princeton, NJ 08544 USA
Keywords
LEARNING CONTROL; SELECTION; ALGORITHMS; NETWORKS;
DOI
10.1007/s10479-012-1077-6
Chinese Library Classification number
C93 [Management Science]; O22 [Operations Research];
Subject classification code
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
Abstract
Approximate dynamic programming has evolved, initially independently, within operations research, computer science and the engineering controls community, all searching for practical tools for solving sequential stochastic optimization problems. More so than other communities, operations research continued to develop the theory behind the basic model introduced by Bellman with discrete states and actions, even while authors as early as Bellman himself recognized its limits due to the "curse of dimensionality" inherent in discrete state spaces. In response to these limitations, subcommunities in computer science, control theory and operations research have developed a variety of methods for solving different classes of stochastic, dynamic optimization problems, creating the appearance of a jungle of competing approaches. In this article, we show that there is actually a common theme to these strategies, and that underpinning the entire field remain the fundamental algorithmic strategies of value and policy iteration that were first introduced in the 1950s and 60s.
Pages: 319-356
Page count: 38
Related papers
50 in total
  • [1] A Survey of Approximate Dynamic Programming
    Wang Lin
    Peng Hui
    Zhu Hua-yong
    Shen Lin-cheng
    [J]. 2009 INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS, VOL 2, PROCEEDINGS, 2009, : 396 - 399
  • [2] A LINEAR PROGRAMMING METHODOLOGY FOR APPROXIMATE DYNAMIC PROGRAMMING
    Diaz, Henry
    Sala, Antonio
    Armesto, Leopoldo
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2020, 30 (02) : 363 - 375
  • [3] The linear programming approach to approximate dynamic programming
    De Farias, DP
    Van Roy, B
    [J]. OPERATIONS RESEARCH, 2003, 51 (06) : 850 - 865
  • [4] Approximate dynamic programming via linear programming
    de Farias, DP
    Van Roy, B
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 689 - 695
  • [5] Approximate Dynamic Programming via Sum of Squares Programming
    Summers, Tyler H.
    Kunz, Konstantin
    Kariotoglou, Nikolaos
    Kamgarpour, Maryam
    Summers, Sean
    Lygeros, John
    [J]. 2013 EUROPEAN CONTROL CONFERENCE (ECC), 2013, : 191 - 197
  • [6] Approximate dynamic programming with a fuzzy parameterization
    Busoniu, Lucian
    Ernst, Damien
    De Schutter, Bart
    Babuska, Robert
    [J]. AUTOMATICA, 2010, 46 (05) : 804 - 814
  • [7] Bayesian Exploration for Approximate Dynamic Programming
    Ryzhov, Ilya O.
    Mes, Martijn R. K.
    Powell, Warren B.
    van den Berg, Gerald
    [J]. OPERATIONS RESEARCH, 2019, 67 (01) : 198 - 214
  • [8] Approximate dynamic programming for stochastic reachability
    Kariotoglou, Nikolaos
    Summers, Sean
    Summers, Tyler
    Kamgarpour, Maryam
    Lygeros, John
    [J]. 2013 EUROPEAN CONTROL CONFERENCE (ECC), 2013, : 584 - 589
  • [9] Approximate dynamic programming for container stacking
    Boschma, Rene
    Mes, Martijn R. K.
    de Vries, Leon R.
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2023, 310 (01) : 328 - 342
  • [10] Dynamic Programming for Approximate Expansion Algorithm
    Veksler, Olga
    [J]. COMPUTER VISION - ECCV 2012, PT III, 2012, 7574 : 850 - 863