An application of convex optimization concepts to approximate dynamic programming

被引:1
|
作者
Arruda, Edilson F. [1 ]
Fragoso, Marcelo D. [1 ]
do Val, Joao Bosco R. [2 ]
机构
[1] Natl Lab Sci Computat, Dept Syst & Control, Petropolis, RJ, Brazil
[2] Univ Estadual Campinas, Sch Elect & Comp Engn, Dept Telemat, Campinas, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
D O I
10.1109/ACC.2008.4587159
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper deals with approximate value iteration (AVI) algorithms applied to discounted dynamic (DP) programming problems. The so-called Bellman residual is shown to be convex in the Banach space of candidate solutions to the DIP problem. This fact motivates the introduction of an AVI algorithm with local search that seeks an approximate solution in a lower dimensional space called approximation architecture. The optimality of a point in the approximation architecture is characterized by means of convex optimization concepts and necessary and sufficient conditions to global optimality are derived. To illustrate the method, two examples are presented which were previously explored in the literature.
引用
收藏
页码:4238 / +
页数:3
相关论文
共 50 条
  • [1] Dynamic Programming in Convex Stochastic Optimization
    Pennanen, Teemu
    Perkkioe, Ari-Pekka
    [J]. JOURNAL OF CONVEX ANALYSIS, 2023, 30 (04) : 1241 - 1283
  • [2] An approximate dynamic programming approach to convex quadratic knapsack problems
    Hua, ZS
    Zhang, B
    Liang, L
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2006, 33 (03) : 660 - 673
  • [3] Markdown Optimization via Approximate Dynamic Programming
    Cosgun, Ozlem
    Kula, Ufuk
    Kahraman, Cengiz
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2013, 6 (01) : 64 - 78
  • [4] Markdown Optimization via Approximate Dynamic Programming
    Özlem Coşgun
    Ufuk Kula
    Cengiz Kahraman
    [J]. International Journal of Computational Intelligence Systems, 2013, 6 : 64 - 78
  • [5] Approximate convex multiparametric programming
    Bemporad, A
    Filippi, C
    [J]. 42ND IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-6, PROCEEDINGS, 2003, : 3185 - 3190
  • [6] Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization
    Chapados, Nicolas
    Bengio, Yoshua
    [J]. JOURNAL OF COMPUTERS, 2007, 2 (01) : 12 - 19
  • [7] The K best-paths approach to approximate dynamic programming with application to portfolio optimization
    Chapados, Nicolas
    Bengio, Yoshua
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4013 : 491 - 502
  • [8] Dynamic programming approach to optimization of approximate decision rules
    Amin, Talha
    Chikalov, Igor
    Moshkov, Mikhail
    Zielosko, Beata
    [J]. INFORMATION SCIENCES, 2013, 221 : 403 - 418
  • [9] An algorithm for approximate multiparametric convex programming
    Bemporad, Alberto
    Filippi, Carlo
    [J]. COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2006, 35 (01) : 87 - 108
  • [10] AN APPROXIMATE METHOD FOR CONVEX-PROGRAMMING
    ABRAHAM, J
    [J]. ECONOMETRICA, 1961, 29 (04) : 700 - 703