Approximate dynamic programming for stochastic linear control problems on compact state spaces

被引:5
|
作者
Woerner, Stefan [1 ]
Laumanns, Marco [1 ]
Zenklusen, Rico [2 ]
Fertis, Apostolos [3 ]
机构
[1] IBM Res, CH-8803 Ruschlikon, Switzerland
[2] ETH, CH-8092 Zurich, Switzerland
[3] SMA & Partner AG, CH-8050 Zurich, Switzerland
关键词
Dynamic programming; Markov processes; Inventory; Dual sourcing; Multiple sourcing; MARKOV CONTROL PROCESSES; OPTIMAL POLICIES; INVENTORY MODEL; S POLICIES; OPTIMALITY;
D O I
10.1016/j.ejor.2014.08.003
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
This paper addresses Markov Decision Processes over compact state and action spaces. We investigate the special case of linear dynamics and piecewise-linear and convex immediate costs for the average cost criterion. This model is very general and covers many interesting examples, for instance in inventory management. Due to the curse of dimensionality, the problem is intractable and optimal policies usually cannot be computed, not even for instances of moderate size. We show the existence of optimal policies and of convex and bounded relative value functions that solve the average cost optimality equation under reasonable and easy-to-check assumptions. Based on these insights, we propose an approximate relative value iteration algorithm based on piecewise-linear convex relative value function approximations. Besides computing good policies, the algorithm also provides lower bounds to the optimal average cost, which allow us to bound the optimality gap of any given policy for a given instance. The algorithm is applied to the well-studied Multiple Sourcing Problem as known from inventory management. Multiple sourcing is known to be a hard problem and usually tackled by parametric heuristics. We analyze several MSP instances with two and more suppliers and compare our results to state-of-the-art heuristics. For the considered scenarios, our policies are always at least as good as the best known heuristic, and strictly better in most cases. Moreover, by using the computed lower bounds we show for all instances that the optimality gap has never exceeded 5%, and that it has been much smaller for most of them. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:85 / 98
页数:14
相关论文
共 50 条
  • [1] Approximate Dynamic Programming for Continuous State and Control Problems
    Si, Jennie
    Yang, Lei
    Lu, Chao
    Sun, Jian
    Mei, Shengwei
    [J]. MED: 2009 17TH MEDITERRANEAN CONFERENCE ON CONTROL & AUTOMATION, VOLS 1-3, 2009, : 1415 - 1420
  • [2] Approximate Dynamic Programming for Stochastic Resource Allocation Problems
    Ali Forootani
    Raffaele Iervolino
    Massimo Tipaldi
    Joshua Neilson
    [J]. IEEE/CAA Journal of Automatica Sinica, 2020, 7 (04) : 975 - 990
  • [3] Approximate dynamic programming for stochastic resource allocation problems
    Forootani, Ali
    Iervolino, Raffaele
    Tipaldi, Massimo
    Neilson, Joshua
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, 7 (04) : 975 - 990
  • [4] Approximate dynamic programming for network recovery problems with stochastic demand
    Ulusan, Aybike
    Ergun, Ozlem
    [J]. TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2021, 151
  • [5] Linear programming formulations of stochastic control problems
    Kurtz, TG
    Stockbridge, RH
    [J]. PROCEEDINGS OF THE 37TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 1998, : 2662 - 2667
  • [6] Stable linear approximations to dynamic programming for stochastic control problems with local transitions
    VanRoy, B
    Tsitsiklis, JN
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 8: PROCEEDINGS OF THE 1995 CONFERENCE, 1996, 8 : 1045 - 1051
  • [7] State Aggregation based Linear Programming approach to Approximate Dynamic Programming
    Darbha, S.
    Krishnamoorthy, K.
    Pachter, M.
    Chandler, P.
    [J]. 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 935 - 941
  • [8] Dynamic programming for multidimensional stochastic control problems
    Jin Ma
    Jiongmin Yong
    [J]. Acta Mathematica Sinica, 1999, 15 : 485 - 506
  • [9] Dynamic Programming for Multidimensional Stochastic Control Problems
    Jin Ma Department of Mathematics
    [J]. Acta Mathematica Sinica,English Series, 1999, 15 (04) : 485 - 506
  • [10] Dynamic programming for multidimensional stochastic control problems
    Ma, J
    Yong, J
    [J]. ACTA MATHEMATICA SINICA-ENGLISH SERIES, 1999, 15 (04): : 485 - 506