Sensitivity-based nested partitions for solving finite-horizon Markov decision processes

被引:1
|
作者
Chen, Weiwei [1 ]
机构
[1] Rutgers State Univ, Dept Supply Chain Management, 1 Washington Pk, Newark, NJ 07102 USA
关键词
Approximate dynamic programming; Markov decision processes; Nested partitions; Sensitivity-based approach;
D O I
10.1016/j.orl.2017.07.006
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
In this paper, we propose a heuristic for solving finite-horizon Markov decision processes. The heuristic uses the nested partitions (NP) framework to guide an iterative search for the optimal policy. NP focuses the search on certain promising subregions, flexibly determined by the sampling weight of each action branch. Within each subregion, an effective local policy optimization is developed using sensitivity-based approach, which optimizes the sampling weights based on estimated gradient information. Numerical results show the effectiveness of the proposed heuristic. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:481 / 487
页数:7
相关论文
共 50 条
  • [41] A Sensitivity-Based Construction Approach to Sample-Path Variance Minimization of Markov Decision Processes
    Huang, Yonghao
    Chen, Xi
    [J]. 2012 2ND AUSTRALIAN CONTROL CONFERENCE (AUCC), 2012, : 215 - 220
  • [42] Evaluation of CUSUM Charts for Finite-Horizon Processes
    Nenes, George
    Tagaras, George
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2010, 39 (03) : 578 - 597
  • [43] FINITE STATE CONTINUOUS TIME MARKOV DECISION PROCESSES WITH A FINITE PLANNING HORIZON
    MILLER, BL
    [J]. SIAM JOURNAL ON CONTROL, 1968, 6 (02): : 266 - &
  • [44] Linear programming formulation for non-stationary, finite-horizon Markov decision process models
    Bhattacharya, Arnab
    Kharoufeh, Jeffrey P.
    [J]. OPERATIONS RESEARCH LETTERS, 2017, 45 (06) : 570 - 574
  • [45] Decomposition Methods for Solving Finite-Horizon Large MDPs
    el Akraoui, Bouchra
    Daoui, Cherki
    Larach, Abdelhadi
    Rahhali, Khalid
    [J]. JOURNAL OF MATHEMATICS, 2022, 2022
  • [46] Convergence of Value Functions for Finite Horizon Markov Decision Processes with Constraints
    Ichihara, Naoyuki
    [J]. APPLIED MATHEMATICS AND OPTIMIZATION, 2021, 84 (02): : 2177 - 2220
  • [47] An Approximate Stochastic Annealing Algorithm for Finite Horizon Markov Decision Processes
    Hu, Jiaqiao
    Chang, Hyeong Soo
    [J]. 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 5338 - 5343
  • [48] Convergence of Value Functions for Finite Horizon Markov Decision Processes with Constraints
    Naoyuki Ichihara
    [J]. Applied Mathematics & Optimization, 2021, 84 : 2177 - 2220
  • [49] Constrained Continuous-Time Markov Decision Processes on the Finite Horizon
    Guo, Xianping
    Huang, Yonghui
    Zhang, Yi
    [J]. APPLIED MATHEMATICS AND OPTIMIZATION, 2017, 75 (02): : 317 - 341
  • [50] A Policy Gradient Approach for Finite Horizon Constrained Markov Decision Processes
    Guin, Soumyajit
    Bhatnagar, Shalabh
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 3353 - 3359