Sensitivity-based nested partitions for solving finite-horizon Markov decision processes

Cited by: 1

Authors:

Chen, Weiwei [1]

Affiliations:

[1] Rutgers State Univ, Dept Supply Chain Management, 1 Washington Pk, Newark, NJ 07102 USA

Source:

OPERATIONS RESEARCH LETTERS | 2017 / Vol. 45 / Issue 05

Keywords:

Approximate dynamic programming; Markov decision processes; Nested partitions; Sensitivity-based approach;

DOI:

10.1016/j.orl.2017.07.006

Chinese Library Classification:

C93 [Management Science]; O22 [Operations Research];

Discipline classification codes:

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

Abstract:

In this paper, we propose a heuristic for solving finite-horizon Markov decision processes. The heuristic uses the nested partitions (NP) framework to guide an iterative search for the optimal policy. NP focuses the search on certain promising subregions, flexibly determined by the sampling weight of each action branch. Within each subregion, an effective local policy optimization is developed using a sensitivity-based approach, which optimizes the sampling weights based on estimated gradient information. Numerical results show the effectiveness of the proposed heuristic. (C) 2017 Elsevier B.V. All rights reserved.


Pages: 481 - 487

Number of pages: 7

50 items in total

[41] A Sensitivity-Based Construction Approach to Sample-Path Variance Minimization of Markov Decision Processes
Huang, Yonghao
Chen, Xi
[J]. 2012 2ND AUSTRALIAN CONTROL CONFERENCE (AUCC), 2012, : 215 - 220
[42] Evaluation of CUSUM Charts for Finite-Horizon Processes
Nenes, George
Tagaras, George
[J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2010, 39 (03) : 578 - 597
[43] FINITE STATE CONTINUOUS TIME MARKOV DECISION PROCESSES WITH A FINITE PLANNING HORIZON
MILLER, BL
[J]. SIAM JOURNAL ON CONTROL, 1968, 6 (02) : 266 - &
[44] Linear programming formulation for non-stationary, finite-horizon Markov decision process models
Bhattacharya, Arnab
Kharoufeh, Jeffrey P.
[J]. OPERATIONS RESEARCH LETTERS, 2017, 45 (06) : 570 - 574
[45] Decomposition Methods for Solving Finite-Horizon Large MDPs
el Akraoui, Bouchra
Daoui, Cherki
Larach, Abdelhadi
Rahhali, Khalid
[J]. JOURNAL OF MATHEMATICS, 2022, 2022
[46] Convergence of Value Functions for Finite Horizon Markov Decision Processes with Constraints
Ichihara, Naoyuki
[J]. APPLIED MATHEMATICS AND OPTIMIZATION, 2021, 84 (02) : 2177 - 2220
[47] An Approximate Stochastic Annealing Algorithm for Finite Horizon Markov Decision Processes
Hu, Jiaqiao
Chang, Hyeong Soo
[J]. 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 5338 - 5343
[48] Convergence of Value Functions for Finite Horizon Markov Decision Processes with Constraints
Naoyuki Ichihara
[J]. Applied Mathematics & Optimization, 2021, 84 : 2177 - 2220
[49] Constrained Continuous-Time Markov Decision Processes on the Finite Horizon
Guo, Xianping
Huang, Yonghui
Zhang, Yi
[J]. APPLIED MATHEMATICS AND OPTIMIZATION, 2017, 75 (02) : 317 - 341
[50] A Policy Gradient Approach for Finite Horizon Constrained Markov Decision Processes
Guin, Soumyajit
Bhatnagar, Shalabh
[J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 3353 - 3359
