A Sensitivity-Based Construction Approach to Variance Minimization of Markov Decision Processes

被引:1
|
作者
Huang, Yonghao [1 ]
Chen, Xi [2 ]
机构
[1] Chinese Insurance Informat Technol Co Ltd, Beijing, Peoples R China
[2] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Variance minimization; sensitivity-based approach; Markov decision process; SAMPLE-PATH OPTIMALITY; PERFORMANCE SENSITIVITIES; OPTIMIZATION;
D O I
10.1002/asjc.1875
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With a long-run average performance as the primary criterion for a Markov decision process, variance measures are studied as its secondary criteria. The steady-state variance and the limiting average variance along a sample path are discussed. The latter one is difficult to handle due to its special form. With a sensitivity-based approach, the difference formula for the sample-path variance under different policies is intuitively constructed and then the optimality equation is presented. Moreover a policy iteration algorithm is developed. This work extends the sensitivity-based construction approach to Markov decision processes with non-standard performance criteria. The difference between these two types of variance and bias criteria is illustrated with a numerical example.
引用
收藏
页码:1166 / 1178
页数:13
相关论文
共 50 条
  • [1] A Sensitivity-Based Construction Approach to Sample-Path Variance Minimization of Markov Decision Processes
    Huang, Yonghao
    Chen, Xi
    [J]. 2012 2ND AUSTRALIAN CONTROL CONFERENCE (AUCC), 2012, : 215 - 220
  • [2] Markov decision processes with variance minimization: A new condition and approach
    Zhu, Quanxin
    Guo, Xianping
    [J]. STOCHASTIC ANALYSIS AND APPLICATIONS, 2007, 25 (03) : 577 - 592
  • [3] Variance minimization of parameterized Markov decision processes
    Xia, Li
    [J]. DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2018, 28 (01): : 63 - 81
  • [4] Variance minimization of parameterized Markov decision processes
    Li Xia
    [J]. Discrete Event Dynamic Systems, 2018, 28 : 63 - 81
  • [5] Semi-Markov decision processes with variance minimization criterion
    Wei, Qingda
    Guo, Xianping
    [J]. 4OR-A QUARTERLY JOURNAL OF OPERATIONS RESEARCH, 2015, 13 (01): : 59 - 79
  • [6] Semi-Markov decision processes with variance minimization criterion
    Qingda Wei
    Xianping Guo
    [J]. 4OR, 2015, 13 : 59 - 79
  • [7] Sensitivity-based nested partitions for solving finite-horizon Markov decision processes
    Chen, Weiwei
    [J]. OPERATIONS RESEARCH LETTERS, 2017, 45 (05) : 481 - 487
  • [8] A Sensitivity-Based Approach for the Control of Repetitive Processes
    Rauh, Andreas
    Senkel, Luise
    Dittrich, Christina
    Aschemann, Harald
    Galkowski, Krzysztof
    Dabkowski, Pawel
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS (ICM), 2013,
  • [9] Variance minimization for continuous-time Markov decision processes: two approaches
    Zhu Quan-xin
    [J]. APPLIED MATHEMATICS-A JOURNAL OF CHINESE UNIVERSITIES SERIES B, 2010, 25 (04): : 400 - 410
  • [10] Variance minimization for continuous-time Markov decision processes: two approaches
    Quan-xin Zhu
    [J]. Applied Mathematics-A Journal of Chinese Universities, 2010, 25 : 400 - 410