Mean-Variance Problems for Finite Horizon Semi-Markov Decision Processes

Cited: 4
Authors
Huang, Yonghui [1 ]
Guo, Xianping [1 ]
Affiliations
[1] Sun Yat Sen Univ, Sch Math & Computat Sci, Guangzhou 510275, Guangdong, Peoples R China
Source
APPLIED MATHEMATICS AND OPTIMIZATION | 2015, Vol. 72, Issue 2
Keywords
Finite horizon semi-Markov decision processes; Mean-variance optimal policy; Dynamic programming; Value iteration; Policy improvement; Linear programming; PORTFOLIO SELECTION; RISK PROBABILITY; REWARD VARIANCE; MINIMIZATION;
DOI
10.1007/s00245-014-9278-9
Chinese Library Classification
O29 [Applied Mathematics]
Discipline Code
070104
Abstract
This paper deals with a mean-variance problem for finite horizon semi-Markov decision processes. The state and action spaces are Borel spaces, and the reward function may be unbounded. The goal is to find an optimal policy that minimizes the finite horizon reward variance over the set of policies attaining a given mean. Using the theory of N-step contraction, we characterize the policies with a given mean and, under suitable conditions, convert the second-order moment of the finite horizon reward into the mean of an infinite horizon reward/cost generated by a discrete-time Markov decision process (MDP) with a two-dimensional state space and a new one-step reward/cost. We then establish the optimality equation and the existence of mean-variance optimal policies by employing existing results for discrete-time MDPs. We also provide value iteration and policy improvement algorithms for computing the value function and mean-variance optimal policies, respectively. In addition, a linear program and its dual are developed for solving the mean-variance problem.
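To make the stated objective concrete, here is a minimal formalization consistent with the abstract; the notation (R_T for the reward accumulated up to the horizon T, \Pi_\mu for the set of policies attaining mean \mu) is illustrative, not the paper's:

```latex
% Illustrative notation, not the paper's own: R_T is the reward
% accumulated up to the horizon T; \Pi_\mu collects the policies
% whose mean reward equals \mu.
\[
  \min_{\pi \in \Pi_\mu} \operatorname{Var}^{\pi}(R_T),
  \qquad
  \Pi_\mu := \bigl\{ \pi : \mathbb{E}^{\pi}[R_T] = \mu \bigr\}.
\]
% On \Pi_\mu the mean is fixed, so
\[
  \operatorname{Var}^{\pi}(R_T) = \mathbb{E}^{\pi}\bigl[R_T^2\bigr] - \mu^2,
\]
% hence minimizing the variance over \Pi_\mu is equivalent to
% minimizing the second moment E^pi[R_T^2] -- the quantity the
% abstract recasts as an infinite horizon expected reward/cost of
% an auxiliary discrete-time MDP.
```

The value iteration mentioned in the abstract then operates on that auxiliary discrete-time MDP. The sketch below shows only generic tabular value iteration, under assumptions the paper does not make (finite state and action sets, a known transition tensor `P`, reward matrix `r`, and discount factor `gamma`); the paper itself works with Borel spaces and a possibly unbounded reward:

```python
import numpy as np

def value_iteration(P, r, gamma=0.95, tol=1e-8, max_iter=10_000):
    """Generic tabular value iteration (illustrative sketch only).

    P : (A, S, S) array, P[a, s, t] = probability of moving s -> t under a.
    r : (S, A) array of one-step rewards.
    """
    n_actions, n_states, _ = P.shape
    V = np.zeros(n_states)
    for _ in range(max_iter):
        # Bellman backup: Q[s, a] = r[s, a] + gamma * sum_t P[a, s, t] * V[t]
        Q = r + gamma * np.einsum("ast,t->sa", P, V)
        V_new = Q.max(axis=1)
        if np.max(np.abs(V_new - V)) < tol:
            V = V_new
            break
        V = V_new
    policy = Q.argmax(axis=1)  # greedy policy at the (near-)fixed point
    return V, policy
```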
Pages: 233-259
Number of Pages: 27