Mean-Variance Problems for Finite Horizon Semi-Markov Decision Processes

被引：4

作者：

Huang, Yonghui ^{[1
]}

Guo, Xianping ^{[1
]}

机构：

[1] Sun Yat Sen Univ, Sch Math & Computat Sci, Guangzhou 510275, Guangdong, Peoples R China

来源：

APPLIED MATHEMATICS AND OPTIMIZATION | 2015年 / 72卷 / 02期

关键词：

Finite horizon semi-Markov decision processes; Mean-variance optimal policy; Dynamic programming; Value iteration; Policy improvement; Linear programming; PORTFOLIO SELECTION; RISK PROBABILITY; REWARD VARIANCE; MINIMIZATION;

D O I：

10.1007/s00245-014-9278-9

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

This paper deals with a mean-variance problem for finite horizon semi-Markov decision processes. The state and action spaces are Borel spaces, while the reward function may be unbounded. The goal is to seek an optimal policy with minimal finite horizon reward variance over the set of policies with a given mean. Using the theory of -step contraction, we give a characterization of policies with a given mean and convert the second order moment of the finite horizon reward to a mean of an infinite horizon reward/cost generated by a discrete-time Markov decision processes (MDP) with a two dimension state space and a new one-step reward/cost under suitable conditions. We then establish the optimality equation and the existence of mean-variance optimal policies by employing the existing results of discrete-time MDPs. We also provide a value iteration and a policy improvement algorithms for computing the value function and mean-variance optimal policies, respectively. In addition, a linear program and the dual program are developed for solving the mean-variance problem.

引用

页码：233 / 259

页数：27

共 50 条

[11] Constrained optimality for finite horizon semi-Markov decision processes in Polish spaces
Huang, Yonghui
Li, Zhongfei
Guo, Xianping
OPERATIONS RESEARCH LETTERS, 2014, 42 (02) : 123 - 129
[12] Semi-Markov decision processes with variance minimization criterion
Wei, Qingda
Guo, Xianping
4OR-A QUARTERLY JOURNAL OF OPERATIONS RESEARCH, 2015, 13 (01): : 59 - 79
[13] Semi-Markov decision processes with variance minimization criterion
Qingda Wei
Xianping Guo
4OR, 2015, 13 : 59 - 79
[14] Global Algorithms for Mean-Variance Optimization in Markov Decision Processes
Xia, Li
Ma, Shuai
MATHEMATICS OF OPERATIONS RESEARCH, 2025,
[15] Algorithmic aspects of mean-variance optimization in Markov decision processes
Mannor, Shie
Tsitsiklis, John N.
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2013, 231 (03) : 645 - 653
[16] A RISK MINIMIZATION PROBLEM FOR FINITE HORIZON SEMI-MARKOV DECISION PROCESSES WITH LOSS RATES
Liu, Qiuli
Zou, Xiaolong
JOURNAL OF DYNAMICS AND GAMES, 2018, 5 (02): : 143 - 163
[17] A mean-variance optimization problem for discounted Markov decision processes
Guo, Xianping
Ye, Liuer
Yin, George
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2012, 220 (02) : 423 - 429
[18] Finite horizon continuous-time Markov decision processes with mean and variance criteria
Huang, Yonghui
DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2018, 28 (04): : 539 - 564
[19] Finite horizon continuous-time Markov decision processes with mean and variance criteria
Yonghui Huang
Discrete Event Dynamic Systems, 2018, 28 : 539 - 564
[20] Optimal Stopping Time on Semi-Markov Processes with Finite Horizon
Chen, Fang
Guo, Xianping
Liao, Zhong-Wei
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2022, 194 (02) : 408 - 439

← 1 2 3 4 5 →