Mean-Variance Problems for Finite Horizon Semi-Markov Decision Processes

被引:4
|
作者
Huang, Yonghui [1 ]
Guo, Xianping [1 ]
机构
[1] Sun Yat Sen Univ, Sch Math & Computat Sci, Guangzhou 510275, Guangdong, Peoples R China
来源
APPLIED MATHEMATICS AND OPTIMIZATION | 2015年 / 72卷 / 02期
关键词
Finite horizon semi-Markov decision processes; Mean-variance optimal policy; Dynamic programming; Value iteration; Policy improvement; Linear programming; PORTFOLIO SELECTION; RISK PROBABILITY; REWARD VARIANCE; MINIMIZATION;
D O I
10.1007/s00245-014-9278-9
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
This paper deals with a mean-variance problem for finite horizon semi-Markov decision processes. The state and action spaces are Borel spaces, while the reward function may be unbounded. The goal is to seek an optimal policy with minimal finite horizon reward variance over the set of policies with a given mean. Using the theory of -step contraction, we give a characterization of policies with a given mean and convert the second order moment of the finite horizon reward to a mean of an infinite horizon reward/cost generated by a discrete-time Markov decision processes (MDP) with a two dimension state space and a new one-step reward/cost under suitable conditions. We then establish the optimality equation and the existence of mean-variance optimal policies by employing the existing results of discrete-time MDPs. We also provide a value iteration and a policy improvement algorithms for computing the value function and mean-variance optimal policies, respectively. In addition, a linear program and the dual program are developed for solving the mean-variance problem.
引用
收藏
页码:233 / 259
页数:27
相关论文
共 50 条
  • [11] Constrained optimality for finite horizon semi-Markov decision processes in Polish spaces
    Huang, Yonghui
    Li, Zhongfei
    Guo, Xianping
    OPERATIONS RESEARCH LETTERS, 2014, 42 (02) : 123 - 129
  • [12] Semi-Markov decision processes with variance minimization criterion
    Wei, Qingda
    Guo, Xianping
    4OR-A QUARTERLY JOURNAL OF OPERATIONS RESEARCH, 2015, 13 (01): : 59 - 79
  • [13] Semi-Markov decision processes with variance minimization criterion
    Qingda Wei
    Xianping Guo
    4OR, 2015, 13 : 59 - 79
  • [14] Global Algorithms for Mean-Variance Optimization in Markov Decision Processes
    Xia, Li
    Ma, Shuai
    MATHEMATICS OF OPERATIONS RESEARCH, 2025,
  • [15] Algorithmic aspects of mean-variance optimization in Markov decision processes
    Mannor, Shie
    Tsitsiklis, John N.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2013, 231 (03) : 645 - 653
  • [16] A RISK MINIMIZATION PROBLEM FOR FINITE HORIZON SEMI-MARKOV DECISION PROCESSES WITH LOSS RATES
    Liu, Qiuli
    Zou, Xiaolong
    JOURNAL OF DYNAMICS AND GAMES, 2018, 5 (02): : 143 - 163
  • [17] A mean-variance optimization problem for discounted Markov decision processes
    Guo, Xianping
    Ye, Liuer
    Yin, George
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2012, 220 (02) : 423 - 429
  • [18] Finite horizon continuous-time Markov decision processes with mean and variance criteria
    Huang, Yonghui
    DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2018, 28 (04): : 539 - 564
  • [19] Finite horizon continuous-time Markov decision processes with mean and variance criteria
    Yonghui Huang
    Discrete Event Dynamic Systems, 2018, 28 : 539 - 564
  • [20] Optimal Stopping Time on Semi-Markov Processes with Finite Horizon
    Chen, Fang
    Guo, Xianping
    Liao, Zhong-Wei
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2022, 194 (02) : 408 - 439