Transient and Average Markov Reward Chains with Applications to Finance

被引:0
|
作者
Sladky, Karel [1 ]
机构
[1] Czech Acad Sci, Inst Informat Theory & Automat, Vodarenskou Vezi 4, Prague 18208 8, Czech Republic
关键词
dynamic programming; transient and average Markov reward chains; reward-variance optimality; optimality in financial models; VARIANCE;
D O I
暂无
中图分类号
F [经济];
学科分类号
02 ;
摘要
The article is devoted to Markov reward chains with finite state space. Since the usual optimization criteria examined in the literature on Markov reward chains, such as a total discounted, total reward up to reaching some specific state (called transient models) or mean (average) reward optimality, may be quite insufficient to characterize the problem from the point of a decision maker. It seems that it may be preferable if not necessary to select more sophisticated criteria that also reflect variability-risk features of the problem. Perhaps the best known approaches stem from the classical work of Markowitz on mean variance selection rules, i.e. we optimize the weighted sum of average or total reward and its variance. In the article explicit formulae for calculating the variances for transient and average models are reported along with sketches of algorithmic procedures for finding policies guaranteeing minimal variance in the class of policies with a given transient or average reward. Application of the obtained results to financial models is indicated.
引用
收藏
页码:773 / 778
页数:6
相关论文
共 50 条