Strong average optimality for controlled nonhomogeneous Markov chains

被引：3

作者：

Guo, XP

Shi, P

Zhu, WP

机构：

[1] Zhongshan Univ, Dept Math, Canton 510275, Peoples R China

[2] Def Sci & Technol Org, Land Operat Div, Salisbury, SA 5108, Australia

[3] Univ Queensland, Dept Comp Sci & Elect Engn, St Lucia, Qld 4072, Australia

来源：

STOCHASTIC ANALYSIS AND APPLICATIONS | 2001年 / 19卷 / 01期

关键词：

controlled nonhomogeneous Markov chains (CNMCs); strong average criterion; optimality equations (OEs); strong average epsilon-optimal policies; algorithm;

D O I：

10.1081/SAP-100001186

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

In this paper, the problem of three kinds of average criteria (totally so-called strong average criterion) for controlled nonhomogeneous Markov chains with possibly unbounded rewards and denumerable state space is studied. We proposed a new set of conditions under which the existence of both a solution to the optimality equations and the strong average epsilon -optimal Markov policies is derived by martingale theory. Furthermore, an algorithm for computing strong average epsilon(>0)-optimal Markov policies is presented.

引用

页码：115 / 134

页数：20

共 50 条

[1] The strong law of large numbers for moving average of continuous state nonhomogeneous Markov chains
Wang, Bei
Shi, Zhiyan
Yang, Weiguo
[J]. STOCHASTICS-AN INTERNATIONAL JOURNAL OF PROBABILITY AND STOCHASTIC PROCESSES, 2020, 92 (05) : 732 - 745
[2] Average Optimality in Nonhomogeneous Infinite Horizon Markov Decision Processes
Wachs, Allise O.
Schochetman, Irwin E.
Smith, Robert L.
[J]. MATHEMATICS OF OPERATIONS RESEARCH, 2011, 36 (01) : 147 - 164
[3] Some strong limit theorems for nonhomogeneous Markov chains indexed by controlled trees
Weicai Peng
Jie Liu
Yongchao Hou
Peishu Chen
Jueping Bu
[J]. Journal of Inequalities and Applications, 2016
[4] Some strong limit theorems for nonhomogeneous Markov chains indexed by controlled trees
Peng, Weicai
Liu, Jie
Hou, Yongchao
Chen, Peishu
Bu, Jueping
[J]. JOURNAL OF INEQUALITIES AND APPLICATIONS, 2016, : 1 - 9
[5] Optimization of Average Rewards of Time Nonhomogeneous Markov Chains
Cao, Xi-Ren
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2015, 60 (07) : 1841 - 1856
[6] Controlled Markov chains with weak and strong interactions: Asymptotic optimality and applications to manufacturing
Department of Mathematics, University of Georgia, Athens, GA, United States
不详
不详
[J]. J. Optim. Theory Appl., 1 (169-194):
[7] Controlled Markov Chains with Weak and Strong Interactions: Asymptotic Optimality and Applications to Manufacturing
Q. Zhang
G. Yin
E. K. Boukas
[J]. Journal of Optimization Theory and Applications, 1997, 94 : 169 - 194
[8] Controlled Markov chains with weak and strong interactions: Asymptotic optimality and applications to manufacturing
Zhang, Q
Yin, G
Boukas, EK
[J]. JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1997, 94 (01) : 169 - 194
[9] Denumerable controlled markov chains with average reward criterion. Sample path optimality
Cavazos-Cadena, Rolando
Fernandez-Gaucherand, Emmanuel
[J]. ZOR. Zeitschrift Fuer Operations Research, 41 (01):
[10] Risk-Sensitive Average Optimality in Markov Decision Chains
Sladky, Karel
Montes-de-Oca, Raul
[J]. OPERATIONS RESEARCH PROCEEDINGS 2007, 2008, : 69 - +

← 1 2 3 4 5 →