Strong average optimality for controlled nonhomogeneous Markov chains

被引:3
|
作者
Guo, XP
Shi, P
Zhu, WP
机构
[1] Zhongshan Univ, Dept Math, Canton 510275, Peoples R China
[2] Def Sci & Technol Org, Land Operat Div, Salisbury, SA 5108, Australia
[3] Univ Queensland, Dept Comp Sci & Elect Engn, St Lucia, Qld 4072, Australia
关键词
controlled nonhomogeneous Markov chains (CNMCs); strong average criterion; optimality equations (OEs); strong average epsilon-optimal policies; algorithm;
D O I
10.1081/SAP-100001186
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In this paper, the problem of three kinds of average criteria (totally so-called strong average criterion) for controlled nonhomogeneous Markov chains with possibly unbounded rewards and denumerable state space is studied. We proposed a new set of conditions under which the existence of both a solution to the optimality equations and the strong average epsilon -optimal Markov policies is derived by martingale theory. Furthermore, an algorithm for computing strong average epsilon(>0)-optimal Markov policies is presented.
引用
收藏
页码:115 / 134
页数:20
相关论文
共 50 条
  • [1] The strong law of large numbers for moving average of continuous state nonhomogeneous Markov chains
    Wang, Bei
    Shi, Zhiyan
    Yang, Weiguo
    [J]. STOCHASTICS-AN INTERNATIONAL JOURNAL OF PROBABILITY AND STOCHASTIC PROCESSES, 2020, 92 (05) : 732 - 745
  • [2] Average Optimality in Nonhomogeneous Infinite Horizon Markov Decision Processes
    Wachs, Allise O.
    Schochetman, Irwin E.
    Smith, Robert L.
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2011, 36 (01) : 147 - 164
  • [3] Some strong limit theorems for nonhomogeneous Markov chains indexed by controlled trees
    Weicai Peng
    Jie Liu
    Yongchao Hou
    Peishu Chen
    Jueping Bu
    [J]. Journal of Inequalities and Applications, 2016
  • [4] Some strong limit theorems for nonhomogeneous Markov chains indexed by controlled trees
    Peng, Weicai
    Liu, Jie
    Hou, Yongchao
    Chen, Peishu
    Bu, Jueping
    [J]. JOURNAL OF INEQUALITIES AND APPLICATIONS, 2016, : 1 - 9
  • [5] Optimization of Average Rewards of Time Nonhomogeneous Markov Chains
    Cao, Xi-Ren
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2015, 60 (07) : 1841 - 1856
  • [6] Controlled Markov chains with weak and strong interactions: Asymptotic optimality and applications to manufacturing
    Department of Mathematics, University of Georgia, Athens, GA, United States
    不详
    不详
    [J]. J. Optim. Theory Appl., 1 (169-194):
  • [7] Controlled Markov Chains with Weak and Strong Interactions: Asymptotic Optimality and Applications to Manufacturing
    Q. Zhang
    G. Yin
    E. K. Boukas
    [J]. Journal of Optimization Theory and Applications, 1997, 94 : 169 - 194
  • [8] Controlled Markov chains with weak and strong interactions: Asymptotic optimality and applications to manufacturing
    Zhang, Q
    Yin, G
    Boukas, EK
    [J]. JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1997, 94 (01) : 169 - 194
  • [9] Denumerable controlled markov chains with average reward criterion. Sample path optimality
    Cavazos-Cadena, Rolando
    Fernandez-Gaucherand, Emmanuel
    [J]. ZOR. Zeitschrift Fuer Operations Research, 41 (01):
  • [10] Risk-Sensitive Average Optimality in Markov Decision Chains
    Sladky, Karel
    Montes-de-Oca, Raul
    [J]. OPERATIONS RESEARCH PROCEEDINGS 2007, 2008, : 69 - +