Optimal policies for constrained average-cost Markov decision processes

被引:0
|
作者
Juan González-Hernández
César E. Villarreal
机构
[1] Universidad Nacional Autónoma de México,Departamento de Probabilidad y Estadística, Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas
[2] Universidad Autónoma de Nuevo León,Posgrado en Ingeniería de Sistemas, Facultad de Ingeniería Mecánica y Eléctrica
[3] Ciudad Universitaria,undefined
来源
TOP | 2011年 / 19卷
关键词
Markov decision processes; Constraints; Stable measures; 90C40;
D O I
暂无
中图分类号
学科分类号
摘要
We give mild conditions for the existence of optimal solutions for a Markov decision problem with average cost, under m constraints of the same kind, in Borel actions and states spaces. Moreover, there is an optimal policy that is a convex combination of at most m+1 deterministic policies.
引用
收藏
页码:107 / 120
页数:13
相关论文
共 50 条
  • [21] AVERAGE OPTIMAL POLICIES IN MARKOV DECISION DRIFT PROCESSES WITH APPLICATIONS TO A QUEUING AND A REPLACEMENT MODEL
    HORDIJK, A
    SCHOUTEN, FAV
    ADVANCES IN APPLIED PROBABILITY, 1983, 15 (02) : 274 - 303
  • [22] A note on the existence of optimal stationary policies for average Markov decision processes with countable states
    Xia, Li
    Guo, Xianping
    Cao, Xi-Ren
    AUTOMATICA, 2023, 151
  • [23] Optimal Policies for Quantum Markov Decision Processes
    Ying, Ming-Sheng
    Feng, Yuan
    Ying, Sheng-Gang
    INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2021, 18 (03) : 410 - 421
  • [24] IDENTIFICATION OF OPTIMAL POLICIES IN MARKOV DECISION PROCESSES
    Sladky, Karel
    KYBERNETIKA, 2010, 46 (03) : 558 - 570
  • [25] Optimal adaptive policies for Markov decision processes
    Burnetas, AN
    Katehakis, MN
    MATHEMATICS OF OPERATIONS RESEARCH, 1997, 22 (01) : 222 - 255
  • [26] Optimal Policies for Quantum Markov Decision Processes
    Ming-Sheng Ying
    Yuan Feng
    Sheng-Gang Ying
    Machine Intelligence Research, 2021, 18 (03) : 410 - 421
  • [27] Optimal Policies for Quantum Markov Decision Processes
    Ming-Sheng Ying
    Yuan Feng
    Sheng-Gang Ying
    International Journal of Automation and Computing, 2021, 18 : 410 - 421
  • [28] Learning algorithms or Markov decision processes with average cost
    Abounadi, J
    Bertsekas, D
    Borkar, VS
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2001, 40 (03) : 681 - 698
  • [29] AVERAGE COST SEMI-MARKOV DECISION PROCESSES
    ROSS, SM
    JOURNAL OF APPLIED PROBABILITY, 1970, 7 (03) : 649 - &
  • [30] Optimal control of average reward constrained continuous-time finite Markov Decision Processes
    Feinberg, EA
    PROCEEDINGS OF THE 41ST IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 2002, : 3805 - 3810