Optimal policies for constrained average-cost Markov decision processes

Authors:

Juan González-Hernández

César E. Villarreal

Affiliations:

[1] Universidad Nacional Autónoma de México, Departamento de Probabilidad y Estadística, Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas

[2] Universidad Autónoma de Nuevo León, Posgrado en Ingeniería de Sistemas, Facultad de Ingeniería Mecánica y Eléctrica

[3] Ciudad Universitaria

Source:

TOP | 2011 / Vol. 19

Keywords:

Markov decision processes; Constraints; Stable measures; 90C40;

DOI:

Not available

Abstract:

We give mild conditions for the existence of optimal solutions for a Markov decision problem with average cost, under m constraints of the same kind, in Borel action and state spaces. Moreover, there is an optimal policy that is a convex combination of at most m+1 deterministic policies.

Pages: 107-120

Number of pages: 13
