Robustness of policies in constrained Markov decision processes

Cited by: 15

Authors:

Zadorojniy, A [1]

Shwartz, A

Affiliations:

[1] Intel Israel Ltd, MATAM, IL-31015 Haifa, Israel

[2] Technion Israel Inst Technol, Fac Elect Engn, IL-32000 Haifa, Israel

Source:

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 2006 / Vol. 51 / No. 04

Keywords:

constrained Markov decision process (MDP); discounted cost; Markov decision processes; robustness; sensitivity

DOI:

10.1109/TAC.2006.872754

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

We consider the optimization of finite-state, finite-action Markov decision processes (MDPs), under constraints. Cost and constraints are discounted. We introduce a new method for investigating the continuity, and a certain type of robustness, of the optimal cost and the optimal policy under changes in the constraints. This method is also applicable for other cost criteria such as finite horizon and infinite horizon average cost.

Pages: 635 - 638

Number of pages: 4

