Robustness of policies in constrained Markov decision processes

Cited by: 15
Authors
Zadorojniy, A [1]
Shwartz, A [2]
Affiliations
[1] Intel Israel Ltd, MATAM, IL-31015 Haifa, Israel
[2] Technion Israel Inst Technol, Fac Elect Engn, IL-32000 Haifa, Israel
Keywords
constrained Markov decision process (MDP); discounted cost; Markov decision processes; robustness; sensitivity
DOI
10.1109/TAC.2006.872754
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We consider the optimization of finite-state, finite-action Markov decision processes (MDPs), under constraints. Cost and constraints are discounted. We introduce a new method for investigating the continuity, and a certain type of robustness, of the optimal cost and the optimal policy under changes in the constraints. This method is also applicable for other cost criteria such as finite horizon and infinite horizon average cost.
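For orientation, the problem class named in the abstract can be written in the standard occupation-measure form. This is a sketch of the textbook formulation under stated assumptions, not the paper's own notation: the symbols $\beta$ (discount factor), $\sigma$ (initial distribution), $P$ (transition kernel), $c$ (cost), $d_k$ (constraint costs), $V_k$ (constraint bounds), and $\rho$ (discounted occupation measure) are all introduced here for illustration.

```latex
% Assumed setting: finite state space X, finite action space A,
% discount factor 0 < \beta < 1, K discounted constraints.
\begin{align*}
  \min_{\rho \ge 0} \quad
    & \sum_{x \in X} \sum_{a \in A} \rho(x,a)\, c(x,a) \\
  \text{subject to} \quad
    & \sum_{a \in A} \rho(y,a)
      - \beta \sum_{x \in X} \sum_{a \in A} P(y \mid x,a)\, \rho(x,a)
      = \sigma(y), && y \in X, \\
    & \sum_{x \in X} \sum_{a \in A} \rho(x,a)\, d_k(x,a) \le V_k,
      && k = 1, \dots, K.
\end{align*}
% Here \rho(x,a) = E \sum_{t \ge 0} \beta^t 1\{X_t = x, A_t = a\}
% is the (unnormalized) discounted occupation measure of a policy.
```

In this form the bounds $V_k$ enter only through the right-hand sides of linear inequalities, so a question about continuity under changes in the constraints can be read as asking how the optimal value and optimal solution of a linear program respond when those right-hand sides are perturbed.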
Pages: 635-638
Page count: 4