Equivalence classes for optimizing risk models in Markov decision processes

被引:0
|
作者
Yoshio Ohtsubo
Kenji Toyonaga
机构
[1] Kochi University,Department of Mathematics
[2] Kyushu University,Graduate School of Mathematics
关键词
Markov decision process; Threshold probability; Equivalence relation; Existence of optimal policy;
D O I
暂无
中图分类号
学科分类号
摘要
We consider eight problems in which we maximize or minimize threshold probabilities in discounted Markov decision processes with bounded reward set. We show that such problems are classified to two equivalence classes and give a relationship between optimal values and optimal policies of problems in each equivalence class. Literatures relative to such problems deal with only first equivalence class (cf. White(1993), Wu and Lin(1999) and Ohtsubo and Toyonaga(2002)). We consider a problem of the second equivalence class in the same situation as Ohtsubo and Toyonaga and characterize optimal values in finite and infinite horizon cases, by using an argument of a dual problem. We also give two sufficient conditions for the existence of an optimal policy. Finally we give a relationship of optimal values between first and second equivalence classes.
引用
收藏
页码:239 / 250
页数:11
相关论文
共 50 条
  • [31] EQUIVALENCE BETWEEN CONTINUOUS AND DISCRETE-TIME MARKOV DECISION-PROCESSES
    SERFOZO, RF
    [J]. OPERATIONS RESEARCH, 1979, 27 (03) : 616 - 620
  • [32] EQUIVALENCE OF LYAPUNOV STABILITY-CRITERIA IN A CLASS OF MARKOV DECISION-PROCESSES
    CAVAZOSCADENA, R
    HERNANDEZLERMA, O
    [J]. APPLIED MATHEMATICS AND OPTIMIZATION, 1992, 26 (02): : 113 - 137
  • [33] A characterization of Markov equivalence classes for acyclic digraphs
    Andersson, SA
    Madigan, D
    Perlman, MD
    [J]. ANNALS OF STATISTICS, 1997, 25 (02): : 505 - 541
  • [34] EQUIVALENCE CLASSES OF FUNCTIONS OF FINITE MARKOV CHAINS
    WANDELL, BA
    GREENO, JG
    EGAN, DE
    [J]. JOURNAL OF MATHEMATICAL PSYCHOLOGY, 1974, 11 (04) : 391 - 403
  • [35] Counting Markov Equivalence Classes by Number of Immoralities
    Radhakrishnan, Adityanarayanan
    Solus, Liam
    Uhler, Caroline
    [J]. CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI2017), 2017,
  • [36] Knowledge Transfer for Learning Markov Equivalence Classes
    Rodriguez-Lopez, Veronica
    Enrique Sucar, Luis
    [J]. INTERNATIONAL CONFERENCE ON PROBABILISTIC GRAPHICAL MODELS, VOL 138, 2020, 138 : 377 - 388
  • [37] Markov decision processes with iterated coherent risk measures
    Chu, Shanyun
    Zhang, Yi
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2014, 87 (11) : 2286 - 2293
  • [38] More Risk-Sensitive Markov Decision Processes
    Baeuerle, Nicole
    Rieder, Ulrich
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2014, 39 (01) : 105 - 120
  • [39] Constrained Risk-Averse Markov Decision Processes
    Ahmadi, Mohamadreza
    Rosolia, Ugo
    Ingham, Michel D.
    Murray, Richard M.
    Ames, Aaron D.
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11718 - 11725
  • [40] Solving Markov Decision Processes with Downside Risk Adjustment
    Abhijit Gosavi
    Anish Parulekar
    [J]. Machine Intelligence Research, 2016, 13 (03) : 235 - 245