Equivalence classes for optimizing risk models in Markov decision processes

被引:0
|
作者
Yoshio Ohtsubo
Kenji Toyonaga
机构
[1] Kochi University,Department of Mathematics
[2] Kyushu University,Graduate School of Mathematics
关键词
Markov decision process; Threshold probability; Equivalence relation; Existence of optimal policy;
D O I
暂无
中图分类号
学科分类号
摘要
We consider eight problems in which we maximize or minimize threshold probabilities in discounted Markov decision processes with bounded reward set. We show that such problems are classified to two equivalence classes and give a relationship between optimal values and optimal policies of problems in each equivalence class. Literatures relative to such problems deal with only first equivalence class (cf. White(1993), Wu and Lin(1999) and Ohtsubo and Toyonaga(2002)). We consider a problem of the second equivalence class in the same situation as Ohtsubo and Toyonaga and characterize optimal values in finite and infinite horizon cases, by using an argument of a dual problem. We also give two sufficient conditions for the existence of an optimal policy. Finally we give a relationship of optimal values between first and second equivalence classes.
引用
收藏
页码:239 / 250
页数:11
相关论文
共 50 条
  • [1] Equivalence classes for optimizing risk models in Markov decision processes
    Ohtsubo, Y
    Toyonaga, K
    [J]. MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2004, 60 (02) : 239 - 250
  • [2] Dynamical equivalence classes for Markov jump processes
    Verley, Gatien
    [J]. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2022, 2022 (02):
  • [3] Approximate equivalence of Markov decision processes
    Even-Dar, E
    Mansour, Y
    [J]. LEARNING THEORY AND KERNEL MACHINES, 2003, 2777 : 581 - 594
  • [4] Counting Markov equivalence classes for DAG models on trees
    Radhakrishnan, Adityanarayanan
    Solus, Liam
    Uhler, Caroline
    [J]. DISCRETE APPLIED MATHEMATICS, 2018, 244 : 170 - 185
  • [5] Optimal policy for minimizing risk models in Markov decision processes
    Ohtsubo, Y
    Toyonaga, K
    [J]. JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2002, 271 (01) : 66 - 81
  • [6] Ordinal Decision Models for Markov Decision Processes
    Weng, Paul
    [J]. 20TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2012), 2012, 242 : 828 - 833
  • [7] Equivalence notions and model minimization in Markov decision processes
    Givan, R
    Dean, T
    Greig, M
    [J]. ARTIFICIAL INTELLIGENCE, 2003, 147 (1-2) : 163 - 223
  • [8] Characterizing Markov equivalence classes for AMP chain graph models
    Andersson, Steen A.
    Perlman, Michael D.
    [J]. ANNALS OF STATISTICS, 2006, 34 (02): : 939 - 972
  • [9] The size distribution for Markov equivalence classes of acyclic digraph models
    Gillispie, SB
    Perlman, MD
    [J]. ARTIFICIAL INTELLIGENCE, 2002, 141 (1-2) : 137 - 155
  • [10] Size of Interventional Markov Equivalence Classes in Random DAG Models
    Katz, Dmitriy
    Shanmugam, Karthikeyan
    Squires, Chandler
    Uhler, Caroline
    [J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89