Equivalence classes for optimizing risk models in Markov decision processes

被引：0

作者：

Yoshio Ohtsubo

Kenji Toyonaga

机构：

[1] Kochi University,Department of Mathematics

[2] Kyushu University,Graduate School of Mathematics

来源：

Mathematical Methods of Operations Research | 2004年 / 60卷

关键词：

Markov decision process; Threshold probability; Equivalence relation; Existence of optimal policy;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

We consider eight problems in which we maximize or minimize threshold probabilities in discounted Markov decision processes with bounded reward set. We show that such problems are classified to two equivalence classes and give a relationship between optimal values and optimal policies of problems in each equivalence class. Literatures relative to such problems deal with only first equivalence class (cf. White(1993), Wu and Lin(1999) and Ohtsubo and Toyonaga(2002)). We consider a problem of the second equivalence class in the same situation as Ohtsubo and Toyonaga and characterize optimal values in finite and infinite horizon cases, by using an argument of a dual problem. We also give two sufficient conditions for the existence of an optimal policy. Finally we give a relationship of optimal values between first and second equivalence classes.

引用

页码：239 / 250

页数：11

共 50 条

[1] Equivalence classes for optimizing risk models in Markov decision processes
Ohtsubo, Y
Toyonaga, K
[J]. MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2004, 60 (02) : 239 - 250
[2] Dynamical equivalence classes for Markov jump processes
Verley, Gatien
[J]. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2022, 2022 (02):
[3] Approximate equivalence of Markov decision processes
Even-Dar, E
Mansour, Y
[J]. LEARNING THEORY AND KERNEL MACHINES, 2003, 2777 : 581 - 594
[4] Counting Markov equivalence classes for DAG models on trees
Radhakrishnan, Adityanarayanan
Solus, Liam
Uhler, Caroline
[J]. DISCRETE APPLIED MATHEMATICS, 2018, 244 : 170 - 185
[5] Optimal policy for minimizing risk models in Markov decision processes
Ohtsubo, Y
Toyonaga, K
[J]. JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2002, 271 (01) : 66 - 81
[6] Ordinal Decision Models for Markov Decision Processes
Weng, Paul
[J]. 20TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2012), 2012, 242 : 828 - 833
[7] Equivalence notions and model minimization in Markov decision processes
Givan, R
Dean, T
Greig, M
[J]. ARTIFICIAL INTELLIGENCE, 2003, 147 (1-2) : 163 - 223
[8] Characterizing Markov equivalence classes for AMP chain graph models
Andersson, Steen A.
Perlman, Michael D.
[J]. ANNALS OF STATISTICS, 2006, 34 (02): : 939 - 972
[9] The size distribution for Markov equivalence classes of acyclic digraph models
Gillispie, SB
Perlman, MD
[J]. ARTIFICIAL INTELLIGENCE, 2002, 141 (1-2) : 137 - 155
[10] Size of Interventional Markov Equivalence Classes in Random DAG Models
Katz, Dmitriy
Shanmugam, Karthikeyan
Squires, Chandler
Uhler, Caroline
[J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89

← 1 2 3 4 5 →