Conditions for the existence of the forecast horizons in discounted Markov decision processes.

被引：0

作者：

Cruz-Suárez, D ^{[1
]}

机构：

[1] Univ Juarez Autonoma Tabasco, Div Acad Ciencias Basicas, Cunduacan 86690, Tabasco, Mexico

来源：

8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL VIII, PROCEEDINGS: CONTROL, COMMUNICATION AND NETWORK SYSTEMS, TECHNOLOGIES AND APPLICATIONS | 2004年

关键词：

Discounted Markov Decision Processes; Forecast Horizon; dynamic programming equation; value iteration;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper deals with stationary Markov Decision Processes (MDPs) with a finite and an infinite horizons and expected total discounted cost (see [1], [6] and [7]). For simplicity, in this work, we consider that both X and A are finite. We denote a Discounted Markov Decision Process (MDP) with the infinite horizon by M-infinity, and Discounted MDP with the finite horizon by M-n, where n is a positive integer. For M-infinity, we present one condition (see [5]), which ensures the existence of a positive integer N* called Forecast Horizon (FH). The knowledge of the existence of N* permits to reduce the problem of the determination of an optimal policy for M-infinity, just to the determination of an optimal policy for M-N*.

引用

页码：207 / 211

页数：5

共 50 条

[1] COMPUTATIONAL COMPARISON OF POLICY ITERATION ALGORITHMS FOR DISCOUNTED MARKOV DECISION PROCESSES.
Hartley, R.
Lavercombe, A.C.
Thomas, L.C.
1600, (13):
[2] CONDITIONS FOR THE EXISTENCE OF DECISION HORIZONS FOR DISCOUNTED PROBLEMS IN A STOCHASTIC ENVIRONMENT - A NOTE
SETHI, S
BHASKARAN, S
OPERATIONS RESEARCH LETTERS, 1985, 4 (02) : 61 - 64
[3] IDENTIFYING FORECAST HORIZONS IN NONHOMOGENEOUS MARKOV DECISION-PROCESSES
HOPP, WJ
OPERATIONS RESEARCH, 1989, 37 (02) : 339 - 343
[4] Conditions for the uniqueness of optimal policies of discounted Markov decision processes
Cruz-Suárez, D
Montes-de-Oca, R
Salem-Silva, F
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2004, 60 (03) : 415 - 436
[5] Conditions for the uniqueness of optimal policies of discounted Markov decision processes
Daniel Cruz-Suárez
Raúl Montes-de-Oca
Francisco Salem-Silva
Mathematical Methods of Operations Research, 2004, 60 : 415 - 436
[6] A STOPPING RULE FOR FORECAST HORIZONS IN NONHOMOGENEOUS MARKOV DECISION-PROCESSES
BEAN, JC
HOPP, WJ
DUENYAS, I
OPERATIONS RESEARCH, 1992, 40 (06) : 1188 - 1199
[7] Solution and forecast horizons for infinite-horizon nonhornogeneous Markov decision processes
Cheevaprawatdomrong, Torpong
Schochetman, Irwin E.
Smith, Robert L.
Garcia, Alfredo
MATHEMATICS OF OPERATIONS RESEARCH, 2007, 32 (01) : 51 - 72
[8] Discounted Markov decision processes with fuzzy costs
Abdellatif Semmouri
Mostafa Jourhmane
Zineb Belhallaj
Annals of Operations Research, 2020, 295 : 769 - 786
[9] DISCOUNTED AND AVERAGE MARKOV DECISION-PROCESSES WITH UNBOUNDED REWARDS - NEW CONDITIONS
QI, YH
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1992, 171 (01) : 111 - 124
[10] Weighted discounted Markov decision processes with perturbation
Liu Ke
Acta Mathematicae Applicatae Sinica, 1999, 15 (2) : 183 - 189

← 1 2 3 4 5 →