Characterizing Markov decision processes

被引：0

作者：

Ratitch, B ^{[1
]}

Precup, D ^{[1
]}

机构：

[1] McGill Univ, Montreal, PQ, Canada

来源：

MACHINE LEARNING: ECML 2002 | 2002年 / 2430卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Problem characteristics often have a significant influence on the difficulty of solving optimization problems. In this paper, we propose attributes for characterizing Markov Decision Processes (MDPs), and discuss how they affect the performance of reinforcement learning algorithms that use function approximation. The attributes measure mainly the amount of randomness in the environment. Their values can be calculated from the MDP model or estimated on-line. We show empirically that two of the proposed attributes have a statistically significant effect on the quality of learning. We discuss how measurements of the proposed MDP attributes can be used to facilitate the design of reinforcement learning systems.

引用

页码：391 / 404

页数：14

共 50 条

[1] Markov decision processes
White, D.J.
Journal of the Operational Research Society, 1995, 46 (06):
[2] Markov Decision Processes
Bäuerle N.
Rieder U.
Jahresbericht der Deutschen Mathematiker-Vereinigung, 2010, 112 (4) : 217 - 243
[3] Online Markov Decision Processes
Even-Dar, Eyal
Kakade, Sham M.
Mansour, Yishay
MATHEMATICS OF OPERATIONS RESEARCH, 2009, 34 (03) : 726 - 736
[4] MARKOV DECISION-PROCESSES
SCHAL, M
STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 1984, 17 (01) : 13 - 13
[5] A review on Markov Decision Processes
J. A. Filar and LIU Ke Centre for Industrial and Applicable Mathematics
Institute of Applied Mathematics
Chinese Science Bulletin, 1999, (07) : 672 - 672
[6] On constrained Markov decision processes
Haviv, M
OPERATIONS RESEARCH LETTERS, 1996, 19 (01) : 25 - 28
[7] MARKOV DECISION-PROCESSES
WHITE, CC
WHITE, DJ
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1989, 39 (01) : 1 - 16
[8] Algebraic Markov Decision Processes
Perny, Patrice
Spanjaard, Olivier
Weng, Paul
19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), 2005, : 1372 - 1377
[9] Feature Markov Decision Processes
Hutter, Marcus
ARTIFICIAL GENERAL INTELLIGENCE PROCEEDINGS, 2009, 8 : 61 - 66
[10] Absorbing Markov decision processes
Dufour, Francois
Prieto-Rumeau, Tomas
ESAIM-CONTROL OPTIMISATION AND CALCULUS OF VARIATIONS, 2024, 30

← 1 2 3 4 5 →