Learning and optimal control of imprecise Markov decision processes by dynamic programming using the imprecise Dirichlet model

被引:0
|
作者
Troffaes, MCM [1 ]
机构
[1] State Univ Ghent, SYSTeMS Res Grp, B-9052 Zwijnaarde, Belgium
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate the conditions under which dynamic programming yields a solution to simultaneous learning and optimal control of a Markov decision process. First, we introduce a new optimality criterion that allows act-state dependence. This criterion is based on a partial preference ordering induced by an imprecise probability model of the dynamics of the system, updated by observations of the state and control history of the system. Then, we show that dynamic programming yields the set of all optimal solutions if the imprecise probability model satisfies particular properties. When we model learning of the system dynamics by an imprecise Dirichlet model, these properties turn out to be satisfied.
引用
收藏
页码:141 / 148
页数:8
相关论文
共 50 条
  • [1] Real-time dynamic programming for Markov decision processes with imprecise probabilities
    Delgado, Karina V.
    de Barros, Leliane N.
    Dias, Daniel B.
    Sanner, Scott
    [J]. ARTIFICIAL INTELLIGENCE, 2016, 230 : 192 - 223
  • [2] Multilinear and Integer Programming for Markov Decision Processes with Imprecise Probabilities
    Shirota Filho, Ricardo
    Cozman, Fabio Gagliardi
    Trevizan, Felipe Werndl
    de Campos, Cassio Polpo
    de Barros, Leliane Nunes
    [J]. ISIPTA 07-PROCEEDINGS OF THE FIFTH INTERNATIONAL SYMPOSIUM ON IMPRECISE PROBABILITY:THEORIES AND APPLICATIONS, 2007, : 395 - +
  • [3] Using mathematical programming to solve Factored Markov Decision Processes with Imprecise Probabilities
    Delgado, Karina Valdivia
    de Barros, Leliane Nunes
    Cozman, Fabio Gagliardi
    Sanner, Scott
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2011, 52 (07) : 1000 - 1017
  • [4] Learning in games using the imprecise Dirichlet model
    Quaeghebeur, Erik
    de Cooman, Gert
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2009, 50 (02) : 243 - 256
  • [5] Generalizing Markov decision processes to imprecise probabilities
    Harmanec, D
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2002, 105 (01) : 199 - 213
  • [6] Partially observable Markov decision processes with imprecise parameters
    Itoh, Hideaki
    Nakamura, Kiyohiko
    [J]. ARTIFICIAL INTELLIGENCE, 2007, 171 (8-9) : 453 - 490
  • [7] Decision making under incomplete data using the imprecise Dirichlet model
    Utkin, L. V.
    Augustin, Th.
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2007, 44 (03) : 322 - 338
  • [8] Decision making under incomplete data using the imprecise Dirichlet model
    Utkin, L. V.
    Augustin, Th.
    [J]. ISIPTA 05-PROCEEDINGS OF THE FOURTH INTERNATIONAL SYMPOSIUM ON IMPRECISE PROBABILITIES AND THEIR APPLICATIONS, 2005, : 359 - 368
  • [9] MARKOV DECISION-PROCESSES WITH IMPRECISE TRANSITION-PROBABILITIES
    WHITE, CC
    ELDEIB, HK
    [J]. OPERATIONS RESEARCH, 1994, 42 (04) : 739 - 749
  • [10] Representing and Solving Factored Markov Decision Processes with Imprecise Probabilities
    Delgado, Karina Valdivia
    de Barros, Leliane Nunes
    Cozman, Fabio Gagliardi
    Shirota, Ricardo
    [J]. ISIPTA '09: PROCEEDINGS OF THE SIXTH INTERNATIONAL SYMPOSIUM ON IMPRECISE PROBABILITY: THEORIES AND APPLICATIONS, 2009, : 169 - +