Absorbing Markov decision processes

被引:2
|
作者
Dufour, Francois [1 ,2 ,3 ]
Prieto-Rumeau, Tomas [4 ]
机构
[1] Univ Bordeaux, Inst Polytech Bordeaux, Bordeaux, France
[2] Univ Bordeaux, Team ASTRAL, INRIA Bordeaux Sud Ouest, Bordeaux, France
[3] Univ Bordeaux, Inst Math Bordeaux, Bordeaux, France
[4] UNED, Stat Dept, Madrid, Spain
关键词
Markov decision processes; absorbing model; occupation measures; characteristic equation; phantom measures; compactness of the set of occupation measures; EQUILIBRIA; POLICIES;
D O I
10.1051/cocv/2024002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we study discrete-time absorbing Markov Decision Processes (MDP) with measurable state space and Borel action space with a given initial distribution. For such models, solutions to the characteristic equation that are not occupation measures may exist. Several necessary and sufficient conditions are provided to guarantee that any solution to the characteristic equation is an occupation measure. Under the so-called continuity-compactness conditions, we first show that a measure is precisely an occupation measure if and only if it satisfies the characteristic equation and an additional absolute continuity condition. Secondly, it is shown that the set of occupation measures is compact in the weak-strong topology if and only if the model is uniformly absorbing. Several examples are provided to illustrate our results.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Configurable Markov Decision Processes
    Metelli, Alberto Maria
    Mutti, Mirco
    Restelli, Marcello
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [22] Quantile Markov Decision Processes
    Li, Xiaocheng
    Zhong, Huaiyang
    Brandeau, Margaret L.
    [J]. OPERATIONS RESEARCH, 2021, 70 (03) : 1428 - 1447
  • [23] Robust Markov Decision Processes
    Wiesemann, Wolfram
    Kuhn, Daniel
    Rustem, Berc
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2013, 38 (01) : 153 - 183
  • [24] Possibilistic Markov decision processes
    Sabbadin, R
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2001, 14 (03) : 287 - 300
  • [25] Ordinal Decision Models for Markov Decision Processes
    Weng, Paul
    [J]. 20TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2012), 2012, 242 : 828 - 833
  • [26] Markov Decision Processes with Arbitrary Reward Processes
    Yu, Jia Yuan
    Mannor, Shie
    Shimkin, Nahum
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2009, 34 (03) : 737 - 757
  • [27] Markov Decision Processes with Arbitrary Reward Processes
    Yu, Jia Yuan
    Mannor, Shie
    Shimkin, Nahum
    [J]. RECENT ADVANCES IN REINFORCEMENT LEARNING, 2008, 5323 : 268 - +
  • [28] On the quasi-ergodic distribution of absorbing Markov processes
    He, Guoman
    Zhang, Hanjun
    Zhu, Yixia
    [J]. STATISTICS & PROBABILITY LETTERS, 2019, 149 : 116 - 123
  • [29] ABSORBING MARKOV AND BRANCHING-PROCESSES WITH INSTANTANEOUS RESURRECTION
    PAKES, AG
    [J]. STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 1993, 48 (01) : 85 - 106
  • [30] NONPARAMETRIC INFERENCE FOR MARKOV PROCESSES WITH MISSING ABSORBING STATE
    Bakoyannis, Giorgos
    Zhang, Ying
    Yiannoutsos, Constantin T.
    [J]. STATISTICA SINICA, 2019, 29 (04) : 2083 - 2104