On a reward rate estimation for the finite irreducible continuous-time Markov chain

被引:0
|
作者
Andronov A. [1 ]
机构
[1] Mathematical Methods and Modeling, Transport and Telecommunication Institute, Riga
关键词
Markov chain; Methods of point estimation; simulation;
D O I
10.1080/15598608.2017.1282895
中图分类号
学科分类号
摘要
A continuous-time homogeneous irreducible Markov chain {X(t)}, t ϵ[0,∞) taking values on {1,...,k},k <∞ is considered. Matrix (λij) of the intensity of transition λij from state i to state j is known. A unit of the sojourn time in state i gives reward βi so the total reward during time Y(t)=ʃt0βX(s)ds. The reward rates {βi}. are not known and it is necessary to estimate them. For that purpose the following statistical data on r observations are at our disposal: (1) t, observation time; (2) i, initial state X(0); (3) j, final state X(t); and (4) y, acquired reward Y(t). Two methods are used for the estimation: the weighted least-squares method and the saddle-point method for the Laplace transformation of the reward. Simulation study illustrates the suggested approaches. © 2017 Grace Scientific Publishing, LLC.
引用
收藏
页码:407 / 417
页数:10
相关论文
共 50 条
  • [1] Efficient Continuous-Time Markov Chain Estimation
    Hajiaghayi, Monir
    Kirkpatrick, Bonnie
    Wang, Liangliang
    Bouchard-Cote, Alexandre
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 1), 2014, 32
  • [2] SEQUENTIAL ESTIMATION FOR CONTINUOUS-TIME FINITE MARKOV-PROCESSES
    ADKE, SR
    MANJUNATH, SM
    STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 1984, 18 (02) : 227 - 227
  • [3] SEQUENTIAL ESTIMATION FOR CONTINUOUS-TIME FINITE MARKOV-PROCESSES
    ADKE, SR
    MANJUNATH, SM
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1984, 13 (09) : 1089 - 1106
  • [4] On the total reward variance for continuous-time Markov reward chains
    Van Dijk, Nico M.
    Sladky, Karel
    JOURNAL OF APPLIED PROBABILITY, 2006, 43 (04) : 1044 - 1052
  • [5] Nonparametric estimation of reliability and survival function for continuous-time finite Markov
    Sadek, A
    Limnios, N
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2005, 133 (01) : 1 - 21
  • [6] Transient Reward Approximation for Continuous-Time Markov Chains
    Hahn, Ernst Moritz
    Hermanns, Holger
    Wimmer, Ralf
    Becker, Bernd
    IEEE TRANSACTIONS ON RELIABILITY, 2015, 64 (04) : 1254 - 1275
  • [7] Optimal control of average reward constrained continuous-time finite Markov Decision Processes
    Feinberg, EA
    PROCEEDINGS OF THE 41ST IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 2002, : 3805 - 3810
  • [8] Distributions of reward functions on continuous-time Markov chains
    Bladt, M
    Meini, B
    Neuts, MF
    Sericola, B
    MATRIX-ANALYTIC METHODS:THEORY AND APPLICATIONS, 2002, : 39 - 62
  • [9] The deviation matrix of a continuous-time Markov chain
    Coolen-Schrijner, P
    van Doorn, EA
    PROBABILITY IN THE ENGINEERING AND INFORMATIONAL SCIENCES, 2002, 16 (03) : 351 - 366
  • [10] An Imprecise Probabilistic Estimator for the Transition Rate Matrix of a Continuous-Time Markov Chain
    Krak, Thomas
    Erreygers, Alexander
    De Bock, Jasper
    UNCERTAINTY MODELLING IN DATA SCIENCE, 2019, 832 : 124 - 132