The Policy Iteration Algorithm for Average Continuous Control of Piecewise Deterministic Markov Processes

被引:3
|
作者
Costa, O. L. V. [1 ]
Dufour, F. [2 ]
机构
[1] Univ Sao Paulo, Escola Politecn, Dept Engn Telecomunicacoes & Controle, BR-05508900 Sao Paulo, Brazil
[2] Univ Bordeaux 1, IMB, Team CQFD, INRIA Bordeaux Sud Ouest, F-33405 Talence, France
来源
APPLIED MATHEMATICS AND OPTIMIZATION | 2010年 / 62卷 / 02期
关键词
Piecewise-deterministic Markov Processes; Continuous-time; Long-run average cost; Optimal control; Integro-differential optimality inequation; Policy iteration algorithm; DECISION-PROCESSES; OPTIMALITY;
D O I
10.1007/s00245-010-9099-4
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
The main goal of this paper is to apply the so-called policy iteration algorithm (PIA) for the long run average continuous control problem of piecewise deterministic Markov processes (PDMP's) taking values in a general Borel space and with compact action space depending on the state variable. In order to do that we first derive some important properties for a pseudo-Poisson equation associated to the problem. In the sequence it is shown that the convergence of the PIA to a solution satisfying the optimality equation holds under some classical hypotheses and that this optimal solution yields to an optimal control strategy for the average control problem for the continuous-time PDMP in a feedback form.
引用
收藏
页码:185 / 204
页数:20
相关论文
共 50 条
  • [41] On time reversal of piecewise deterministic Markov processes
    Loepker, Andreas
    Palmowski, Zbigniew
    ELECTRONIC JOURNAL OF PROBABILITY, 2013, 18 : 1 - 29
  • [42] Densities for piecewise deterministic Markov processes with boundary
    Gwizdz, Piotr
    Tyran-Kaminska, Marta
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2019, 479 (01) : 384 - 425
  • [43] Reachability questions in piecewise deterministic Markov processes
    Bujorianu, ML
    Lygeros, J
    HYBRID SYSTEMS: COMPUTATION AND CONTROL, PROCEEDINGS, 2003, 2623 : 126 - 140
  • [44] Stability and Ergodicity of Piecewise Deterministic Markov Processes
    Costa, O. L. V.
    Dufour, F.
    47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 1525 - 1530
  • [45] Piecewise Deterministic Markov Processes in Biological Models
    Rudnicki, Ryszard
    Tyran-Kaminska, Marta
    SEMIGROUPS OF OPERATORS - THEORY AND APPLICATIONS, 2015, 113 : 235 - 255
  • [46] Stability of piecewise-deterministic Markov processes
    Dufour, François
    Costa, Oswaldo L. V.
    SIAM Journal on Control and Optimization, 37 (05): : 1483 - 1502
  • [47] Stability of piecewise-deterministic Markov processes
    Dufour, F
    Costa, OLV
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1999, 37 (05) : 1483 - 1502
  • [48] Demographic noise and piecewise deterministic Markov processes
    Realpe-Gomez, John
    Galla, Tobias
    McKane, Alan J.
    PHYSICAL REVIEW E, 2012, 86 (01):
  • [49] Stability and ergodicity of piecewise deterministic Markov processes
    Costa, O. L. V.
    Dufour, F.
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2008, 47 (02) : 1053 - 1077
  • [50] Infinite dimensional Piecewise Deterministic Markov Processes
    Dobson, Paul
    Bierkens, Joris
    STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 2023, 165 : 337 - 396