The Policy Iteration Algorithm for Average Continuous Control of Piecewise Deterministic Markov Processes

被引：3

作者：

Costa, O. L. V. ^{[1
]}

Dufour, F. ^{[2
]}

机构：

[1] Univ Sao Paulo, Escola Politecn, Dept Engn Telecomunicacoes & Controle, BR-05508900 Sao Paulo, Brazil

[2] Univ Bordeaux 1, IMB, Team CQFD, INRIA Bordeaux Sud Ouest, F-33405 Talence, France

来源：

APPLIED MATHEMATICS AND OPTIMIZATION | 2010年 / 62卷 / 02期

关键词：

Piecewise-deterministic Markov Processes; Continuous-time; Long-run average cost; Optimal control; Integro-differential optimality inequation; Policy iteration algorithm; DECISION-PROCESSES; OPTIMALITY;

D O I：

10.1007/s00245-010-9099-4

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

The main goal of this paper is to apply the so-called policy iteration algorithm (PIA) for the long run average continuous control problem of piecewise deterministic Markov processes (PDMP's) taking values in a general Borel space and with compact action space depending on the state variable. In order to do that we first derive some important properties for a pseudo-Poisson equation associated to the problem. In the sequence it is shown that the convergence of the PIA to a solution satisfying the optimality equation holds under some classical hypotheses and that this optimal solution yields to an optimal control strategy for the average control problem for the continuous-time PDMP in a feedback form.

引用

页码：185 / 204

页数：20

共 50 条

[41] On time reversal of piecewise deterministic Markov processes
Loepker, Andreas
Palmowski, Zbigniew
ELECTRONIC JOURNAL OF PROBABILITY, 2013, 18 : 1 - 29
[42] Densities for piecewise deterministic Markov processes with boundary
Gwizdz, Piotr
Tyran-Kaminska, Marta
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2019, 479 (01) : 384 - 425
[43] Reachability questions in piecewise deterministic Markov processes
Bujorianu, ML
Lygeros, J
HYBRID SYSTEMS: COMPUTATION AND CONTROL, PROCEEDINGS, 2003, 2623 : 126 - 140
[44] Stability and Ergodicity of Piecewise Deterministic Markov Processes
Costa, O. L. V.
Dufour, F.
47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 1525 - 1530
[45] Piecewise Deterministic Markov Processes in Biological Models
Rudnicki, Ryszard
Tyran-Kaminska, Marta
SEMIGROUPS OF OPERATORS - THEORY AND APPLICATIONS, 2015, 113 : 235 - 255
[46] Stability of piecewise-deterministic Markov processes
Dufour, François
Costa, Oswaldo L. V.
SIAM Journal on Control and Optimization, 37 (05): : 1483 - 1502
[47] Stability of piecewise-deterministic Markov processes
Dufour, F
Costa, OLV
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1999, 37 (05) : 1483 - 1502
[48] Demographic noise and piecewise deterministic Markov processes
Realpe-Gomez, John
Galla, Tobias
McKane, Alan J.
PHYSICAL REVIEW E, 2012, 86 (01):
[49] Stability and ergodicity of piecewise deterministic Markov processes
Costa, O. L. V.
Dufour, F.
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2008, 47 (02) : 1053 - 1077
[50] Infinite dimensional Piecewise Deterministic Markov Processes
Dobson, Paul
Bierkens, Joris
STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 2023, 165 : 337 - 396

← 1 2 3 4 5 →