CONSTRAINED AND UNCONSTRAINED OPTIMAL DISCOUNTED CONTROL OF PIECEWISE DETERMINISTIC MARKOV PROCESSES

Cited by: 21
Authors
Costa, O. L. V. [1]
Dufour, F. [2]
Piunovskiy, A. B. [3]
Affiliations
[1] Univ Sao Paulo, Escola Politecn, Dept Engn Telecomunicacoes & Controle, BR-05508900 Sao Paulo, Brazil
[2] Univ Bordeaux, Inst Polytech Bordeaux, INRIA Bordeaux Sud Ouest, Team CQFD, IMB, Inst Math Bordeaux, Bordeaux, France
[3] Univ Liverpool, Dept Math Sci, Liverpool L69 7ZL, Merseyside, England
Funding
São Paulo Research Foundation (FAPESP), Brazil; Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
unconstrained/constrained control problem; continuous control; piecewise deterministic Markov process; continuous-time Markov decision process; discounted cost; DISCRETE-TIME
DOI
10.1137/140996380
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The main goal of this paper is to study the infinite-horizon expected discounted continuous-time optimal control problem of piecewise deterministic Markov processes with the control acting continuously on the jump intensity lambda and on the transition measure Q of the process but not on the deterministic flow phi. The contributions of the paper are for the unconstrained as well as the constrained cases. The set of admissible control strategies is assumed to be formed by policies, possibly randomized and depending on the history of the process, taking values in a set-valued action space. For the unconstrained case we provide sufficient conditions based on the three local characteristics of the process phi, lambda, Q and the semicontinuity properties of the set-valued action space, to guarantee the existence and uniqueness of a solution of the integro-differential optimality equation (the so-called Bellman-Hamilton-Jacobi equation) as well as the existence of an optimal (and delta-optimal, as well) deterministic stationary control strategy for the problem. For the constrained case we show that the values of the constrained control problem and an associated infinite-dimensional linear programming (LP) problem are the same, and moreover we provide sufficient conditions for the solvability of the LP problem as well as for the existence of an optimal feasible randomized stationary control strategy for the constrained problem.
Pages: 1444 - 1474
Number of pages: 31
Related papers
50 in total
  • [41] Reachability questions in piecewise deterministic Markov processes
    Bujorianu, ML
    Lygeros, J
    HYBRID SYSTEMS: COMPUTATION AND CONTROL, PROCEEDINGS, 2003, 2623 : 126 - 140
  • [42] On time reversal of piecewise deterministic Markov processes
    Loepker, Andreas
    Palmowski, Zbigniew
    ELECTRONIC JOURNAL OF PROBABILITY, 2013, 18 : 1 - 29
  • [43] Piecewise deterministic Markov processes and their invariant measures
    Durmus, Alain
    Guillin, Arnaud
    Monmarche, Pierre
    ANNALES DE L INSTITUT HENRI POINCARE-PROBABILITES ET STATISTIQUES, 2021, 57 (03) : 1442 - 1475
  • [44] Densities for piecewise deterministic Markov processes with boundary
    Gwizdz, Piotr
    Tyran-Kaminska, Marta
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2019, 479 (01) : 384 - 425
  • [45] Stability of piecewise-deterministic Markov processes
    Dufour, F
    Costa, OLV
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1999, 37 (05) : 1483 - 1502
  • [46] Stability and ergodicity of piecewise deterministic Markov processes
    Costa, O. L. V.
    Dufour, F.
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2008, 47 (02) : 1053 - 1077
  • [47] Demographic noise and piecewise deterministic Markov processes
    Realpe-Gomez, John
    Galla, Tobias
    McKane, Alan J.
    PHYSICAL REVIEW E, 2012, 86 (01):
  • [48] Infinite dimensional Piecewise Deterministic Markov Processes
    Dobson, Paul
    Bierkens, Joris
    STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 2023, 165 : 337 - 396
  • [49] OPTIMAL CONTROL OF INFINITE-DIMENSIONAL PIECEWISE DETERMINISTIC MARKOV PROCESSES AND APPLICATION TO THE CONTROL OF NEURONAL DYNAMICS VIA OPTOGENETICS
    Renault, Vincent
    Thieullen, Michele
    Trelat, Emmanuel
    NETWORKS AND HETEROGENEOUS MEDIA, 2017, 12 (03) : 417 - 459
  • [50] Constrained discounted semi-Markov decision processes
    Feinberg, EA
    MARKOV PROCESSES AND CONTROLLED MARKOV CHAINS, 2002, : 233 - 244