The Policy Iteration Algorithm for Average Continuous Control of Piecewise Deterministic Markov Processes

被引:3
|
作者
Costa, O. L. V. [1 ]
Dufour, F. [2 ]
机构
[1] Univ Sao Paulo, Escola Politecn, Dept Engn Telecomunicacoes & Controle, BR-05508900 Sao Paulo, Brazil
[2] Univ Bordeaux 1, IMB, Team CQFD, INRIA Bordeaux Sud Ouest, F-33405 Talence, France
来源
APPLIED MATHEMATICS AND OPTIMIZATION | 2010年 / 62卷 / 02期
关键词
Piecewise-deterministic Markov Processes; Continuous-time; Long-run average cost; Optimal control; Integro-differential optimality inequation; Policy iteration algorithm; DECISION-PROCESSES; OPTIMALITY;
D O I
10.1007/s00245-010-9099-4
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
The main goal of this paper is to apply the so-called policy iteration algorithm (PIA) for the long run average continuous control problem of piecewise deterministic Markov processes (PDMP's) taking values in a general Borel space and with compact action space depending on the state variable. In order to do that we first derive some important properties for a pseudo-Poisson equation associated to the problem. In the sequence it is shown that the convergence of the PIA to a solution satisfying the optimality equation holds under some classical hypotheses and that this optimal solution yields to an optimal control strategy for the average control problem for the continuous-time PDMP in a feedback form.
引用
收藏
页码:185 / 204
页数:20
相关论文
共 50 条
  • [1] The Policy Iteration Algorithm for Average Continuous Control of Piecewise Deterministic Markov Processes
    Costa, O. L. V.
    Dufour, F.
    PROCEEDINGS OF THE 48TH IEEE CONFERENCE ON DECISION AND CONTROL, 2009 HELD JOINTLY WITH THE 2009 28TH CHINESE CONTROL CONFERENCE (CDC/CCC 2009), 2009, : 506 - 511
  • [2] The Policy Iteration Algorithm for Average Continuous Control of Piecewise Deterministic Markov Processes
    O. L. V. Costa
    F. Dufour
    Applied Mathematics & Optimization, 2010, 62 : 185 - 204
  • [3] AVERAGE CONTINUOUS CONTROL OF PIECEWISE DETERMINISTIC MARKOV PROCESSES
    Costa, O. L. V.
    Dufour, F.
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2010, 48 (07) : 4262 - 4291
  • [4] Average continuous control of piecewise deterministic Markov processes
    Costa, O. L. V.
    Dufour, F.
    2005 44th IEEE Conference on Decision and Control & European Control Conference, Vols 1-8, 2005, : 1712 - 1717
  • [5] The Vanishing Approach for the Average Continuous Control of Piecewise Deterministic Markov Processes
    Costa, O. L. V.
    Dufour, F.
    47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 3817 - 3822
  • [6] THE VANISHING DISCOUNT APPROACH FOR THE AVERAGE CONTINUOUS CONTROL OF PIECEWISE DETERMINISTIC MARKOV PROCESSES
    Costa, O. L. V.
    Dufour, F.
    JOURNAL OF APPLIED PROBABILITY, 2009, 46 (04) : 1157 - 1183
  • [7] Adaptive average control for piecewise deterministic Markov processes
    Costa, O. L. V.
    Dufour, F.
    Genadot, A.
    SYSTEMS & CONTROL LETTERS, 2024, 192
  • [8] Singular Perturbation for the Discounted Continuous Control of Piecewise Deterministic Markov Processes
    Costa, O. L. V.
    Dufour, F.
    APPLIED MATHEMATICS AND OPTIMIZATION, 2011, 63 (03): : 357 - 384
  • [9] Singular Perturbation for the Discounted Continuous Control of Piecewise Deterministic Markov Processes
    Costa, O. L. V.
    Dufour, F.
    49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 1436 - 1441
  • [10] Singular Perturbation for the Discounted Continuous Control of Piecewise Deterministic Markov Processes
    O. L. V. Costa
    F. Dufour
    Applied Mathematics & Optimization, 2011, 63 : 357 - 384