Optimal Control of Probability on a Target Set for Continuous-Time Markov Chains

被引:1
|
作者
Ma, Chenglin [1 ]
Zhao, Huaizhong [2 ,3 ]
机构
[1] Shandong Univ, Sch Math, Jinan 250100, Peoples R China
[2] Univ Durham, Dept Math Sci, Durham DH1 3LE, England
[3] Shandong Univ, Res Ctr Math & Interdisciplinary Sci, Qingdao 266237, Peoples R China
关键词
Markov processes; Optimal control; Dynamic programming; Process control; Games; Aerospace electronics; Safety; Controlled Markov chains; dynamic programming principle (DPP); Hamilton-Jacobi-Bellman (HJB) equation; optimal controls; risk probability criteria; DECISION-PROCESSES; RISK PROBABILITY; CRITERION;
D O I
10.1109/TAC.2023.3278789
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, a stochastic optimal control problem is considered for a continuous-time Markov chain taking values in a denumerable state space over a fixed finite horizon. The optimality criterion is the probability that the process remains in a target set before and at a certain time. The optimal value is a superadditive capacity of target sets. Under some minor assumptions for the controlled Markov process, we establish the dynamic programming principle, based on which we prove that the value function is a classical solution of the Hamilton-Jacobi-Bellman (HJB) equation on a discrete lattice space. We then prove that there exists an optimal deterministic Markov control under the compactness assumption of control domain. We further prove that the value function is the unique solution of the HJB equation. We also consider the case starting from the outside of the target set and give the corresponding results. Finally, we apply our results to two examples.
引用
收藏
页码:1202 / 1209
页数:8
相关论文
共 50 条
  • [1] Optimal Control of Probability on a Target Set for Continuous-Time Markov Chains
    Shandong University, School of Mathematics, Jinan
    250100, China
    不详
    DH1 3LE, United Kingdom
    不详
    266237, China
    [J]. IEEE Trans Autom Control, 2 (1202-1209):
  • [2] OPTIMAL CONTROL OF CONTINUOUS-TIME MARKOV CHAINS WITH NOISE-FREE OBSERVATION
    Calvia, A.
    [J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2018, 56 (03) : 2000 - 2035
  • [3] Control of continuous-time Markov chains with safety constraints
    Hsu, Shun-Pin
    [J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2012, 22 (05) : 492 - 503
  • [4] Perturbations of continuous-time Markov chains
    Li, Pei-Sen
    [J]. STATISTICS & PROBABILITY LETTERS, 2017, 125 : 17 - 24
  • [5] Imprecise continuous-time Markov chains
    Krak, Thomas
    De Bock, Jasper
    Siebes, Arno
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2017, 88 : 452 - 528
  • [6] Filtering of continuous-time Markov chains
    Aggoun, L
    Benkherouf, L
    Tadj, L
    [J]. MATHEMATICAL AND COMPUTER MODELLING, 1997, 26 (12) : 73 - 83
  • [7] Continuous-time controlled Markov chains
    Guo, XP
    Hernández-Lerma, O
    [J]. ANNALS OF APPLIED PROBABILITY, 2003, 13 (01): : 363 - 388
  • [8] Integrals for continuous-time Markov chains
    Pollett, PK
    [J]. MATHEMATICAL BIOSCIENCES, 2003, 182 (02) : 213 - 225
  • [9] Convergence properties of continuous-time Markov chains with application to target search
    Jun, M
    Jeffcoat, DE
    [J]. ACC: PROCEEDINGS OF THE 2005 AMERICAN CONTROL CONFERENCE, VOLS 1-7, 2005, : 662 - 667
  • [10] Ergodic Control of Continuous-Time Markov Chains with Pathwise Constraints
    Prieto-Rumeau, Tomas
    Hernandez-Lerma, Onesimo
    [J]. PROCEEDINGS OF THE 48TH IEEE CONFERENCE ON DECISION AND CONTROL, 2009 HELD JOINTLY WITH THE 2009 28TH CHINESE CONTROL CONFERENCE (CDC/CCC 2009), 2009, : 2640 - 2645