Constrained Partially Observed Markov Decision Processes With Probabilistic Criteria for Adaptive Sequential Detection

被引:2
|
作者
Chen, Richard C. [1 ]
Wagner, Kevin [1 ]
Blankenship, Gilmer L. [2 ]
机构
[1] USN, Res Lab, Washington, DC 20375 USA
[2] Univ Maryland, Dept Elect Engn, College Pk, MD 20742 USA
关键词
Dynamic programming; partially observed Markov decision process; probabilistic criteria; target confirmation;
D O I
10.1109/TAC.2012.2208312
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Dynamic programming equations are derived which characterize the optimal value functions for a partially observed constrained Markov decision process problem with both total cost and probabilistic criteria. More specifically, the goal is to minimize an expected total cost subject to a constraint on the probability that another total cost exceeds a prescribed threshold. The Markov decision process is partially observed, but it is assumed that the constraint costs are available to the controller, i.e., they are fully observed. The problem is motivated by an adaptive sequential detection application. The application of the dynamic programming results to optimal adaptive truncated sequential detection is demonstrated using an example involving the optimization of a radar detection process.
引用
收藏
页码:487 / 493
页数:8
相关论文
共 50 条
  • [21] Bayesian adaptive control of discrete time partially observed Markov processes
    Stettner, L
    STOCHASTIC THEORY AND CONTROL, PROCEEDINGS, 2002, 280 : 435 - 446
  • [22] Constrained continuous-time Markov decision processes with average criteria
    Lanlan Zhang
    Xianping Guo
    Mathematical Methods of Operations Research, 2008, 67 : 323 - 340
  • [23] On constrained Markov decision processes
    Department of Econometrics, University of Sydney, Sydney, NSW 2006, Australia
    不详
    Oper Res Lett, 1 (25-28):
  • [24] On constrained Markov decision processes
    Haviv, M
    OPERATIONS RESEARCH LETTERS, 1996, 19 (01) : 25 - 28
  • [25] Constrained continuous-time Markov decision processes with average criteria
    Zhang, Lanlan
    Guo, Xianping
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2008, 67 (02) : 323 - 340
  • [26] ADAPTIVE COMPRESSIVE SAMPLING USING PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES
    Zahedi, Ramin
    Krakow, Lucas W.
    Chong, Edwin K. P.
    Pezeshki, Ali
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5269 - 5272
  • [28] Probabilistic Hyperproperties of Markov Decision Processes
    Dimitrova, Rayna
    Finkbeiner, Bernd
    Torfah, Hazem
    AUTOMATED TECHNOLOGY FOR VERIFICATION AND ANALYSIS (ATVA 2020), 2020, 12302 : 484 - 500
  • [29] Probabilistic opacity for Markov decision processes
    Berard, Beatrice
    Chatterjee, Krishnendu
    Sznajder, Nathalie
    INFORMATION PROCESSING LETTERS, 2015, 115 (01) : 52 - 59
  • [30] PARTIALLY OBSERVED CONTROL OF MARKOV-PROCESSES
    HIJAB, O
    LECTURE NOTES IN CONTROL AND INFORMATION SCIENCES, 1992, 184 : 248 - 255