Constrained Partially Observed Markov Decision Processes With Probabilistic Criteria for Adaptive Sequential Detection

被引:2
|
作者
Chen, Richard C. [1 ]
Wagner, Kevin [1 ]
Blankenship, Gilmer L. [2 ]
机构
[1] USN, Res Lab, Washington, DC 20375 USA
[2] Univ Maryland, Dept Elect Engn, College Pk, MD 20742 USA
关键词
Dynamic programming; partially observed Markov decision process; probabilistic criteria; target confirmation;
D O I
10.1109/TAC.2012.2208312
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Dynamic programming equations are derived which characterize the optimal value functions for a partially observed constrained Markov decision process problem with both total cost and probabilistic criteria. More specifically, the goal is to minimize an expected total cost subject to a constraint on the probability that another total cost exceeds a prescribed threshold. The Markov decision process is partially observed, but it is assumed that the constraint costs are available to the controller, i.e., they are fully observed. The problem is motivated by an adaptive sequential detection application. The application of the dynamic programming results to optimal adaptive truncated sequential detection is demonstrated using an example involving the optimization of a radar detection process.
引用
收藏
页码:487 / 493
页数:8
相关论文
共 50 条
  • [1] On the adaptive control of a class of partially observed Markov decision processes
    Hsu, Shun-Pin
    Arapostathis, Ari
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2011, 380 (01) : 1 - 9
  • [2] On the Adaptive Control of a Class of Partially Observed Markov Decision Processes
    Hsu, Shun-Pin
    Chuang, Dong-Ming
    Arapostathis, Ari
    2009 AMERICAN CONTROL CONFERENCE, VOLS 1-9, 2009, : 5635 - +
  • [3] On the complexity of partially observed Markov decision processes
    Burago, D
    deRougemont, M
    Slissenko, A
    THEORETICAL COMPUTER SCIENCE, 1996, 157 (02) : 161 - 183
  • [4] Experimental Design for Partially Observed Markov Decision Processes
    Thorbergsson, Leifur
    Hooker, Giles
    SIAM-ASA JOURNAL ON UNCERTAINTY QUANTIFICATION, 2018, 6 (02): : 549 - 567
  • [5] Partially observed Markov decision processes with binomial observations
    Ben-Zvi, Tal
    Grosfeld-Nir, Abraham
    OPERATIONS RESEARCH LETTERS, 2013, 41 (02) : 201 - 206
  • [6] Constrained Markov decision processes with first passage criteria
    Yonghui Huang
    Qingda Wei
    Xianping Guo
    Annals of Operations Research, 2013, 206 : 197 - 219
  • [7] Constrained Markov decision processes with first passage criteria
    Huang, Yonghui
    Wei, Qingda
    Guo, Xianping
    ANNALS OF OPERATIONS RESEARCH, 2013, 206 (01) : 197 - 219
  • [8] Information Relaxation Bounds for Partially Observed Markov Decision Processes
    Haugh, Martin B.
    Lacedelli, Octavio Ruiz
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (08) : 3256 - 3271
  • [9] Entropy-Regularized Partially Observed Markov Decision Processes
    Molloy, Timothy L.
    Nair, Girish N.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (01) : 379 - 386
  • [10] SOLUTION PROCEDURES FOR PARTIALLY OBSERVED MARKOV DECISION-PROCESSES
    WHITE, CC
    SCHERER, WT
    OPERATIONS RESEARCH, 1989, 37 (05) : 791 - 797