Constrained Partially Observed Markov Decision Processes With Probabilistic Criteria for Adaptive Sequential Detection

被引：2

作者：

Chen, Richard C. ^{[1
]}

Wagner, Kevin ^{[1
]}

Blankenship, Gilmer L. ^{[2
]}

机构：

[1] USN, Res Lab, Washington, DC 20375 USA

[2] Univ Maryland, Dept Elect Engn, College Pk, MD 20742 USA

来源：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 2013年 / 58卷 / 02期

关键词：

Dynamic programming; partially observed Markov decision process; probabilistic criteria; target confirmation;

D O I：

10.1109/TAC.2012.2208312

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Dynamic programming equations are derived which characterize the optimal value functions for a partially observed constrained Markov decision process problem with both total cost and probabilistic criteria. More specifically, the goal is to minimize an expected total cost subject to a constraint on the probability that another total cost exceeds a prescribed threshold. The Markov decision process is partially observed, but it is assumed that the constraint costs are available to the controller, i.e., they are fully observed. The problem is motivated by an adaptive sequential detection application. The application of the dynamic programming results to optimal adaptive truncated sequential detection is demonstrated using an example involving the optimization of a radar detection process.

引用

页码：487 / 493

页数：8

共 50 条

[31] PARTIALLY OBSERVED CONTROL OF MARKOV-PROCESSES
HIJAB, O
LECTURE NOTES IN CONTROL AND INFORMATION SCIENCES, 1992, 177 : 151 - 158
[32] ESTIMATION FOR PARTIALLY OBSERVED MARKOV-PROCESSES
THOMPSON, ME
KASEKE, TN
STOCHASTIC HYDROLOGY AND HYDRAULICS, 1995, 9 (01): : 33 - 47
[33] Learning in Constrained Markov Decision Processes
Singh, Rahul
Gupta, Abhishek
Shroff, Ness B.
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (01): : 441 - 453
[34] Active Trajectory Estimation for Partially Observed Markov Decision Processes via Conditional Entropy
Molloy, Timothy L.
Nair, Girish N.
2021 EUROPEAN CONTROL CONFERENCE (ECC), 2021, : 385 - 391
[35] FINITE-MEMORY SUBOPTIMAL DESIGN FOR PARTIALLY OBSERVED MARKOV DECISION-PROCESSES
WHITE, CC
SCHERER, WT
OPERATIONS RESEARCH, 1994, 42 (03) : 439 - 455
[36] Near Optimality of Finite Memory Feedback Policies in Partially Observed Markov Decision Processes
Kara, Ali Devran
Yuksel, Serdar
JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
[37] Near Optimality of Finite Memory Feedback Policies in Partially Observed Markov Decision Processes
Kara, Ali Devran
Yüksel, Serdar
Journal of Machine Learning Research, 2022, 23
[38] Total reward criteria for unconstrained/constrained continuous-time Markov decision processes
Xianping Guo
Lanlan Zhang
Journal of Systems Science and Complexity, 2011, 24 : 491 - 505
[39] Total reward criteria for unconstrained/constrained continuous-time Markov decision processes
Guo, Xianping
Zhang, Lanlan
JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2011, 24 (03) : 491 - 505
[40] TOTAL REWARD CRITERIA FOR UNCONSTRAINED/CONSTRAINED CONTINUOUS-TIME MARKOV DECISION PROCESSES
Xianping GUO School of Mathematics and Computational Science
Journal of Systems Science & Complexity, 2011, 24 (03) : 491 - 505

← 1 2 3 4 5 →