A SURVEY OF ALGORITHMIC METHODS FOR PARTIALLY OBSERVED MARKOV DECISION PROCESSES

Cited: 296
Authors
Lovejoy, William S. [1 ]
Affiliation
[1] Stanford Univ, Grad Sch Business, Stanford, CA 94305 USA
Keywords
DOI
10.1007/BF02055574
CLC classification
C93 [Management]; O22 [Operations Research];
Discipline codes
070105; 12; 1201; 1202; 120202;
Abstract
A partially observed Markov decision process (POMDP) is a generalization of a Markov decision process that allows for incomplete information regarding the state of the system. The significant applied potential for such processes remains largely unrealized, due to a historical lack of tractable solution methodologies. This paper reviews some of the current algorithmic alternatives for solving discrete-time, finite POMDPs over both finite and infinite horizons. The major impediment to exact solution is that, even with a finite set of internal system states, the set of possible information states is uncountably infinite. Finite algorithms are theoretically available for exact solution of the finite horizon problem, but these are computationally intractable for even modest-sized problems. Several approximation methodologies are reviewed that have the potential to generate computationally feasible, high-precision solutions.
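To make concrete why the information-state set is uncountably infinite, here is a minimal sketch (not from the paper; all names and numbers are illustrative assumptions) of the standard Bayesian belief update for a discrete POMDP. The information state is the belief vector b, a probability distribution over the finite system states; since b ranges over the continuous probability simplex, even a two-state system admits uncountably many information states.

```python
def belief_update(b, a, o, T, O):
    """One-step Bayesian belief update for a discrete POMDP.

    b: current belief, b[s] = P(state = s)
    a: action taken; o: observation received
    T: transition model, T[a][s][s2] = P(s2 | s, a)
    O: observation model, O[a][s2][o] = P(o | s2, a)
    Returns the posterior belief b'(s2) ∝ O[a][s2][o] * sum_s T[a][s][s2] * b[s].
    """
    n = len(b)
    unnorm = [O[a][s2][o] * sum(T[a][s][s2] * b[s] for s in range(n))
              for s2 in range(n)]
    norm = sum(unnorm)  # probability of observing o; assumed > 0 here
    return [x / norm for x in unnorm]

# Tiny two-state, one-action example (numbers invented for illustration).
T = {0: [[0.9, 0.1], [0.2, 0.8]]}   # T[a][s][s2]
O = {0: [[0.7, 0.3], [0.4, 0.6]]}   # O[a][s2][o]
b0 = [0.5, 0.5]
b1 = belief_update(b0, a=0, o=1, T=T, O=O)
```

Iterating this update along any observation sequence traces a path through the simplex, which is why exact finite-horizon algorithms must represent the value function over a continuous domain (typically as a finite set of supporting hyperplanes) rather than over a finite state list.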
Pages: 47-65
Page count: 19