On the undecidability of probabilistic planning and related stochastic optimization problems

Cited by: 124

Authors:

Madani, O [1]

Hanks, S

Condon, A

Affiliations:

[1] Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada

[2] Univ Washington, Inst Technol, Tacoma, WA 98402 USA

[3] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1Z4, Canada

Source:

ARTIFICIAL INTELLIGENCE | 2003 / Vol. 147 / Issue 1-2

Keywords:

probabilistic planning; undecidability; computability; Markov decision processes; computational complexity; infinite-horizon; partial observability; unobservability; stochastic optimization; discounted;

DOI:

10.1016/S0004-3702(02)00378-8

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Automated planning, the problem of how an agent achieves a goal given a repertoire of actions, is one of the foundational and most widely studied problems in the AI literature. The original formulation of the problem makes strong assumptions regarding the agent's knowledge and control over the world, namely that its information is complete and correct, and that the results of its actions are deterministic and known. Recent research in planning under uncertainty has endeavored to relax these assumptions, providing formal and computational models wherein the agent has incomplete or noisy information about the world and has noisy sensors and effectors. This research has mainly taken one of two approaches: extend the classical planning paradigm to a semantics that admits uncertainty, or adopt another framework for approaching the problem, most commonly the Markov Decision Process (MDP) model. This paper presents a complexity analysis of planning under uncertainty. It begins with the "probabilistic classical planning" problem, showing that problem to be formally undecidable. This fundamental result is then applied to a broad class of stochastic optimization problems, in brief any problem statement where the agent (a) operates over an infinite or indefinite time horizon, and (b) has available only probabilistic information about the system's state. Undecidability is established for policy-existence problems for partially observable infinite-horizon Markov decision processes under discounted and undiscounted total reward models, average-reward models, and state-avoidance models. The results also apply to corresponding approximation problems with undiscounted objective functions. The paper answers a significant open question raised by Papadimitriou and Tsitsiklis [Math. Oper. Res. 12 (3) (1987) 441-450] about the complexity of infinite-horizon POMDPs. © 2003 Elsevier Science B.V. All rights reserved.

Pages: 5-34

Page count: 30
