On the undecidability of probabilistic planning and related stochastic optimization problems

被引:124
|
作者
Madani, O [1 ]
Hanks, S
Condon, A
机构
[1] Univ Alberta, Dept Comp Sci, Edmonton, AB TG6 2E8, Canada
[2] Univ Washington, Inst Technol, Tacoma, WA 98402 USA
[3] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T IZ4, Canada
关键词
probabilistic planning; undecidability; computability; Markov decision processes; computational complexity; infinity-horizon; partial observability; unobservability; stochastic optimization; discounted;
D O I
10.1016/S0004-3702(02)00378-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automated planning, the problem of how an agent achieves a goal given a repertoire of actions, is one of the foundational and most widely studied problems in the AI literature. The original formulation of the problem makes strong assumptions regarding the agent's knowledge and control over the world, namely that its information is complete and correct, and that the results of its actions are deterministic and known. Recent research in planning under uncertainty has endeavored to relax these assumptions, providing formal and computation models wherein the agent has incomplete or noisy information about the world and has noisy sensors and effectors. This research has mainly taken one of two approaches: extend the classical planning paradigm to a semantics that admits uncertainty, or adopt another framework for approaching the problem, most commonly the Markov Decision Process (MDP) model. This paper presents a complexity analysis of planning under uncertainty. It begins with the "probabilistic classical planning" problem, showing that problem to be formally undecidable. This fundamental result is then applied to a broad class of stochastic optimization problems, in brief any problem statement where the agent (a) operates over an infinite or indefinite time horizon, and (b) has available only probabilistic information about the system's state. Undecidability is established for policy-existence problems for partially observable infinite-horizon Markov decision processes under discounted and undiscounted total reward models, average-reward models, and state-avoidance models. The results also apply to corresponding approximation problems with undiscounted objective functions. The paper answers a significant open question raised by Papadimitriou and Tsitsiklis [Math. Oper. Res. 12 (3) (1987) 441-450] about the complexity of infinite horizon POMDPs. (C) 2003 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:5 / 34
页数:30
相关论文
共 50 条
  • [1] On the undecidability of probabilistic planning and infinite-horizon partially observable Markov decision problems
    Madani, O
    Hanks, S
    Condon, A
    [J]. SIXTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-99)/ELEVENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE (IAAI-99), 1999, : 541 - 548
  • [2] Stochastic Optimization for Reactive Power Planning Problems
    Dufour, Raphael
    Labeau, Pierre-Etienne
    Henneaux, Pierre
    Karoui, Karim
    Merckx, Christian
    [J]. 2016 IEEE INTERNATIONAL ENERGY CONFERENCE (ENERGYCON), 2016,
  • [3] Integration and computational issues in stochastic design and planning optimization problems
    Bernardo, FP
    Pistikopoulos, EN
    Saraiva, PM
    [J]. INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 1999, 38 (08) : 3056 - 3068
  • [4] Efficient optimization in stochastic production planning problems with product substitution
    Tsai, Shing Chih
    Yeh, Yingchieh
    Wang, Honggang
    Chou, Tsung Ching
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2024, 164
  • [5] PROBABILISTIC OPTIMIZATION PROBLEMS
    KIBZUN, AI
    MALYSHEV, VV
    [J]. SOVIET JOURNAL OF COMPUTER AND SYSTEMS SCIENCES, 1990, 28 (04): : 52 - 60
  • [6] UNDECIDABILITY OF BISIMILARITY FOR PETRI NETS AND SOME RELATED PROBLEMS
    JANCAR, P
    [J]. THEORETICAL COMPUTER SCIENCE, 1995, 148 (02) : 281 - 301
  • [7] Estimation of the Necessary Sample Size for Approximation of Stochastic Optimization Problems with Probabilistic Criteria
    Ivanov, Sergey V.
    Zhenevskaya, Irina D.
    [J]. MATHEMATICAL OPTIMIZATION THEORY AND OPERATIONS RESEARCH, 2019, 11548 : 552 - 564
  • [8] SOLUTION OF OUTPUT FEEDBACK STABILIZATION AND RELATED PROBLEMS BY STOCHASTIC OPTIMIZATION
    LUUS, R
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1975, 20 (06) : 820 - 821
  • [9] THE UNDECIDABILITY OF THE DISJUNCTION PROPERTY OF PROPOSITIONAL LOGICS AND OTHER RELATED PROBLEMS
    CHAGROV, A
    ZAKHARYASCHEV, M
    [J]. JOURNAL OF SYMBOLIC LOGIC, 1993, 58 (03) : 967 - 1002
  • [10] Automaton semigroups and groups: On the undecidability of problems related to freeness and finiteness
    Daniele D’Angeli
    Emanuele Rodaro
    Jan Philipp Wächter
    [J]. Israel Journal of Mathematics, 2020, 237 : 15 - 52