On the Computational Complexity of Stochastic Controller Optimization in POMDPs

被引:40
|
作者
Vlassis, Nikos [1 ]
Littman, Michael L. [2 ]
Barber, David [3 ]
机构
[1] Univ Luxembourg, Luxembourg Ctr Syst Biomed, 7 Ave Hauts Fourneaux, L-4362 Esch Belval, Luxembourg
[2] Brown Univ, Dept Comp Sci, Providence, RI 02912 USA
[3] UCL, Dept Comp Sci, London WC1E 6BT, England
基金
美国国家科学基金会;
关键词
Performance; Partially observable Markov decision process; stochastic controller; bilinear program; computational complexity; Motzkin-Straus theorem; sum-of-square-roots problem; matrix fractional program; computations on polynomials; nonlinear optimization;
D O I
10.1145/2382559.2382563
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We show that the problem of finding an optimal stochastic blind controller in a Markov decision process is an NP-hard problem. The corresponding decision problem is NP-hard in PSPACE and SQRT-SUM-hard, hence placing it in NP would imply breakthroughs in long-standing open problems in computer science. Our result establishes that the more general problem of stochastic controller optimization in POMDPs is also NP-hard. Nonetheless, we outline a special case that is convex and admits efficient global solutions.
引用
收藏
页数:8
相关论文
共 50 条
  • [11] Erratum to: Computational complexity of stochastic programming problems
    Martin Dyer
    Leen Stougie
    [J]. Mathematical Programming, 2015, 153 : 723 - 725
  • [12] Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs
    Christopher Amato
    Daniel S. Bernstein
    Shlomo Zilberstein
    [J]. Autonomous Agents and Multi-Agent Systems, 2010, 21 : 293 - 320
  • [13] Networked Distributed POMDPs: A Synergy of Distributed Constraint Optimization and POMDPs
    Nair, Ranjit
    Varakantham, Pradeep
    Tambe, Milind
    Yokoo, Makoto
    [J]. 19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), 2005, : 1758 - 1760
  • [14] Computational stochastic control: Basic foundations, complexity and techniques
    Hanson, FB
    [J]. 42ND IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-6, PROCEEDINGS, 2003, : 3024 - 3029
  • [15] Stochastic algorithm computational complexity comparison on test functions
    Cesario, Nicola
    Petti, Palma
    Pirozzi, Francesco
    [J]. APPLIED SOFT COMPUTING TECHNOLOGIES: THE CHALLENGE OF COMPLEXITY, 2006, 34 : 293 - 302
  • [16] Performance Enhanced RES Current Controller with Reduced Computational Complexity
    Ali, Zunaib
    Christofides, Nicholas
    Polycarpou, Alexis
    [J]. 2017 7TH INTERNATIONAL CONFERENCE ON MODERN POWER SYSTEMS (MPS), 2017,
  • [17] Methods of reducing the computational complexity of predictive controller with induction motors
    Wrobel, K.
    Serkies, P.
    Szabat, K.
    [J]. 2015 IEEE 11TH INTERNATIONAL CONFERENCE ON POWER ELECTRONICS AND DRIVE SYSTEMS (PEDS 2015), 2015, : 1060 - 1063
  • [18] Design of a robust model predictive controller with reduced computational complexity
    Razi, M.
    Haeri, M.
    [J]. ISA TRANSACTIONS, 2014, 53 (06) : 1754 - 1759
  • [19] Local Minimax Complexity of Stochastic Convex Optimization
    Zhu, Yuancheng
    Chatterjee, Sabyasachi
    Duchi, John
    Lafferty, John
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [20] An Incremental Fuzzy Controller for Large Dec-POMDPs
    Hamzeloo, Sam
    Jahromi, Mansoor Zolghadri
    [J]. 2017 19TH CSI INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2017, : 166 - 171