The complexity of decentralized control of Markov decision processes

Cited by: 550
Authors
Bernstein, DS [1]
Givan, R
Immerman, N
Zilberstein, S
Affiliations
[1] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA
[2] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
Keywords
computational complexity; Markov decision process; decentralized control
DOI
10.1287/moor.27.4.819.297
Chinese Library Classification (CLC)
C93 [Management Science]; O22 [Operations Research]
Discipline classification codes
070105; 12; 1201; 1202; 120202
Abstract
We consider decentralized control of Markov decision processes and give complexity bounds on the worst-case running time for algorithms that find optimal solutions. Generalizations of both the fully observable case and the partially observable case that allow for decentralized control are described. For even two agents, the finite-horizon problems corresponding to both of these models are hard for nondeterministic exponential time. These complexity results illustrate a fundamental difference between centralized and decentralized control of Markov decision processes. In contrast to the problems involving centralized control, the problems we consider provably do not admit polynomial-time algorithms. Furthermore, assuming EXP ≠ NEXP, the problems require superexponential time to solve in the worst case.
Pages: 819-840
Page count: 22