The complexity of decentralized control of Markov decision processes

被引:530
|
作者
Bernstein, DS [1 ]
Givan, R
Immerman, N
Zilberstein, S
机构
[1] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA
[2] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
关键词
computational complexity; Markov decision process; decentralized control;
D O I
10.1287/moor.27.4.819.297
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
We consider decentralized control of Markov decision processes and give complexity bounds on the worst-case running time for algorithms that find optimal solutions. Generalizations of both the fully observable case and the partially observable case that allow for decentralized control are described. For even two agents, the finite-horizon problems corresponding to both of these models are hard for nondeterministic exponential time. These complexity results illustrate a fundamental difference between centralized and decentralized control of Markov decision processes. In contrast to the problems involving centralized control, the problems we consider provably do not admit polynomial-time algorithms. Furthermore, assuming EXP not equal NEXP, the problems require superexponential time to solve in the worst case.
引用
收藏
页码:819 / 840
页数:22
相关论文
共 50 条
  • [1] Decentralized Control of Partially Observable Markov Decision Processes
    Amato, Christopher
    Chowdhary, Girish
    Geramifard, Alborz
    Uere, N. Kemal
    Kochenderfer, Mykel J.
    [J]. 2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 2398 - 2405
  • [2] Policy Iteration for Decentralized Control of Markov Decision Processes
    Bernstein, Daniel S.
    Amato, Christopher
    Hansen, Eric A.
    Zilberstein, Shlomo
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2009, 34 : 89 - 132
  • [3] UAV Formation Shape Control via Decentralized Markov Decision Processes
    Azam, Md Ali
    Mittelmann, Hans D.
    Ragi, Shankarachary
    [J]. ALGORITHMS, 2021, 14 (03)
  • [4] Relaxation for Constrained Decentralized Markov Decision Processes
    Xu, Jie
    [J]. AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 1313 - 1314
  • [5] Complexity issues in Markov decision processes
    Goldsmith, J
    Mundhenk, M
    [J]. THIRTEENTH ANNUAL IEEE CONFERENCE ON COMPUTATIONAL COMPLEXITY - PROCEEDINGS, 1998, : 272 - 280
  • [6] The complexity of synchronizing Markov decision processes
    Doyen, Laurent
    Massart, Thierry
    Shirmohammadi, Mahsa
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2019, 100 : 96 - 129
  • [7] THE COMPLEXITY OF MARKOV DECISION-PROCESSES
    PAPADIMITRIOU, CH
    TSITSIKLIS, JN
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 1987, 12 (03) : 441 - 450
  • [8] Fuzzy Reinforcement Learning Control for Decentralized Partially Observable Markov Decision Processes
    Sharma, Rajneesh
    Spaan, Matthijs T. J.
    [J]. IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 1422 - 1429
  • [9] Solving transition independent decentralized Markov decision processes
    Becker, Raphen
    Zilberstein, Shlomo
    Lesser, Victor
    Goldman, Claudia V.
    [J]. Journal of Artificial Intelligence Research, 1600, 22 : 423 - 455
  • [10] Average-Reward Decentralized Markov Decision Processes
    Petrik, Marek
    Zilberstein, Shlomo
    [J]. 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 1997 - 2002