The Complexity of Markov Equilibrium in Stochastic Games

被引:0
|
作者
Daskalakis, Constantinos [1 ]
Golowich, Noah [1 ]
Zhang, Kaiqing [2 ]
机构
[1] MIT, Cambridge, MA 02139 USA
[2] Univ Maryland, College Pk, MD 20740 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We show that computing approximate stationary Markov coarse correlated equilibria (CCE) in general-sum stochastic games is PPAD-hard, even when there are two players, the game is turn-based, the discount factor is an absolute constant, and the approximation is an absolute constant. Our intractability results stand in sharp contrast to the results in normal-form games, where exact CCEs are efficiently computable. A fortiori, our results imply that, in the setting of multi-agent reinforcement learning (MARL), it is computationally hard to learn stationary Markov CCE policies in stochastic games, even when the interaction is two-player and turn-based, and both the discount factor and the desired approximation of the learned policies is an absolute constant. In turn, these results stand in sharp contrast to single-agent reinforcement learning (RL) where near-optimal stationary Markov policies can be computationally efficiently learned. Complementing our intractability results for stationary Markov CCEs, we provide a decentralized algorithm (assuming shared randomness among players) for learning a nonstationary Markov CCE policy with polynomial time and sample complexity in all problem parameters. Previous work for learning Markov CCE policies all required exponential time and sample complexity in the number of players. In the balance, our work advocates for the use of nonstationary Markov CCE policies as a computationally and statistically tractable solution concept in MARL, advancing an important and outstanding frontier in machine learning.
引用
收藏
页数:55
相关论文
共 50 条
  • [1] On the complexity of computing Markov perfect equilibrium in general-sum stochastic games
    Deng, Xiaotie
    Li, Ningyuan
    Mguni, David
    Wang, Jun
    Yang, Yaodong
    [J]. NATIONAL SCIENCE REVIEW, 2023, 10 (01)
  • [2] On the complexity of computing Markov perfect equilibrium in general-sum stochastic games
    Xiaotie Deng
    Ningyuan Li
    David Mguni
    Jun Wang
    Yaodong Yang
    [J]. National Science Review, 2023, 10 (01) : 288 - 301
  • [3] Pure strategy Markov equilibrium in stochastic games with a continuum of players
    Chakrabarti, SK
    [J]. JOURNAL OF MATHEMATICAL ECONOMICS, 2003, 39 (07) : 693 - 724
  • [4] THE COMPLEXITY OF STOCHASTIC GAMES
    CONDON, A
    [J]. INFORMATION AND COMPUTATION, 1992, 96 (02) : 203 - 224
  • [5] On the complexity of computing Markov perfect equilibrium in general-sum stochastic games (vol 10, nwac256, 2023)
    Deng, Xiaotie
    Li, Ningyuan
    Mguni, David
    Wang, Jun
    Yang, Yaodong
    [J]. NATIONAL SCIENCE REVIEW, 2023, 10 (02)
  • [6] Communication Complexity of Stochastic Games
    Krishnamurthy, Nagarajan
    Parthasarathy, T.
    Ravindran, G.
    [J]. 2009 INTERNATIONAL CONFERENCE ON GAME THEORY FOR NETWORKS (GAMENETS 2009), 2009, : 411 - 417
  • [7] The complexity of stochastic Muller games
    Chatterjee, Krishnendu
    [J]. INFORMATION AND COMPUTATION, 2012, 211 : 29 - 48
  • [8] A constructive geometrical approach to the uniqueness of Markov stationary equilibrium in stochastic games of intergenerational altruism
    Balbus, Lukasz
    Reffett, Kevin
    Wozny, Lukasz
    [J]. JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2013, 37 (05): : 1019 - 1039
  • [9] Correlated equilibrium in stochastic games
    Solan, E
    Vieille, N
    [J]. GAMES AND ECONOMIC BEHAVIOR, 2002, 38 (02) : 362 - 399
  • [10] Cyclic Markov equilibria in stochastic games
    Flesch, J
    Thuijsman, F
    Vrieze, K
    [J]. INTERNATIONAL JOURNAL OF GAME THEORY, 1997, 26 (03) : 303 - 314