On Gradient-Based Learning in Continuous Games

Cited by: 64
Authors
Mazumdar, Eric [1]
Ratliff, Lillian J. [2]
Sastry, S. Shankar [1]
Affiliations
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Univ Washington, Seattle, WA 98195 USA
Source
Funding
U.S. National Science Foundation
Keywords
continuous games; gradient-based algorithms; multiagent learning; equilibria
DOI
10.1137/18M1231298
Chinese Library Classification (CLC) number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
We introduce a general framework for competitive gradient-based learning that encompasses a wide breadth of multiagent learning algorithms, and analyze the limiting behavior of competitive gradient-based learning algorithms using dynamical systems theory. For both general-sum and potential games, we characterize a nonnegligible subset of the local Nash equilibria that will be avoided if each agent employs a gradient-based learning algorithm. We also shed light on the issue of convergence to non-Nash strategies in general- and zero-sum games, which may have no relevance to the underlying game, and arise solely due to the choice of algorithm. The existence and frequency of such strategies may explain some of the difficulties encountered when using gradient descent in zero-sum games as, e.g., in the training of generative adversarial networks. To reinforce the theoretical contributions, we provide empirical results that highlight the frequency of linear quadratic dynamic games (a benchmark for multiagent reinforcement learning) that admit global Nash equilibria that are almost surely avoided by policy gradient.
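The convergence to non-Nash strategies mentioned in the abstract can be illustrated with a minimal sketch. The quadratic zero-sum game below is a hypothetical example chosen for illustration, not one taken from the paper, and the step size and iteration count are likewise assumptions. Both players run simultaneous gradient descent on their own costs, and the iterates settle at the origin even though player 1 could lower its cost by deviating from it, so the origin is not a local Nash equilibrium.

```python
# Hypothetical two-player zero-sum game (illustrative, not from the paper):
#   player 1 picks x to minimize  f(x, y) = -0.5*x**2 + 2*x*y - y**2
#   player 2 picks y to minimize -f(x, y)

def f(x, y):
    """Cost of player 1; player 2's cost is -f."""
    return -0.5 * x**2 + 2.0 * x * y - y**2

def grad_play(x, y, step=0.1, iters=2000):
    """Simultaneous gradient descent: each player descends its own cost."""
    for _ in range(iters):
        g1 = -x + 2.0 * y        # d f / dx, player 1's individual gradient
        g2 = -2.0 * x + 2.0 * y  # d(-f) / dy, player 2's individual gradient
        x, y = x - step * g1, y - step * g2
    return x, y

# Start away from the origin; the iterates spiral in toward (0, 0).
x_star, y_star = grad_play(1.0, -1.0)
print("limit point:", x_star, y_star)

# At the limit point, player 1 sits at a local maximum of its own cost:
# moving x away from 0 (with y fixed at 0) strictly lowers f.
print("cost at origin:", f(0.0, 0.0), "cost after deviating:", f(0.5, 0.0))
```

In this sketch the Jacobian of the stacked individual gradients is [[-1, 2], [-2, 2]], whose eigenvalues have positive real part, so the gradient dynamics are locally attracted to the origin, while the second derivative of player 1's cost in its own variable is negative there.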
Pages: 103-131
Page count: 29