On Gradient-Based Learning in Continuous Games

Cited by: 64
Authors
Mazumdar, Eric [1 ]
Ratliff, Lillian J. [2 ]
Sastry, S. Shankar [1 ]
Affiliations
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Univ Washington, Seattle, WA 98195 USA
Source
Funding
U.S. National Science Foundation;
Keywords
continuous games; gradient-based algorithms; multiagent learning; EQUILIBRIA;
DOI
10.1137/18M1231298
Chinese Library Classification
O29 [Applied Mathematics];
Discipline classification code
070104 ;
Abstract
We introduce a general framework for competitive gradient-based learning that encompasses a wide breadth of multiagent learning algorithms, and analyze the limiting behavior of competitive gradient-based learning algorithms using dynamical systems theory. For both general-sum and potential games, we characterize a nonnegligible subset of the local Nash equilibria that will be avoided if each agent employs a gradient-based learning algorithm. We also shed light on the issue of convergence to non-Nash strategies in general- and zero-sum games, which may have no relevance to the underlying game, and arise solely due to the choice of algorithm. The existence and frequency of such strategies may explain some of the difficulties encountered when using gradient descent in zero-sum games as, e.g., in the training of generative adversarial networks. To reinforce the theoretical contributions, we provide empirical results that highlight the frequency of linear quadratic dynamic games (a benchmark for multiagent reinforcement learning) that admit global Nash equilibria that are almost surely avoided by policy gradient.
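The abstract's claim about convergence to non-Nash strategies can be illustrated with a small sketch. The quadratic zero-sum game below, its coefficients `A`, `B`, `D`, the step size, and the starting point are all hypothetical choices made for illustration; none of them are taken from the paper. Each player runs gradient descent on its own cost, the joint iterate converges to the origin, and yet the origin is not a local Nash equilibrium because player 1's cost is locally maximized there.

```python
# Hypothetical two-player zero-sum quadratic game (illustrative, not from the paper):
#   player 1 chooses x to minimize  f(x, y) = (A/2) x^2 + B x y + (D/2) y^2
#   player 2 chooses y to minimize -f(x, y)
A, B, D = -1.0, 3.0, -3.0  # assumed coefficients


def f(x, y):
    """Player 1's cost; player 2's cost is -f."""
    return 0.5 * A * x * x + B * x * y + 0.5 * D * y * y


def omega(x, y):
    """Stack each player's gradient of its own cost w.r.t. its own variable."""
    grad_x = A * x + B * y        # d f / dx
    grad_y = -(B * x + D * y)     # d(-f) / dy
    return grad_x, grad_y


def simultaneous_gradient_descent(x, y, step=0.05, iters=500):
    """Both players update at the same time using their own gradients."""
    for _ in range(iters):
        gx, gy = omega(x, y)
        x, y = x - step * gx, y - step * gy
    return x, y


x_final, y_final = simultaneous_gradient_descent(1.0, -0.5)

# Second-order check at the origin: a strict local Nash equilibrium needs
# positive curvature of each player's own cost in its own variable.
is_local_nash = (A > 0) and (-D > 0)

print("final iterate:", x_final, y_final)
print("origin is a strict local Nash equilibrium:", is_local_nash)
```

In this sketch the Jacobian of `omega` is `[[A, B], [-B, -D]]`, which has trace 2 and determinant 6, so both eigenvalues have positive real part and the origin attracts the gradient dynamics. Because `A < 0`, player 1 could lower its cost by moving away from `x = 0`, so the limit point is an artifact of the learning rule rather than an equilibrium of the game.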
Pages: 103-131
Page count: 29
Related papers
50 in total
  • [21] Maximizing Local Rewards on Multi-Agent Quantum Games through Gradient-Based Learning Strategies
    Silva, Agustin
    Zabaleta, Omar Gustavo
    Arizmendi, Constancio Miguel
    Lo Franco, Rosario
    [J]. ENTROPY, 2023, 25 (11)
  • [22] Direct gradient-based reinforcement learning for robot behavior learning
    El-Fakdi, Andres
    Carreras, Marc
    Ridao, Pere
    [J]. INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS II, 2007, : 175 - +
  • [23] Gradient-based algorithms for finding Nash equilibria in extensive form games
    Gilpin, Andrew
    Hoda, Samid
    Pena, Javier
    Sandholm, Tuomas
    [J]. INTERNET AND NETWORK ECONOMICS, PROCEEDINGS, 2007, 4858 : 57 - +
  • [24] Gradient-Based Neuromorphic Learning on Dynamical RRAM Arrays
    Zhou, Peng
    Choi, Dong-Uk
    Lu, Wei D.
    Kang, Sung-Mo
    Eshraghian, Jason K.
    [J]. IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2022, 12 (04) : 888 - 897
  • [25] Signal Propagation in a Gradient-Based and Evolutionary Learning System
    Toutouh, Jamal
    O'Reilly, Una-May
    [J]. PROCEEDINGS OF THE 2021 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'21), 2021, : 377 - 385
  • [26] Global Optimality in Bivariate Gradient-based DAG Learning
    Deng, Chang
    Bello, Kevin
    Ravikumar, Pradeep
    Aragam, Bryon
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [27] Gradient-Based Feature Learning under Structured Data
    Mousavi-Hosseini, Alireza
    Wu, Denny
    Suzuki, Taiji
    Erdogdu, Murat A.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [28] Continuous rotation invariant features for gradient-based texture classification
    Hanbay, Kazim
    Alpaslan, Nuh
    Talu, Muhammed Fatih
    Hanbay, Davut
    Karci, Ali
    Kocamaz, Adnan Fatih
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2015, 132 : 87 - 101
  • [29] CONTINUOUS EDGE GRADIENT-BASED TEMPLATE MATCHING FOR ARTICULATED OBJECTS
    Mohr, Daniel
    Zachmann, Gabriel
    [J]. VISAPP 2009: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, 2009, : 519 - 524
  • [30] Dynamics of gradient-based learning and applications to hyperparameter estimation
    Wong, KYM
    Luo, PX
    Li, FL
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING, 2003, 2690 : 369 - 376