On the Convergence of No-Regret Learning Dynamics in Time-Varying Games

被引:0
|
作者
Anagnostides, Ioannis [1 ]
Panageas, Ioannis [2 ]
Farina, Gabriele [3 ]
Sandholm, Tuomas [1 ,4 ,5 ,6 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Univ Calif Irvine, Irvine, CA USA
[3] MIT, Cambridge, MA USA
[4] Strateg Machine Inc, Charlotte, NC USA
[5] Strategy Robot Inc, Pittsburgh, PA USA
[6] Optimized Markets Inc, Pittsburgh, PA USA
基金
美国国家科学基金会;
关键词
POKER;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of the literature on learning in games has focused on the restrictive setting where the underlying repeated game does not change over time. Much less is known about the convergence of no-regret learning algorithms in dynamic multi-agent settings. In this paper, we characterize the convergence of optimistic gradient descent (OGD) in time-varying games. Our framework yields sharp convergence bounds for the equilibrium gap of OGD in zero-sum games parameterized on natural variation measures of the sequence of games, subsuming known results for static games. Furthermore, we establish improved second-order variation bounds under strong convexity-concavity, as long as each game is repeated multiple times. Our results also extend to time-varying general-sum multi-player games via a bilinear formulation of correlated equilibria, which has novel implications for meta-learning and for obtaining refined variation-dependent regret bounds, addressing questions left open in prior papers. Finally, we leverage our framework to also provide new insights on dynamic regret guarantees in static games.
引用
收藏
页数:39
相关论文
共 50 条
  • [41] Online Optimisation for Online Learning and Control - From No-Regret to Generalised Error Convergence
    Calliess, J.
    2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), 2019, : 2480 - 2485
  • [42] No-Regret Distributed Learning in Two-Network Zero-Sum Games
    Huang, Shijie
    Lei, Jinlong
    Hong, Yiguang
    Shanbhag, Uday, V
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 924 - 929
  • [43] No-regret learning in games with noisy feedback: Faster rates and adaptivity via learning rate separation
    Hsieh, Yu-Guan
    Antonakopoulos, Kimon
    Cevher, Volkan
    Mertikopoulos, Panayotis
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [44] Acceleration through Optimistic No-Regret Dynamics
    Wang, Jun-Kun
    Abernethy, Jacob
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [45] Distributed algorithms with linear convergence for aggregative games over time-varying networks
    Zhu, Rui
    Wang, Fuyong
    Liu, Zhongxin
    Chen, Zengqiang
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 273
  • [46] No-Regret Learning Supports Voters' Competence
    Spelda, Petr
    Stritecky, Vit
    Symons, John
    SOCIAL EPISTEMOLOGY, 2024, 38 (05) : 543 - 559
  • [47] Convergence Analysis of No-Regret Bidding Algorithms in Repeated Auctions
    Feng, Zhe
    Guruganesh, Guru
    Liaw, Christopher
    Mehta, Aranyak
    Sethi, Abhishek
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 5399 - 5406
  • [48] Weighted Voting Via No-Regret Learning
    Haghtalab, Nika
    Noothigattu, Ritesh
    Procaccia, Ariel D.
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 1055 - 1062
  • [49] No-regret Exploration in Contextual Reinforcement Learning
    Modi, Aditya
    Tewari, Ambuj
    CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 829 - 838
  • [50] Asynchronous and Time-Varying Proximal Type Dynamics in Multiagent Network Games
    Cenedese, Carlo
    Belgioioso, Giuseppe
    Kawano, Yu
    Grammatico, Sergio
    Cao, Ming
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (06) : 2861 - 2867