No-regret learning in games with noisy feedback: Faster rates and adaptivity via learning rate separation

被引:0
|
作者
Hsieh, Yu-Guan [1 ]
Antonakopoulos, Kimon [2 ]
Cevher, Volkan [2 ]
Mertikopoulos, Panayotis [1 ,3 ,4 ]
机构
[1] Univ Grenoble Alpes, Grenoble, France
[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
[3] CNRS, Inria, LIG, Paris, France
[4] Criteo AI Lab, Ann Arbor, MI USA
基金
瑞士国家科学基金会; 欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We examine the problem of regret minimization when the learner is involved in a continuous game with other optimizing agents: in this case, if all players follow a no-regret algorithm, it is possible to achieve significantly lower regret relative to fully adversarial environments. We study this problem in the context of variationally stable games (a class of continuous games which includes all convex-concave and monotone games), and when the players only have access to noisy estimates of their individual payoff gradients. If the noise is additive, the game-theoretic and purely adversarial settings enjoy similar regret guarantees; however, if the noise is multiplicative, we show that the learners can, in fact, achieve constant regret. We achieve this faster rate via an optimistic gradient scheme with learning rate separation - that is, the method's extrapolation and update steps are tuned to different schedules, depending on the noise profile. Subsequently, to eliminate the need for delicate hyperparameter tuning, we propose a fully adaptive method that attains nearly the same guarantees as its non-adapted counterpart, while operating without knowledge of either the game or of the noise profile.
引用
收藏
页数:13
相关论文
共 37 条
  • [21] Multiplicative Updates Outperform Generic No-Regret Learning in Congestion Games
    Kleinberg, Robert
    Piliouras, Georgios
    Tardos, Eva
    STOC'09: PROCEEDINGS OF THE 2009 ACM SYMPOSIUM ON THEORY OF COMPUTING, 2009, : 533 - 542
  • [22] Robustness enhancement of complex networks via No-Regret learning
    Sohn, Insoo
    ICT EXPRESS, 2019, 5 (03): : 163 - 166
  • [23] Adaptive, Doubly Optimal No-Regret Learning in Strongly Monotone and Exp-Concave Games with Gradient Feedback
    Jordan, Michael
    Lin, Tianyi
    Zhou, Zhengyuan
    OPERATIONS RESEARCH, 2024,
  • [24] No-regret learning for repeated non-cooperative games with lossy bandits
    Liu, Wenting
    Lei, Jinlong
    Yi, Peng
    Hong, Yiguang
    AUTOMATICA, 2024, 160
  • [25] On the Complexity of Computing Sparse Equilibria and Lower Bounds for No-Regret Learning in Games
    Anagnostides, Ioannis
    Kalavasis, Alkis
    Sandholm, Tuomas
    Zampetakis, Manolis
    15TH INNOVATIONS IN THEORETICAL COMPUTER SCIENCE CONFERENCE, ITCS 2024, 2024,
  • [26] No-Regret Learning in Time-Varying Zero-Sum Games
    Zhang, Mengxiao
    Zhao, Peng
    Luo, Haipeng
    Zhou, Zhi-Hua
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [27] Near-Optimal No-Regret Learning Dynamics for General Convex Games
    Farina, Gabriele
    Anagnostides, Ioannis
    Luo, Haipeng
    Lee, Chung-Wei
    Kroer, Christian
    Sandholm, Tuomas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [28] Tighter Robust Upper Bounds for Options via No-Regret Learning
    Xue, Shan
    Du, Ye
    Xu, Liang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 4, 2023, : 5348 - 5356
  • [29] No-Regret Learning in Bilateral Trade via Global Budget Balance
    Bernasconi, Martino
    Castiglioni, Matteo
    Celli, Andrea
    Fusco, Federico
    PROCEEDINGS OF THE 56TH ANNUAL ACM SYMPOSIUM ON THEORY OF COMPUTING, STOC 2024, 2024, : 247 - 258
  • [30] No-Regret Distributed Learning in Two-Network Zero-Sum Games
    Huang, Shijie
    Lei, Jinlong
    Hong, Yiguang
    Shanbhag, Uday, V
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 924 - 929