No-regret learning in games with noisy feedback: Faster rates and adaptivity via learning rate separation

被引:0
|
作者
Hsieh, Yu-Guan [1 ]
Antonakopoulos, Kimon [2 ]
Cevher, Volkan [2 ]
Mertikopoulos, Panayotis [1 ,3 ,4 ]
机构
[1] Univ Grenoble Alpes, Grenoble, France
[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
[3] CNRS, Inria, LIG, Paris, France
[4] Criteo AI Lab, Ann Arbor, MI USA
基金
瑞士国家科学基金会; 欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We examine the problem of regret minimization when the learner is involved in a continuous game with other optimizing agents: in this case, if all players follow a no-regret algorithm, it is possible to achieve significantly lower regret relative to fully adversarial environments. We study this problem in the context of variationally stable games (a class of continuous games which includes all convex-concave and monotone games), and when the players only have access to noisy estimates of their individual payoff gradients. If the noise is additive, the game-theoretic and purely adversarial settings enjoy similar regret guarantees; however, if the noise is multiplicative, we show that the learners can, in fact, achieve constant regret. We achieve this faster rate via an optimistic gradient scheme with learning rate separation - that is, the method's extrapolation and update steps are tuned to different schedules, depending on the noise profile. Subsequently, to eliminate the need for delicate hyperparameter tuning, we propose a fully adaptive method that attains nearly the same guarantees as its non-adapted counterpart, while operating without knowledge of either the game or of the noise profile.
引用
收藏
页数:13
相关论文
共 37 条
  • [31] A Marriage between Adversarial Team Games and 2-player Games: Enabling Abstractions, No-regret Learning, and Subgame Solving
    Carminati, Luca
    Cacciamani, Federico
    Ciccone, Marco
    Gatti, Nicola
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [32] Near-Optimal No-Regret Learning for Correlated Equilibria in Multi-player General-Sum Games
    Anagnostides, Ioannis
    Daskalakis, Constantinos
    Farina, Gabriele
    Fishelson, Maxwell
    Golowich, Noah
    Sandholm, Tuomas
    PROCEEDINGS OF THE 54TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING (STOC '22), 2022, : 736 - 749
  • [33] R2-B2: Recursive Reasoning-Based Bayesian Optimization for No-Regret Learning in Games
    Dai, Zhongxiang
    Chen, Yizhou
    Low, Bryan Kian Hsiang
    Jaillet, Patrick
    Ho, Teck-Hua
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [34] R2-B2: Recursive Reasoning-Based Bayesian Optimization for No-Regret Learning in Games
    Dai, Zhongxiang
    Chen, Yizhou
    Low, Bryan Kian Hsiang
    Jaillet, Patrick
    Ho, Teck-Hua
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [35] Achieving Logarithmic Regret via Hints in Online Learning of Noisy LQR Systems
    Akbari, Mohammad
    Gharesifard, Bahman
    Linder, Tamas
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 4700 - 4705
  • [36] Channel Selection for Network-Assisted D2D Communication via No-Regret Bandit Learning With Calibrated Forecasting
    Maghsudi, Setareh
    Stanczak, Slawomir
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2015, 14 (03) : 1309 - 1322
  • [37] Fault-Tolerant H∞ Control for Topside Separation Systems via Output-Feedback Reinforcement Learning
    Zhang, Yuguang
    Luo, Xiaoyuan
    Li, Shaobao
    Wang, Juan
    Yang, Zhenyu
    Guan, Xinping
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025,