No-regret learning in games with noisy feedback: Faster rates and adaptivity via learning rate separation

被引：0

作者：

Hsieh, Yu-Guan ^{[1
]}

Antonakopoulos, Kimon ^{[2
]}

Cevher, Volkan ^{[2
]}

Mertikopoulos, Panayotis ^{[1
,3
,4
]}

机构：

[1] Univ Grenoble Alpes, Grenoble, France

[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland

[3] CNRS, Inria, LIG, Paris, France

[4] Criteo AI Lab, Ann Arbor, MI USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022 | 2022年

基金：

瑞士国家科学基金会; 欧洲研究理事会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We examine the problem of regret minimization when the learner is involved in a continuous game with other optimizing agents: in this case, if all players follow a no-regret algorithm, it is possible to achieve significantly lower regret relative to fully adversarial environments. We study this problem in the context of variationally stable games (a class of continuous games which includes all convex-concave and monotone games), and when the players only have access to noisy estimates of their individual payoff gradients. If the noise is additive, the game-theoretic and purely adversarial settings enjoy similar regret guarantees; however, if the noise is multiplicative, we show that the learners can, in fact, achieve constant regret. We achieve this faster rate via an optimistic gradient scheme with learning rate separation - that is, the method's extrapolation and update steps are tuned to different schedules, depending on the noise profile. Subsequently, to eliminate the need for delicate hyperparameter tuning, we propose a fully adaptive method that attains nearly the same guarantees as its non-adapted counterpart, while operating without knowledge of either the game or of the noise profile.

引用

页数：13

共 37 条

[31] A Marriage between Adversarial Team Games and 2-player Games: Enabling Abstractions, No-regret Learning, and Subgame Solving
Carminati, Luca
Cacciamani, Federico
Ciccone, Marco
Gatti, Nicola
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[32] Near-Optimal No-Regret Learning for Correlated Equilibria in Multi-player General-Sum Games
Anagnostides, Ioannis
Daskalakis, Constantinos
Farina, Gabriele
Fishelson, Maxwell
Golowich, Noah
Sandholm, Tuomas
PROCEEDINGS OF THE 54TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING (STOC '22), 2022, : 736 - 749
[33] R2-B2: Recursive Reasoning-Based Bayesian Optimization for No-Regret Learning in Games
Dai, Zhongxiang
Chen, Yizhou
Low, Bryan Kian Hsiang
Jaillet, Patrick
Ho, Teck-Hua
25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
[34] R2-B2: Recursive Reasoning-Based Bayesian Optimization for No-Regret Learning in Games
Dai, Zhongxiang
Chen, Yizhou
Low, Bryan Kian Hsiang
Jaillet, Patrick
Ho, Teck-Hua
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
[35] Achieving Logarithmic Regret via Hints in Online Learning of Noisy LQR Systems
Akbari, Mohammad
Gharesifard, Bahman
Linder, Tamas
2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 4700 - 4705
[36] Channel Selection for Network-Assisted D2D Communication via No-Regret Bandit Learning With Calibrated Forecasting
Maghsudi, Setareh
Stanczak, Slawomir
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2015, 14 (03) : 1309 - 1322
[37] Fault-Tolerant H∞ Control for Topside Separation Systems via Output-Feedback Reinforcement Learning
Zhang, Yuguang
Luo, Xiaoyuan
Li, Shaobao
Wang, Juan
Yang, Zhenyu
Guan, Xinping
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025,

← 1 2 3 4 →