No-regret learning in games with noisy feedback: Faster rates and adaptivity via learning rate separation

被引：0

作者：

Hsieh, Yu-Guan ^{[1
]}

Antonakopoulos, Kimon ^{[2
]}

Cevher, Volkan ^{[2
]}

Mertikopoulos, Panayotis ^{[1
,3
,4
]}

机构：

[1] Univ Grenoble Alpes, Grenoble, France

[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland

[3] CNRS, Inria, LIG, Paris, France

[4] Criteo AI Lab, Ann Arbor, MI USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022 | 2022年

基金：

瑞士国家科学基金会; 欧洲研究理事会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We examine the problem of regret minimization when the learner is involved in a continuous game with other optimizing agents: in this case, if all players follow a no-regret algorithm, it is possible to achieve significantly lower regret relative to fully adversarial environments. We study this problem in the context of variationally stable games (a class of continuous games which includes all convex-concave and monotone games), and when the players only have access to noisy estimates of their individual payoff gradients. If the noise is additive, the game-theoretic and purely adversarial settings enjoy similar regret guarantees; however, if the noise is multiplicative, we show that the learners can, in fact, achieve constant regret. We achieve this faster rate via an optimistic gradient scheme with learning rate separation - that is, the method's extrapolation and update steps are tuned to different schedules, depending on the noise profile. Subsequently, to eliminate the need for delicate hyperparameter tuning, we propose a fully adaptive method that attains nearly the same guarantees as its non-adapted counterpart, while operating without knowledge of either the game or of the noise profile.

引用

页数：13

共 37 条

[21] Multiplicative Updates Outperform Generic No-Regret Learning in Congestion Games
Kleinberg, Robert
Piliouras, Georgios
Tardos, Eva
STOC'09: PROCEEDINGS OF THE 2009 ACM SYMPOSIUM ON THEORY OF COMPUTING, 2009, : 533 - 542
[22] Robustness enhancement of complex networks via No-Regret learning
Sohn, Insoo
ICT EXPRESS, 2019, 5 (03): : 163 - 166
[23] Adaptive, Doubly Optimal No-Regret Learning in Strongly Monotone and Exp-Concave Games with Gradient Feedback
Jordan, Michael
Lin, Tianyi
Zhou, Zhengyuan
OPERATIONS RESEARCH, 2024,
[24] No-regret learning for repeated non-cooperative games with lossy bandits
Liu, Wenting
Lei, Jinlong
Yi, Peng
Hong, Yiguang
AUTOMATICA, 2024, 160
[25] On the Complexity of Computing Sparse Equilibria and Lower Bounds for No-Regret Learning in Games
Anagnostides, Ioannis
Kalavasis, Alkis
Sandholm, Tuomas
Zampetakis, Manolis
15TH INNOVATIONS IN THEORETICAL COMPUTER SCIENCE CONFERENCE, ITCS 2024, 2024,
[26] No-Regret Learning in Time-Varying Zero-Sum Games
Zhang, Mengxiao
Zhao, Peng
Luo, Haipeng
Zhou, Zhi-Hua
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[27] Near-Optimal No-Regret Learning Dynamics for General Convex Games
Farina, Gabriele
Anagnostides, Ioannis
Luo, Haipeng
Lee, Chung-Wei
Kroer, Christian
Sandholm, Tuomas
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
[28] Tighter Robust Upper Bounds for Options via No-Regret Learning
Xue, Shan
Du, Ye
Xu, Liang
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 4, 2023, : 5348 - 5356
[29] No-Regret Learning in Bilateral Trade via Global Budget Balance
Bernasconi, Martino
Castiglioni, Matteo
Celli, Andrea
Fusco, Federico
PROCEEDINGS OF THE 56TH ANNUAL ACM SYMPOSIUM ON THEORY OF COMPUTING, STOC 2024, 2024, : 247 - 258
[30] No-Regret Distributed Learning in Two-Network Zero-Sum Games
Huang, Shijie
Lei, Jinlong
Hong, Yiguang
Shanbhag, Uday, V
2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 924 - 929

← 1 2 3 4 →