Strategizing against No-regret Learners

被引：0

作者：

Deng, Yuan ^{[1
]}

Schneider, Jon ^{[2
]}

Sivan, Balasubramanian ^{[2
]}

机构：

[1] Duke Univ, Durham, NC 27706 USA

[2] Google Res, Mountain View, CA USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019) | 2019年 / 32卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

How should a player who repeatedly plays a game against a no-regret learner strategize to maximize his utility? We study this question and show that under some mild assumptions, the player can always guarantee himself a utility of at least what he would get in a Stackelberg equilibrium of the game. When the no-regret learner has only two actions, we show that the player cannot get any higher utility than the Stackelberg equilibrium utility. But when the no-regret learner has more than two actions and plays a mean-based no-regret strategy, we show that the player can get strictly higher than the Stackelberg equilibrium utility. We provide a characterization of the optimal game-play for the player against a mean-based no-regret learner as a solution to a control problem. When the no-regret learner's strategy also guarantees him a no-swap regret, we show that the player cannot get anything higher than a Stackelberg equilibrium utility.

引用

页数：9

共 50 条

[41] No-regret Online Learning over Riemannian Manifolds
Wang, Xi
Tu, Zhipeng
Hong, Yiguang
Wu, Yingyi
Shi, Guodong
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[42] No-Regret and Incentive-Compatible Online Learning
Freeman, Rupert
Pennock, David M.
Podimata, Chara
Vaughan, Jennifer Wortman
25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
[43] No-regret Caching via Online Mirror Descent
Salem, Tareq Si
Neglia, Giovanni
Ioannidis, Stratis
ACM TRANSACTIONS ON MODELING AND PERFORMANCE EVALUATION OF COMPUTING SYSTEMS, 2023, 8 (04)
[44] No-Regret Learning in Unknown Games with Correlated Payoffs
Sessa, Pier Giuseppe
Bogunovic, Ilija
Kamgarpour, Maryam
Krause, Andreas
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[45] No-regret control for nonlinear distributed systems with incomplete data
Nakoulima, O
Omrane, A
Velin, J
JOURNAL DE MATHEMATIQUES PURES ET APPLIQUEES, 2002, 81 (11): : 1161 - 1189
[46] Sampling Equilibria: Fast No-Regret Learning in Structured Games
Beaglehole, Daniel
Hopkins, Max
Kane, Daniel
Liu, Sihan
Lovett, Shachar
PROCEEDINGS OF THE 2023 ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, SODA, 2023, : 3817 - 3855
[47] No-Regret Algorithms for Time-Varying Bayesian Optimization
Zhou, Xingyu
Shroff, Ness
2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021,
[48] Distributed no-regret edge resource allocation with limited communication
Kriouile, Saad
Tsilimantos, Dimitrios
Giannakas, Theodoros
2023 IEEE 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, PIMRC, 2023,
[49] Robustness enhancement of complex networks via No-Regret learning
Sohn, Insoo
ICT EXPRESS, 2019, 5 (03): : 163 - 166
[50] No-Regret Reinforcement Learning with Heavy-Tailed Rewards
Zhuang, Vincent
Sui, Yanan
24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130

← 1 2 3 4 5 →