Strategizing against No-regret Learners

被引：0

作者：

Deng, Yuan ^{[1
]}

Schneider, Jon ^{[2
]}

Sivan, Balasubramanian ^{[2
]}

机构：

[1] Duke Univ, Durham, NC 27706 USA

[2] Google Res, Mountain View, CA USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019) | 2019年 / 32卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

How should a player who repeatedly plays a game against a no-regret learner strategize to maximize his utility? We study this question and show that under some mild assumptions, the player can always guarantee himself a utility of at least what he would get in a Stackelberg equilibrium of the game. When the no-regret learner has only two actions, we show that the player cannot get any higher utility than the Stackelberg equilibrium utility. But when the no-regret learner has more than two actions and plays a mean-based no-regret strategy, we show that the player can get strictly higher than the Stackelberg equilibrium utility. We provide a characterization of the optimal game-play for the player against a mean-based no-regret learner as a solution to a control problem. When the no-regret learner's strategy also guarantees him a no-swap regret, we show that the player cannot get anything higher than a Stackelberg equilibrium utility.

引用

页数：9

共 50 条

[31] No-regret Exploration in Contextual Reinforcement Learning
Modi, Aditya
Tewari, Ambuj
CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 829 - 838
[32] No-Regret Online Prediction with Strategic Experts
Sadeghi, Omid
Fazel, Maryam
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[33] No-Regret Linear Bandits beyond Realizability
Liu, Chong
Yin, Ming
Wang, Yu-Xiang
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 1294 - 1303
[34] No-Regret Caching via Online Mirror Descent
Salem, Tareq Si
Neglia, Giovanni
Ioannidis, Straus
IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
[35] Doubly Optimal No-Regret Learning in Monotone Games
Cai, Yang
Zheng, Weiqiang
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
[36] No-Regret and Incentive-Compatible Online Learning
Freeman, Rupert
Pennock, David M.
Podimata, Chara
Vaughan, Jennifer Wortman
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
[37] Mechanisms for a No-Regret Agent: Beyond the Common Prior
Camara, Modibo K.
Hartline, Jason D.
Johnsen, Aleck
2020 IEEE 61ST ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS 2020), 2020, : 259 - 270
[38] No-Regret Learning in Partially-Informed Auctions
Guo, Wenshuo
Jordan, Michael I.
Vitercik, Ellen
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[39] No-Regret Learning for Coalitional Model Predictive Control
Chanfreut, P.
Maestre, J. M.
Zhu, Q.
Camacho, E. F.
IFAC PAPERSONLINE, 2020, 53 (02): : 3439 - 3444
[40] No-Regret Learning and Equilibrium Computation in Quantum Games
Lin, Wayne
Piliouras, Georgios
Sim, Ryann
Varvitsiotis, Antonios
QUANTUM, 2024, 8

← 1 2 3 4 5 →