Strategizing against No-regret Learners

被引：0

作者：

Deng, Yuan ^{[1
]}

Schneider, Jon ^{[2
]}

Sivan, Balasubramanian ^{[2
]}

机构：

[1] Duke Univ, Durham, NC 27706 USA

[2] Google Res, Mountain View, CA USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019) | 2019年 / 32卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

How should a player who repeatedly plays a game against a no-regret learner strategize to maximize his utility? We study this question and show that under some mild assumptions, the player can always guarantee himself a utility of at least what he would get in a Stackelberg equilibrium of the game. When the no-regret learner has only two actions, we show that the player cannot get any higher utility than the Stackelberg equilibrium utility. But when the no-regret learner has more than two actions and plays a mean-based no-regret strategy, we show that the player can get strictly higher than the Stackelberg equilibrium utility. We provide a characterization of the optimal game-play for the player against a mean-based no-regret learner as a solution to a control problem. When the no-regret learner's strategy also guarantees him a no-swap regret, we show that the player cannot get anything higher than a Stackelberg equilibrium utility.

引用

页数：9

共 50 条

[1] On Fixed Convex Combinations of No-Regret Learners
Calliess, Jan-P.
MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, 2009, 5632 : 494 - 504
[2] Playing against no-regret players
D'Andrea, Maurizio
OPERATIONS RESEARCH LETTERS, 2023, 51 (02) : 142 - 145
[3] Strategizing against Learners in Bayesian Games
Mansour, Yishay
Mohri, Mehryar
Schneider, Jon
Sivan, Balasubramanian
CONFERENCE ON LEARNING THEORY, VOL 178, 2022, 178
[4] No-regret boosting
Gambin, Anna
Szczurek, Ewa
ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, PT 1, 2007, 4431 : 422 - +
[5] No regrets about no-regret
Chang, Yu-Han
ARTIFICIAL INTELLIGENCE, 2007, 171 (07) : 434 - 439
[6] Constrained no-regret learning
Du, Ye
Lehrer, Ehud
JOURNAL OF MATHEMATICAL ECONOMICS, 2020, 88 : 16 - 24
[7] Selling to a No-Regret Buyer
Braverman, Mark
Mao, Jieming
Schneider, Jon
Weinberg, Matt
ACM EC'18: PROCEEDINGS OF THE 2018 ACM CONFERENCE ON ECONOMICS AND COMPUTATION, 2018, : 523 - 538
[8] No-regret Reinforcement Learning
Gopalan, Aditya
2019 FIFTH INDIAN CONTROL CONFERENCE (ICC), 2019, : 16 - 16
[9] A wide range no-regret theorem
Lehrer, E
GAMES AND ECONOMIC BEHAVIOR, 2003, 42 (01) : 101 - 115
[10] No-Regret Slice Reservation Algorithms
Monteil, Jean-Baptiste
Iosifidis, George
DaSilva, Luiz
IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,

← 1 2 3 4 5 →