Strategizing against No-regret Learners

被引:0
|
作者
Deng, Yuan [1 ]
Schneider, Jon [2 ]
Sivan, Balasubramanian [2 ]
机构
[1] Duke Univ, Durham, NC 27706 USA
[2] Google Res, Mountain View, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
How should a player who repeatedly plays a game against a no-regret learner strategize to maximize his utility? We study this question and show that under some mild assumptions, the player can always guarantee himself a utility of at least what he would get in a Stackelberg equilibrium of the game. When the no-regret learner has only two actions, we show that the player cannot get any higher utility than the Stackelberg equilibrium utility. But when the no-regret learner has more than two actions and plays a mean-based no-regret strategy, we show that the player can get strictly higher than the Stackelberg equilibrium utility. We provide a characterization of the optimal game-play for the player against a mean-based no-regret learner as a solution to a control problem. When the no-regret learner's strategy also guarantees him a no-swap regret, we show that the player cannot get anything higher than a Stackelberg equilibrium utility.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] On Fixed Convex Combinations of No-Regret Learners
    Calliess, Jan-P.
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, 2009, 5632 : 494 - 504
  • [2] Playing against no-regret players
    D'Andrea, Maurizio
    OPERATIONS RESEARCH LETTERS, 2023, 51 (02) : 142 - 145
  • [3] Strategizing against Learners in Bayesian Games
    Mansour, Yishay
    Mohri, Mehryar
    Schneider, Jon
    Sivan, Balasubramanian
    CONFERENCE ON LEARNING THEORY, VOL 178, 2022, 178
  • [4] No-regret boosting
    Gambin, Anna
    Szczurek, Ewa
    ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, PT 1, 2007, 4431 : 422 - +
  • [5] No regrets about no-regret
    Chang, Yu-Han
    ARTIFICIAL INTELLIGENCE, 2007, 171 (07) : 434 - 439
  • [6] Constrained no-regret learning
    Du, Ye
    Lehrer, Ehud
    JOURNAL OF MATHEMATICAL ECONOMICS, 2020, 88 : 16 - 24
  • [7] Selling to a No-Regret Buyer
    Braverman, Mark
    Mao, Jieming
    Schneider, Jon
    Weinberg, Matt
    ACM EC'18: PROCEEDINGS OF THE 2018 ACM CONFERENCE ON ECONOMICS AND COMPUTATION, 2018, : 523 - 538
  • [8] No-regret Reinforcement Learning
    Gopalan, Aditya
    2019 FIFTH INDIAN CONTROL CONFERENCE (ICC), 2019, : 16 - 16
  • [9] A wide range no-regret theorem
    Lehrer, E
    GAMES AND ECONOMIC BEHAVIOR, 2003, 42 (01) : 101 - 115
  • [10] No-Regret Slice Reservation Algorithms
    Monteil, Jean-Baptiste
    Iosifidis, George
    DaSilva, Luiz
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,