Strategizing against No-regret Learners

被引:0
|
作者
Deng, Yuan [1 ]
Schneider, Jon [2 ]
Sivan, Balasubramanian [2 ]
机构
[1] Duke Univ, Durham, NC 27706 USA
[2] Google Res, Mountain View, CA USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019) | 2019年 / 32卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
How should a player who repeatedly plays a game against a no-regret learner strategize to maximize his utility? We study this question and show that under some mild assumptions, the player can always guarantee himself a utility of at least what he would get in a Stackelberg equilibrium of the game. When the no-regret learner has only two actions, we show that the player cannot get any higher utility than the Stackelberg equilibrium utility. But when the no-regret learner has more than two actions and plays a mean-based no-regret strategy, we show that the player can get strictly higher than the Stackelberg equilibrium utility. We provide a characterization of the optimal game-play for the player against a mean-based no-regret learner as a solution to a control problem. When the no-regret learner's strategy also guarantees him a no-swap regret, we show that the player cannot get anything higher than a Stackelberg equilibrium utility.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] No-regret Exploration in Contextual Reinforcement Learning
    Modi, Aditya
    Tewari, Ambuj
    CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 829 - 838
  • [32] No-Regret Online Prediction with Strategic Experts
    Sadeghi, Omid
    Fazel, Maryam
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [33] No-Regret Linear Bandits beyond Realizability
    Liu, Chong
    Yin, Ming
    Wang, Yu-Xiang
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 1294 - 1303
  • [34] No-Regret Caching via Online Mirror Descent
    Salem, Tareq Si
    Neglia, Giovanni
    Ioannidis, Straus
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [35] Doubly Optimal No-Regret Learning in Monotone Games
    Cai, Yang
    Zheng, Weiqiang
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [36] No-Regret and Incentive-Compatible Online Learning
    Freeman, Rupert
    Pennock, David M.
    Podimata, Chara
    Vaughan, Jennifer Wortman
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [37] Mechanisms for a No-Regret Agent: Beyond the Common Prior
    Camara, Modibo K.
    Hartline, Jason D.
    Johnsen, Aleck
    2020 IEEE 61ST ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS 2020), 2020, : 259 - 270
  • [38] No-Regret Learning in Partially-Informed Auctions
    Guo, Wenshuo
    Jordan, Michael I.
    Vitercik, Ellen
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [39] No-Regret Learning for Coalitional Model Predictive Control
    Chanfreut, P.
    Maestre, J. M.
    Zhu, Q.
    Camacho, E. F.
    IFAC PAPERSONLINE, 2020, 53 (02): : 3439 - 3444
  • [40] No-Regret Learning and Equilibrium Computation in Quantum Games
    Lin, Wayne
    Piliouras, Georgios
    Sim, Ryann
    Varvitsiotis, Antonios
    QUANTUM, 2024, 8