Bayesian Incentive-Compatible Bandit Exploration

被引:16
|
作者
Mansour, Yishay [1 ]
Slivkins, Aleksandrs [2 ]
Syrgkanis, Vasilis [3 ]
机构
[1] Tel Aviv Univ, Sch Comp Sci, IL-6997801 Tel Aviv, Israel
[2] Microsoft Res, New York, NY 10011 USA
[3] Microsoft Res, Cambridge, MA 02142 USA
关键词
mechanism design; multiarmed bandits; regret; Bayesian incentive-compatibility; CLINICAL-TRIAL DESIGN; MULTIARMED BANDIT; ALGORITHMS; SIGNATURE; REGRET; BOUNDS; ERA;
D O I
10.1287/opre.2019.1949
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
As self-interested individuals ("agents") make decisions over time, they utilize information revealed by other agents in the past and produce information that may help agents in the future. This phenomenon is common in a wide range of scenarios in the Internet economy, as well as in medical decisions. Each agent would like to exploit: select the best action given the current information, but would prefer the previous agents to explore: try out various alternatives to collect information. A social planner, by means of a carefully designed recommendation policy, can incentivize the agents to balance the exploration and exploitation so as to maximize social welfare. We model the planner's recommendation policy as a multiarm bandit algorithm under incentive-compatibility constraints induced by agents' Bayesian priors. We design a bandit algorithm which is incentive-compatible and has asymptotically optimal performance, as expressed by regret. Further, we provide a black-box reduction from an arbitrary multiarm bandit algorithm to an incentive-compatible one, with only a constant multiplicative increase in regret. This reduction works for very general bandit settings that incorporate contexts and arbitrary partial feedback.
引用
收藏
页码:1132 / 1161
页数:30
相关论文
共 50 条
  • [1] Optimal Algorithm for Bayesian Incentive-Compatible Exploration
    Cohen, Lee
    Mansour, Yishay
    ACM EC '19: PROCEEDINGS OF THE 2019 ACM CONFERENCE ON ECONOMICS AND COMPUTATION, 2019, : 135 - 151
  • [2] An Incentive-Compatible Multi-Armed Bandit Mechanism
    Gonen, Rica
    Pavlov, Elan
    PODC'07: PROCEEDINGS OF THE 26TH ANNUAL ACM SYMPOSIUM ON PRINCIPLES OF DISTRIBUTED COMPUTING, 2007, : 362 - 363
  • [3] On incentive-compatible estimators
    Eliaz, Kfir
    Spiegler, Ran
    GAMES AND ECONOMIC BEHAVIOR, 2022, 132 : 204 - 220
  • [4] Incentive-Compatible Diffusion
    Babichenko, Yakov
    Dean, Oren
    Tennenholtz, Moshe
    WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018, : 1379 - 1388
  • [5] Incentive-Compatible Classification
    Babichenko, Yakov
    Dean, Oren
    Tennenholtz, Moshe
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7055 - 7062
  • [6] A heuristic optimization of Bayesian incentive-compatible cake-cutting
    Le Nguyen Hoang
    Soumis, Francois
    Zaccour, Georges
    COMPUTERS & OPERATIONS RESEARCH, 2016, 75 : 76 - 82
  • [7] Incentive-Compatible Forecasting Competitions
    Witkowski, Jens
    Freeman, Rupert
    Vaughan, Jennifer Wortman
    Pennock, David M.
    Krausee, Andreas
    MANAGEMENT SCIENCE, 2023, 69 (03) : 1354 - 1374
  • [8] Incentive-compatible interdomain routing
    Feigenbaum, Joan
    Ramachandran, Vijay
    Schapira, Michael
    DISTRIBUTED COMPUTING, 2011, 23 (5-6) : 301 - 319
  • [9] Incentive-Compatible Diffusion Auctions
    Li, Bin
    Hao, Dong
    Zhao, Dengji
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 231 - 237
  • [10] Incentive-Compatible Forecasting Competitions
    Witkowski, Jens
    Freeman, Rupert
    Vaughan, Jennifer Wortman
    Pennock, David M.
    Krause, Andreas
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 1282 - 1289