No-Regret and Incentive-Compatible Online Learning

被引:0
|
作者
Freeman, Rupert [1 ]
Pennock, David M. [2 ]
Podimata, Chara [3 ]
Vaughan, Jennifer Wortman [4 ]
机构
[1] Univ Virginia, Darden Sch Business, Charlottesville, VA 22903 USA
[2] Rutgers State Univ, DIMACS, New Brunswick, NJ 08854 USA
[3] Harvard Univ, Cambridge, MA 02138 USA
[4] Microsoft Res NYC, New York, NY 10012 USA
基金
美国国家科学基金会;
关键词
PREDICTION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study online learning settings in which experts act strategically to maximize their influence on the learning algorithm's predictions by potentially misreporting their beliefs about a sequence of binary events. Our goal is twofold. First, we want the learning algorithm to be no-regret with respect to the best fixed expert in hindsight. Second, we want incentive compatibility, a guarantee that each expert's best strategy is to report his true beliefs about the realization of each event. To achieve this goal, we build on the literature on wagering mechanisms, a type of multi-agent scoring rule. We provide algorithms that achieve no regret and incentive compatibility for myopic experts for both the full and partial information settings. In experiments on datasets from FiveThirtyEight, our algorithms have regret comparable to classic no-regret algorithms, which are not incentive-compatible. Finally, we identify an incentive-compatible algorithm for forward-looking strategic agents that exhibits diminishing regret in practice.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] No-Regret and Incentive-Compatible Online Learning
    Freeman, Rupert
    Pennock, David M.
    Podimata, Chara
    Vaughan, Jennifer Wortman
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [2] Incentive-compatible online auctions for digital goods
    Bar-Yossef, Z
    Hildrum, K
    Wu, F
    PROCEEDINGS OF THE THIRTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2002, : 964 - 970
  • [3] Online incentive-compatible mechanisms for traffic intersection auctions
    Rey, David
    Levin, Michael W.
    Dixit, Vinayak V.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 293 (01) : 229 - 247
  • [4] On incentive-compatible estimators
    Eliaz, Kfir
    Spiegler, Ran
    GAMES AND ECONOMIC BEHAVIOR, 2022, 132 : 204 - 220
  • [5] Incentive-Compatible Diffusion
    Babichenko, Yakov
    Dean, Oren
    Tennenholtz, Moshe
    WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018, : 1379 - 1388
  • [6] Incentive-Compatible Classification
    Babichenko, Yakov
    Dean, Oren
    Tennenholtz, Moshe
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7055 - 7062
  • [7] A Reduction from Reinforcement Learning to No-Regret Online Learning
    Cheng, Ching-An
    des Combes, Remi Tachet
    Boots, Byron
    Gordon, Geoff
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 3514 - 3523
  • [8] No-regret Online Learning over Riemannian Manifolds
    Wang, Xi
    Tu, Zhipeng
    Hong, Yiguang
    Wu, Yingyi
    Shi, Guodong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [9] Incentive-Compatible Learning of Reserve Prices for Repeated Auctions
    Kanoria, Yash
    Nazerzadeh, Hamid
    OPERATIONS RESEARCH, 2021, 69 (02) : 509 - 524
  • [10] Incentive-Compatible Online Mechanisms for Resource Provisioning and Allocation in Clouds
    Mashayekhy, Lena
    Nejad, Mahyar Movahed
    Grosu, Daniel
    Vasilakos, Athanasios V.
    2014 IEEE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2014, : 312 - 319