Fairness in Learning: Classic and Contextual Bandits

被引:0
|
作者
Joseph, Matthew [1 ]
Kearns, Michael [1 ]
Morgenstern, Jamie [1 ]
Roth, Aaron [1 ]
机构
[1] Univ Penn, Dept Comp & Informat Sci, Philadelphia, PA 19104 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce the study of fairness in multi-armed bandit problems. Our fairness definition demands that, given a pool of applicants, a worse applicant is never favored over a better one, despite a learning algorithm's uncertainty over the true payoffs. In the classic stochastic bandits problem we provide a provably fair algorithm based on "chained" confidence intervals, and prove a cumulative regret bound with a cubic dependence on the number of arms. We further show that any fair algorithm must have such a dependence, providing a strong separation between fair and unfair learning that extends to the general contextual case. In the general contextual case, we prove a tight connection between fairness and the KWIK (Knows What It Knows) learning model: a KWIK algorithm for a class of functions can be transformed into a provably fair contextual bandit algorithm and vice versa. This tight connection allows us to provide a provably fair algorithm for the linear contextual bandit problem with a polynomial dependence on the dimension, and to show (for a different class of functions) a worst-case exponential gap in regret between fair and non-fair learning algorithms.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Meritocratic Fairness for Infinite and Contextual Bandits
    Joseph, Matthew
    Kearns, Michael
    Morgenstern, Jamie
    Neel, Seth
    Roth, Aaron
    PROCEEDINGS OF THE 2018 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY (AIES'18), 2018, : 158 - 163
  • [2] Group Meritocratic Fairness in Linear Contextual Bandits
    Grazzi, Riccardo
    Akhavan, Arya
    Falk, John Isak Texas
    Cella, Leonardo
    Pontil, Massimiliano
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [3] Offline Contextual Bandits with High Probability Fairness Guarantees
    Metevier, Blossom
    Giguere, Stephen
    Brockman, Sarah
    Kobren, Ari
    Brun, Yuriy
    Brunskill, Emma
    Thomas, Philip S.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [4] Achieving User-Side Fairness in Contextual Bandits
    Wen Huang
    Kevin Labille
    Xintao Wu
    Dongwon Lee
    Neil Heffernan
    Human-Centric Intelligent Systems, 2022, 2 (3-4): : 81 - 94
  • [5] Metric-Free Individual Fairness with Cooperative Contextual Bandits
    Hu, Qian
    Rangwala, Huzefa
    20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020, : 182 - 191
  • [6] Contextual Bandits with Cross-Learning
    Balseiro, Santiago
    Golrezaei, Negin
    Mahdian, Mohammad
    Mirrokni, Vahab
    Schneider, Jon
    MATHEMATICS OF OPERATIONS RESEARCH, 2023, 48 (03) : 1607 - 1629
  • [7] AdaLinUCB: Opportunistic Learning for Contextual Bandits
    Guo, Xueying
    Wang, Xiaoxiao
    Liu, Xin
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2420 - 2427
  • [8] Contextual Bandits With Cross-Learning
    Balseiro, Santiago
    Golrezaei, Negin
    Mahdian, Mohammad
    Mirrokni, Vahab
    Schneider, Jon
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [9] Learning Hidden Features for Contextual Bandits
    Wang, Huazheng
    Wu, Qingyun
    Wang, Hongning
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1633 - 1642
  • [10] BanditRank: Learning to Rank Using Contextual Bandits
    Gampa, Phanideep
    Fujita, Sumio
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT III, 2021, 12714 : 259 - 271