Fairness in Learning: Classic and Contextual Bandits

Cited by: 0

Authors:

Joseph, Matthew [1]

Kearns, Michael [1]

Morgenstern, Jamie [1]

Roth, Aaron [1]

Affiliations:

[1] Univ Penn, Dept Comp & Informat Sci, Philadelphia, PA 19104 USA

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016) | 2016 / Vol. 29

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We introduce the study of fairness in multi-armed bandit problems. Our fairness definition demands that, given a pool of applicants, a worse applicant is never favored over a better one, despite a learning algorithm's uncertainty over the true payoffs. In the classic stochastic bandits problem we provide a provably fair algorithm based on "chained" confidence intervals, and prove a cumulative regret bound with a cubic dependence on the number of arms. We further show that any fair algorithm must have such a dependence, providing a strong separation between fair and unfair learning that extends to the general contextual case. In the general contextual case, we prove a tight connection between fairness and the KWIK (Knows What It Knows) learning model: a KWIK algorithm for a class of functions can be transformed into a provably fair contextual bandit algorithm and vice versa. This tight connection allows us to provide a provably fair algorithm for the linear contextual bandit problem with a polynomial dependence on the dimension, and to show (for a different class of functions) a worst-case exponential gap in regret between fair and non-fair learning algorithms.
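The "chained" confidence-interval idea in the abstract can be illustrated with a small sketch. It reads the idea as follows: two arms are linked when their confidence intervals overlap, a chain is the transitive closure of such links, and the learner picks uniformly at random among the arms chained to the arm with the highest upper bound, so that no arm is favored over another that might still be better. This reading, the function names `chained_arms` and `fair_bandit_sketch`, the Hoeffding-style interval width, and the assumption that rewards lie in [0, 1] are all illustrative choices, not details taken from this record.

```python
import math
import random


def chained_arms(lower, upper):
    """Arms whose intervals are linked, through overlaps, to the top arm.

    Scanning arms by decreasing upper bound, an arm joins the chain while
    its upper bound reaches the lowest lower bound collected so far.
    """
    order = sorted(range(len(upper)), key=lambda i: upper[i], reverse=True)
    chain = [order[0]]
    chain_low = lower[order[0]]
    for i in order[1:]:
        if upper[i] < chain_low:
            break  # this arm, and every later one, lies strictly below the chain
        chain.append(i)
        chain_low = min(chain_low, lower[i])
    return chain


def fair_bandit_sketch(pull, n_arms, horizon, delta=0.05, rng=None):
    """Play uniformly at random among the chained arms in every round.

    `pull(arm)` is a hypothetical callback returning a reward in [0, 1].
    """
    rng = rng or random.Random(0)
    counts = [0] * n_arms
    sums = [0.0] * n_arms
    for _ in range(horizon):
        lower, upper = [], []
        for i in range(n_arms):
            if counts[i] == 0:
                lower.append(0.0)  # nothing observed yet: trivial interval
                upper.append(1.0)
                continue
            mean = sums[i] / counts[i]
            # Assumed Hoeffding-style width with a union bound over arms and rounds.
            width = math.sqrt(
                math.log(2 * n_arms * horizon / delta) / (2 * counts[i])
            )
            lower.append(max(0.0, mean - width))
            upper.append(min(1.0, mean + width))
        arm = rng.choice(chained_arms(lower, upper))
        counts[arm] += 1
        sums[arm] += pull(arm)
    return counts
```

For example, with intervals [0.0, 0.4], [0.3, 0.6] and [0.5, 1.0], the first and third arms do not overlap directly, yet all three are chained through the middle one, so each would be played with equal probability under this sketch.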

Pages: 9

50 items in total

[1] Meritocratic Fairness for Infinite and Contextual Bandits
Joseph, Matthew
Kearns, Michael
Morgenstern, Jamie
Neel, Seth
Roth, Aaron
PROCEEDINGS OF THE 2018 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY (AIES'18), 2018: 158 - 163
[2] Group Meritocratic Fairness in Linear Contextual Bandits
Grazzi, Riccardo
Akhavan, Arya
Falk, John Isak Texas
Cella, Leonardo
Pontil, Massimiliano
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
[3] Offline Contextual Bandits with High Probability Fairness Guarantees
Metevier, Blossom
Giguere, Stephen
Brockman, Sarah
Kobren, Ari
Brun, Yuriy
Brunskill, Emma
Thomas, Philip S.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[4] Achieving User-Side Fairness in Contextual Bandits
Wen Huang
Kevin Labille
Xintao Wu
Dongwon Lee
Neil Heffernan
Human-Centric Intelligent Systems, 2022, 2 (3-4): 81 - 94
[5] Metric-Free Individual Fairness with Cooperative Contextual Bandits
Hu, Qian
Rangwala, Huzefa
20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020: 182 - 191
[6] Contextual Bandits with Cross-Learning
Balseiro, Santiago
Golrezaei, Negin
Mahdian, Mohammad
Mirrokni, Vahab
Schneider, Jon
MATHEMATICS OF OPERATIONS RESEARCH, 2023, 48 (03) : 1607 - 1629
[7] AdaLinUCB: Opportunistic Learning for Contextual Bandits
Guo, Xueying
Wang, Xiaoxiao
Liu, Xin
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019: 2420 - 2427
[8] Contextual Bandits With Cross-Learning
Balseiro, Santiago
Golrezaei, Negin
Mahdian, Mohammad
Mirrokni, Vahab
Schneider, Jon
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[9] Learning Hidden Features for Contextual Bandits
Wang, Huazheng
Wu, Qingyun
Wang, Hongning
CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016: 1633 - 1642
[10] BanditRank: Learning to Rank Using Contextual Bandits
Gampa, Phanideep
Fujita, Sumio
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT III, 2021, 12714 : 259 - 271