Online Learning with an Unknown Fairness Metric

Authors
Gillen, Stephen [1]
Jung, Christopher [1]
Kearns, Michael [1]
Roth, Aaron [1]
Affiliations
[1] Univ Penn, Philadelphia, PA 19104 USA
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We consider the problem of online learning in the linear contextual bandits setting, but in which there are also strong individual fairness constraints governed by an unknown similarity metric. These constraints demand that we select similar actions or individuals with approximately equal probability [?], which may be at odds with optimizing reward, thus modeling settings where profit and social policy are in tension. We assume we learn about an unknown Mahalanobis similarity metric from only weak feedback that identifies fairness violations, but does not quantify their extent. This is intended to represent the interventions of a regulator who "knows unfairness when he sees it" but nevertheless cannot enunciate a quantitative fairness metric over individuals. Our main result is an algorithm in the adversarial context setting that has a number of fairness violations that depends only logarithmically on T, while obtaining an optimal O(√T) regret bound to the best fair policy.
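The weak feedback described in the abstract can be illustrated with a minimal sketch. It rests on assumptions the record does not spell out: the Mahalanobis metric is parametrized as d(x, y) = ||A(x - y)||_2 for some matrix A, and a pair counts as a violation when the gap between its selection probabilities exceeds that distance. The names `mahalanobis_distance` and `flag_violations` are hypothetical, and this is not the authors' algorithm, only a picture of a regulator who reports which pairs are treated unfairly without reporting by how much.

```python
import math

def mahalanobis_distance(A, x, y):
    """Assumed form d(x, y) = ||A (x - y)||_2; A is a list of rows."""
    diff = [xi - yi for xi, yi in zip(x, y)]
    projected = [sum(a * d for a, d in zip(row, diff)) for row in A]
    return math.sqrt(sum(v * v for v in projected))

def flag_violations(A, contexts, probs):
    """Return index pairs (i, j) whose selection probabilities differ by
    more than d(x_i, x_j). Only the pairs are reported, never the size
    of the violation, mimicking the weak feedback (an assumption here)."""
    flagged = []
    for i in range(len(contexts)):
        for j in range(i + 1, len(contexts)):
            gap = abs(probs[i] - probs[j])
            if gap > mahalanobis_distance(A, contexts[i], contexts[j]):
                flagged.append((i, j))
    return flagged

# Hypothetical metric that only "sees" the first feature.
A = [[1.0, 0.0],
     [0.0, 0.0]]
contexts = [[0.0, 0.0], [0.1, 5.0], [1.0, 0.0]]
probs = [0.7, 0.2, 0.1]  # selection probabilities chosen by the learner

violations = flag_violations(A, contexts, probs)
print(violations)
```

In this toy instance the first two individuals are close under the assumed metric (they differ mainly in a feature the metric ignores) yet receive very different selection probabilities, so that pair is the one flagged. A learner in the setting of the abstract would see only such flags, never the matrix A itself.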
Pages: 10