Online Learning with an Unknown Fairness Metric

Cited by: 0
Authors
Gillen, Stephen [1]
Jung, Christopher [1]
Kearns, Michael [1]
Roth, Aaron [1]
Affiliations
[1] Univ Penn, Philadelphia, PA 19104 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Theory of artificial intelligence];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We consider the problem of online learning in the linear contextual bandits setting, but in which there are also strong individual fairness constraints governed by an unknown similarity metric. These constraints demand that we select similar actions or individuals with approximately equal probability [?], which may be at odds with optimizing reward, thus modeling settings where profit and social policy are in tension. We assume we learn about an unknown Mahalanobis similarity metric from only weak feedback that identifies fairness violations, but does not quantify their extent. This is intended to represent the interventions of a regulator who "knows unfairness when he sees it" but nevertheless cannot enunciate a quantitative fairness metric over individuals. Our main result is an algorithm in the adversarial context setting that has a number of fairness violations that depends only logarithmically on T, while obtaining an optimal $O(\sqrt{T})$ regret bound to the best fair policy.
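The feedback model in the abstract can be illustrated with a minimal Python sketch. It rests on assumptions that the abstract does not state: the Mahalanobis distance is parameterised as the norm of A(x - y) for a matrix A, and a pair counts as a violation when the gap between its selection probabilities exceeds that distance. The names `mahalanobis_distance`, `fairness_violations`, and the toy values are hypothetical and not taken from the paper. The oracle returns only which pairs are flagged, not by how much, mirroring the "weak feedback" described above.

```python
import numpy as np


def mahalanobis_distance(A, x, y):
    """Distance ||A (x - y)||_2; the matrix A is an assumed parameterisation."""
    return float(np.linalg.norm(A @ (x - y)))


def fairness_violations(A, contexts, probs):
    """Return index pairs (i, j) whose selection probabilities differ by more
    than their distance. Only the pairs are reported, never the size of the
    gap, which is the weak-feedback assumption of this sketch."""
    violations = []
    k = len(contexts)
    for i in range(k):
        for j in range(i + 1, k):
            gap = abs(probs[i] - probs[j])
            if gap > mahalanobis_distance(A, contexts[i], contexts[j]):
                violations.append((i, j))
    return violations


# Toy example (hypothetical numbers): the second feature barely matters
# to the metric, so individuals 0 and 1 are "similar".
A = np.diag([1.0, 0.1])
contexts = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0]])
probs = np.array([0.6, 0.3, 0.1])
flagged = fairness_violations(A, contexts, probs)
uniform_flagged = fairness_violations(A, contexts, np.full(3, 1.0 / 3.0))
```

In this toy setup a learner would only observe the flagged pairs in `flagged`; a uniform selection rule is trivially fair under this definition because every probability gap is zero.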
Pages: 10