Real-life Performance of Fairness Interventions: Introducing a New Benchmarking Dataset for Fair ML

Cited by: 1
Authors
Lenders, Daphne [1 ]
Calders, Toon [1 ]
Affiliations
[1] Univ Antwerp, Antwerp, Belgium
Keywords
Fair ML; Fairness Evaluation; Benchmarking Dataset
DOI
10.1145/3555776.3577634
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Some researchers evaluate their fair Machine Learning (ML) algorithms by simulating data with a fair and a biased version of its labels. The fair labels reflect the labels individuals deserve, while the biased labels reflect labels obtained through a biased decision process. Given such data, fair algorithms are evaluated by measuring how well they can predict the fair labels after being trained on the biased ones. The main problem with these approaches is that they are based on simulated data, which is unlikely to capture the full complexity and noise of real-life decision problems. In this paper, we show how we created a new, more realistic dataset with both fair and biased labels. For this purpose, we started with an existing dataset containing information about high school students and whether they passed an exam or not. Through a human experiment, in which participants estimated the students' school performance given some description of them, we collected a biased version of these labels. We show how this new dataset can be used to evaluate fair ML algorithms, and how some fairness interventions that perform well in the traditional evaluation schemes do not necessarily perform well with respect to the unbiased labels in our dataset, leading to new insights into the performance of debiasing techniques.
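The evaluation scheme described in the abstract (train on biased labels, then score predictions against the fair labels) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the function names, the use of a positive-rate gap (often called demographic parity difference) as the "traditional" fairness measure, and the eight-row toy data, which is invented and is not taken from the paper or its dataset.

```python
from typing import Dict, Sequence


def accuracy(predictions: Sequence[int], labels: Sequence[int]) -> float:
    """Fraction of predictions that match the given labels."""
    if len(predictions) != len(labels) or len(labels) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(p == y for p, y in zip(predictions, labels)) / len(labels)


def positive_rate_gap(predictions: Sequence[int], groups: Sequence[int]) -> float:
    """Absolute difference in positive-prediction rate between groups 0 and 1."""
    rates = []
    for g in (0, 1):
        members = [p for p, s in zip(predictions, groups) if s == g]
        if not members:
            raise ValueError(f"group {g} has no members")
        rates.append(sum(members) / len(members))
    return abs(rates[0] - rates[1])


def evaluate(
    predictions: Sequence[int],
    fair_labels: Sequence[int],
    biased_labels: Sequence[int],
    groups: Sequence[int],
) -> Dict[str, float]:
    """Score one set of predictions against both label versions."""
    return {
        "accuracy_vs_biased": accuracy(predictions, biased_labels),
        "accuracy_vs_fair": accuracy(predictions, fair_labels),
        "positive_rate_gap": positive_rate_gap(predictions, groups),
    }


# Hypothetical toy data (invented for illustration, not from the paper).
groups = [0, 0, 0, 0, 1, 1, 1, 1]         # sensitive attribute
fair_labels = [1, 1, 0, 0, 1, 1, 0, 0]    # what individuals deserve
biased_labels = [1, 1, 0, 0, 1, 0, 0, 0]  # one group-1 positive was denied

# A baseline that simply reproduces the biased labels it was trained on.
baseline_predictions = list(biased_labels)

# A hypothetical intervention that equalises positive rates across groups
# but restores the positive label to the wrong individual in group 1.
intervention_predictions = [1, 1, 0, 0, 1, 0, 1, 0]

for name, preds in [
    ("baseline", baseline_predictions),
    ("intervention", intervention_predictions),
]:
    print(name, evaluate(preds, fair_labels, biased_labels, groups))
```

In this toy case the intervention closes the positive-rate gap yet agrees with the fair labels less often than the baseline does, which is the kind of discrepancy that can only be observed when both label versions are available. The numbers say nothing about how any real intervention behaves on the authors' dataset.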
Pages: 350-357
Number of pages: 8
Related papers
50 in total
  • [1] COGNITIVE PLASTICITY IN REAL-LIFE INTERVENTIONS
    Unknown
    [J]. GERONTOLOGIST, 2013, 53 : 104 - 104
  • [2] A New RGB-D Gesture Video Dataset for Real-life Scenarios
    Lu, Zhendong
    Xiao, Guojian
    Luo, Zihao
    Jin, Panji
    Li, Kuan
    Yin, Jianping
    [J]. 2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
  • [3] DIALOGSUM: A Real-Life Scenario Dialogue Summarization Dataset
    Chen, Yulong
    Liu, Yang
    Chen, Liang
    Zhang, Yue
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 5062 - 5074
  • [4] LifeQA: A Real-life Dataset for Video Question Answering
    Castro, Santiago
    Azab, Mahmoud
    Stroud, Jonathan C.
    Noujaim, Cristina
    Wang, Ruoyao
    Deng, Jia
    Mihalcea, Rada
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4352 - 4358
  • [5] Fairness, self-interest, and cooperation in a real-life conflict
    Mueller, Markus M.
    Kals, Elisabeth
    Maes, Juergen
    [J]. JOURNAL OF APPLIED SOCIAL PSYCHOLOGY, 2008, 38 (03) : 684 - 704
  • [6] Introducing the Extraordinary Leuven Cement: Raw Materials, Process, Performance, and First Real-Life Applications
    Pontikes, Yiannis
    [J]. REWAS 2019: MANUFACTURING THE CIRCULAR MATERIALS ECONOMY, 2019, : 165 - 166
  • [7] INTRODUCING A REAL-LIFE SITUATION INTO FOREIGN-LANGUAGE CLASSROOM
    CARTON, D
    [J]. MODERN LANGUAGE JOURNAL, 1977, 61 (1-2): : 13 - 16
  • [8] A measure of fairness: An investigative framework to explore perceptions of fairness and justice in real-life social conflict
    Gross, Catherine
    [J]. HUMAN ECOLOGY REVIEW, 2008, 15 (02) : 130 - 140
  • [9] Bot Classification for Real-Life Highly Class-Imbalanced Dataset
    Harun, Sarah
    Bhuiyan, Tanveer Hossain
    Zhang, Song
    Medal, Hugh
    Bian, Linkan
    [J]. 2017 IEEE 15TH INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, 15TH INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, 3RD INTL CONF ON BIG DATA INTELLIGENCE AND COMPUTING AND CYBER SCIENCE AND TECHNOLOGY CONGRESS(DASC/PICOM/DATACOM/CYBERSCI, 2017, : 565 - 572
  • [10] A Novel Dataset for Real-Life Evaluation of Facial Expression Recognition Methodologies
    Siddiqi, Muhammad Hameed
    Ali, Maqbool
    Idris, Muhammad
    Banos, Oresti
    Lee, Sungyoung
    Choo, Hyunseung
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2016, 2016, 9673 : 89 - 95