Nuanced Metrics for Measuring Unintended Bias with Real Data for Text Classification

被引:132
|
作者
Borkan, Daniel [1 ]
Dixon, Lucas [1 ]
Sorensen, Jeffrey [1 ]
Thain, Nithum [1 ]
Vasserman, Lucy [1 ]
机构
[1] Jigsaw, Bellevue, WA USA
关键词
D O I
10.1145/3308560.3317593
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Unintended bias in Machine Learning can manifest as systemic differences in performance for different demographic groups, potentially compounding existing challenges to fairness in society at large. In this paper, we introduce a suite of threshold-agnostic metrics that provide a nuanced view of this unintended bias, by considering the various ways that a classifier's score distribution can vary across designated groups. We also introduce a large new test set of online comments with crowd-sourced annotations for identity references. We use this to show how our metrics can be used to find new and potentially subtle unintended bias in existing public models.
引用
收藏
页码:491 / 500
页数:10
相关论文
共 50 条
  • [1] Measuring and Mitigating Unintended Bias in Text Classification
    Dixon, Lucas
    Li, John
    Sorensen, Jeffrey
    Thain, Nithum
    Vasserman, Lucy
    PROCEEDINGS OF THE 2018 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY (AIES'18), 2018, : 67 - 73
  • [2] Mitigating Multi-class Unintended Demographic Bias in Text Classification with Adversarial Learning
    Pan, Le
    Yao, Lina
    Zhang, Wenjie
    Wang, Xianzhi
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2022, 2022, 13724 : 386 - 394
  • [3] Bias analysis in text classification for highly skewed data
    Tang, L
    Liu, H
    FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, : 781 - 784
  • [4] Avoiding Unintended Bias in Toxicity Classification with Neural Networks
    Morzhov, Sergey
    PROCEEDINGS OF THE 26TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION FRUCT, 2020, : 314 - 320
  • [5] (Dis)Incentivizing Patient Satisfaction Metrics: The Unintended Consequences of Institutional Bias
    Sotto-Santiago, Sylk
    Slaven, James E.
    Rohr-Kirchgraber, Theresa
    HEALTH EQUITY, 2019, 3 (01) : 13 - 18
  • [6] Measuring metrics: what diversity indicators are most appropriate for different forms of data bias?
    Qiao, Huijie
    Orr, Michael C.
    Hughes, Alice C.
    ECOGRAPHY, 2024, 2024 (09)
  • [7] Active learning in automated text classification: a case study exploring bias in predicted model performance metrics
    Varghese A.
    Hong T.
    Hunter C.
    Agyeman-Badu G.
    Cawley M.
    Environment Systems and Decisions, 2019, 39 (3) : 269 - 280
  • [8] Evaluation metrics for measuring bias in search engine results
    Gizem Gezici
    Aldo Lipani
    Yucel Saygin
    Emine Yilmaz
    Information Retrieval Journal, 2021, 24 : 85 - 113
  • [9] Evaluation metrics for measuring bias in search engine results
    Gezici, Gizem
    Lipani, Aldo
    Saygin, Yucel
    Yilmaz, Emine
    INFORMATION RETRIEVAL JOURNAL, 2021, 24 (02): : 85 - 113
  • [10] Clinical Text Classification in Cancer Real-World Data in Spanish
    Moreno-Barea, Francisco J.
    Mesa, Hector
    Ribelles, Nuria
    Alba, Emilio
    Jerez, Jose M.
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING, IWBBIO 2023, PT I, 2023, 13919 : 482 - 496