Bias Mimicking: A Simple Sampling Approach for Bias Mitigation

Cited by: 6
Authors
Qraitem, Maan [1]
Saenko, Kate [1, 2]
Plummer, Bryan A. [1]
Affiliations
[1] Boston Univ, Boston, MA 02215 USA
[2] MIT IBM Watson Lab, Cambridge, MA USA
Funding
National Science Foundation (USA)
DOI
10.1109/CVPR52729.2023.01945
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Prior work has shown that visual recognition datasets frequently underrepresent bias groups B (e.g., Female) within class labels Y (e.g., Programmers). This dataset bias can lead to models that learn spurious correlations between class labels and bias groups such as age, gender, or race. Most recent methods that address this problem require significant architectural changes or additional loss functions that need more hyper-parameter tuning. Alternatively, data sampling baselines from the class imbalance literature (e.g., Undersampling, Upweighting), which can often be implemented in a single line of code and often have no hyper-parameters, offer a cheaper and more efficient solution. However, these methods suffer from significant shortcomings. For example, Undersampling drops a significant part of the input distribution per epoch, while Oversampling repeats samples, causing overfitting. To address these shortcomings, we introduce a new class-conditioned sampling method: Bias Mimicking (BM). The method is based on the observation that if the bias distribution of a class c, i.e., P_D(B | Y = c), is mimicked across every c' ≠ c, then Y and B are statistically independent. Using this notion, BM, through a novel training procedure, ensures that the model is exposed to the entire distribution per epoch without repeating samples. Consequently, Bias Mimicking improves underrepresented groups' accuracy of sampling methods by 3% over four benchmarks while maintaining and sometimes improving performance over non-sampling methods. Code: https://github.com/mqraitem/Bias-Mimicking
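The mimicking observation in the abstract can be illustrated with a small sketch. The code below is not the authors' implementation and does not cover their training procedure; it only shows the subsampling idea under stated assumptions: labels and bias groups are given as integer arrays, the function name `mimic_indices` and its parameters are hypothetical, and each non-target class is subsampled as little as possible so that its bias distribution matches P_D(B | Y = c) of the target class c.

```python
import numpy as np

def mimic_indices(y, b, target_class, seed=0):
    """Return indices of a subset in which every class c' != target_class
    has (approximately) the bias distribution of target_class.

    Hypothetical helper for illustration only; not taken from the paper's code.
    """
    rng = np.random.default_rng(seed)
    y = np.asarray(y)
    b = np.asarray(b)
    groups = np.unique(b)

    # Empirical P_D(B | Y = target_class), one entry per bias group.
    in_target = y == target_class
    p = np.array([(b[in_target] == g).mean() for g in groups])

    # The target class is kept whole.
    keep = [np.flatnonzero(in_target)]

    for c in np.unique(y):
        if c == target_class:
            continue
        counts = np.array([np.sum((y == c) & (b == g)) for g in groups])
        support = p > 0
        # Largest subset size whose per-group quotas all fit in the data.
        n_total = np.min(counts[support] / p[support])
        quota = np.floor(n_total * p + 1e-9).astype(int)
        quota = np.minimum(quota, counts)
        for g, q in zip(groups, quota):
            pool = np.flatnonzero((y == c) & (b == g))
            # Sample q items without replacement from this (class, group) cell.
            keep.append(rng.permutation(pool)[:q])

    return np.sort(np.concatenate(keep))

# Toy data: class 0 is 80% group 0; class 1 is 40% group 0.
y = np.array([0] * 100 + [1] * 100)
b = np.array([0] * 80 + [1] * 20 + [0] * 40 + [1] * 60)
idx = mimic_indices(y, b, target_class=0)
for c in (0, 1):
    sel = y[idx] == c
    print(c, sel.sum(), (b[idx][sel] == 0).mean())
```

In the resulting subset, the share of each bias group is the same in every class, so within that subset the class label carries no information about the bias group. Quotas are rounded down, so the match is only approximate when the counts do not divide evenly.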
Pages: 20311-20320
Page count: 10