The Impact of Differential Feature Under-reporting on Algorithmic Fairness

被引:0
|
作者
Akpinar, Nil-Jana [1 ]
Lipton, Zachary C. [2 ]
Chouldechova, Alexandra [2 ]
机构
[1] Amazon AWS AI ML, Seattle, WA 98109 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
MISSING DATA; HEALTH-CARE; BIAS; MISCLASSIFICATION; VARIABLES; MODEL;
D O I
10.1145/3630106.3658977
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predictive risk models in the public sector are commonly developed using administrative data that is more complete for subpopulations that more greatly rely on public services. In the United States, for instance, information on health care utilization is routinely available to government agencies for individuals supported by Medicaid and Medicare, but not for the privately insured. Critiques of public sector algorithms have identified such "differential feature underreporting" as a driver of disparities in algorithmic decision-making. Yet this form of data bias remains understudied from a technical viewpoint. While prior work has examined the fairness impacts of additive feature noise and features that are clearly marked as missing, little is known about the setting of data missingness absent indicators (i.e. differential feature under-reporting). In this work, we study an analytically tractable model of differential feature underreporting to characterize the impact of under-report on algorithmic fairness. We demonstrate how standard missing data methods typically fail to mitigate bias in this setting, and propose a new set of augmented loss and imputation methods. Our results show that, in real world data settings, under-reporting typically exacerbates disparities. The proposed solution methods show some success in mitigating disparities attributable to feature under-reporting.
引用
收藏
页码:1355 / 1382
页数:28
相关论文
共 50 条
  • [1] GIST - The Impact of Under-Reporting?
    Kwong, M. M.
    Guthrie, L.
    Choi, A. H.
    Greas, M.
    Tolentino, R.
    Shope, T.
    Senthil, M.
    Reeves, M.
    Solomon, N.
    Selleck, M.
    ANNALS OF SURGICAL ONCOLOGY, 2020, 27 (SUPPL 1) : S192 - S192
  • [2] The under-reporting of research impact
    Kostoff, RN
    SCIENTIST, 1998, 12 (18): : 9 - 9
  • [3] UNDER-REPORTING OF AIDS
    STUART, J
    SOUTH AFRICAN MEDICAL JOURNAL, 1993, 83 (09): : 689 - 689
  • [4] Automated Feature Engineering for Algorithmic Fairness
    Salazar, Ricardo
    Neutatz, Felix
    Abedjan, Ziawasch
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (09): : 1694 - 1702
  • [5] Causal Feature Selection for Algorithmic Fairness
    Galhotra, Sainyam
    Shanmugam, Karthikeyan
    Sattigeri, Prasanna
    Varshney, Kush R.
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 276 - 285
  • [6] Under-reporting of tuberculosis disease
    Garcia-de Cruz, Susana
    Aldea-Mansilla, Carmen
    Sordo, Valentin del Villar
    MEDICINA CLINICA, 2017, 149 (03): : 131 - 131
  • [7] Under-reporting of fisheries in Caribbean
    Kingston, P.
    MARINE POLLUTION BULLETIN, 2016, 111 (1-2) : 4 - 4
  • [8] Modelling under-reporting in epidemics
    Gamado, Kokouvi M.
    Streftaris, George
    Zachary, Stan
    JOURNAL OF MATHEMATICAL BIOLOGY, 2014, 69 (03) : 737 - 765
  • [9] Modelling under-reporting in epidemics
    Kokouvi M. Gamado
    George Streftaris
    Stan Zachary
    Journal of Mathematical Biology, 2014, 69 : 737 - 765
  • [10] Under-reporting of maritime accidents
    Psarros, George
    Skjong, Rolf
    Eide, Magnus Strandmyr
    ACCIDENT ANALYSIS AND PREVENTION, 2010, 42 (02): : 619 - 625