Evaluating Algorithmic Bias in 30-Day Hospital Readmission Models: Retrospective Analysis

被引:2
|
作者
Wang, H. Echo [1 ]
Weiner, Jonathan P. [1 ,2 ]
Saria, Suchi [3 ]
Kharrazi, Hadi [1 ,2 ]
机构
[1] Johns Hopkins Univ, Bloomberg Sch Publ Hlth, 624 N Broadway,Hampton House, Baltimore, MD 21205 USA
[2] Johns Hopkins Ctr Populat Hlth Informat Technol, Baltimore, MD USA
[3] Johns Hopkins Univ, Whiting Sch Engn, Baltimore, MD USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
algorithmic bias; model bias; predictive models; model fairness; health disparity; hospital readmission; retrospective analysis; ARTIFICIAL-INTELLIGENCE; MEDICARE BENEFICIARIES; HEALTH DISPARITIES; RISK; CARE; RACE; IMPLEMENTATION; VALIDATION; BLACKS; WHITES;
D O I
10.2196/47125
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: The adoption of predictive algorithms in health care comes with the potential for algorithmic bias, which could exacerbate existing disparities. Fairness metrics have been proposed to measure algorithmic bias, but their application to real -world tasks is limited. Objective: This study aims to evaluate the algorithmic bias associated with the application of common 30 -day hospital readmission models and assess the usefulness and interpretability of selected fairness metrics. Methods: We used 10.6 million adult inpatient discharges from Maryland and Florida from 2016 to 2019 in this retrospective study. Models predicting 30 -day hospital readmissions were evaluated: LACE Index, modified HOSPITAL score, and modified Centers for Medicare & Medicaid Services (CMS) readmission measure, which were applied as -is (using existing coefficients) and retrained (recalibrated with 50% of the data). Predictive performances and bias measures were evaluated for all, between Black and White populations, and between low- and other -income groups. Bias measures included the parity of false negative rate (FNR), false positive rate (FPR), 0-1 loss, and generalized entropy index. Racial bias represented by FNR and FPR differences was stratified to explore shifts in algorithmic bias in different populations. Results: The retrained CMS model demonstrated the best predictive performance (area under the curve: 0.74 in Maryland and 0.68-0.70 in Florida), and the modified HOSPITAL score demonstrated the best calibration (Brier score: 0.16-0.19 in Maryland and 0.19-0.21 in Florida). Calibration was better in White (compared to Black) populations and other -income (compared to low-income) groups, and the area under the curve was higher or similar in the Black (compared to White) populations. The retrained CMS and modified HOSPITAL score had the lowest racial and income bias in Maryland. In Florida, both of these models overall had the lowest income bias and the modified HOSPITAL score showed the lowest racial bias. In both states, the White and higher -income populations showed a higher FNR, while the Black and low-income populations resulted in a higher FPR and a higher 0-1 loss. When stratified by hospital and population composition, these models demonstrated heterogeneous algorithmic bias in different contexts and populations. Conclusions: Caution must be taken when interpreting fairness measures' face value. A higher FNR or FPR could potentially reflect missed opportunities or wasted resources, but these measures could also reflect health care use patterns and gaps in care. Simply relying on the statistical notions of bias could obscure or underplay the causes of health disparity. The imperfect health data, analytic frameworks, and the underlying health systems must be carefully considered. Fairness measures can serve as a useful routine assessment to detect disparate model performances but are insufficient to inform mechanisms or policy changes. However, such an assessment is an important first step toward data -driven improvement to address existing health disparities.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] A bias evaluation checklist for predictive models and its pilot application for 30-day hospital readmission models
    Wang, H. Echo Echo
    Landers, Matthew
    Adams, Roy
    Subbaswamy, Adarsh
    Kharrazi, Hadi
    Gaskin, Darrell J.
    Saria, Suchi
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2022, 29 (08) : 1323 - 1333
  • [2] Validation of 30-Day Pediatric Hospital Readmission Risk Prediction Models
    Carroll, Alison R.
    Hall, Matthew
    Harris, Mitch
    Carroll, Michael S.
    Auger, Katherine A.
    Davis, Matthew M.
    Goodman, Denise M.
    Williams, Derek J.
    JAMA NETWORK OPEN, 2025, 8 (02)
  • [3] Early Prediction of Unplanned 30-Day Hospital Readmission: Model Development and Retrospective Data Analysis
    Zhao, Peng
    Yoo, Illhoi
    Naqvi, Syed H.
    JMIR MEDICAL INFORMATICS, 2021, 9 (03)
  • [4] 30-Day Hospital Readmission for Granulomatosis with Polyangiitis: Analysis from National Readmission Database
    Luo, Yiming
    Jiang, Changchuan
    Molina, Ana Belen Arevalo
    Murray, Shane
    Salgado, Maria
    Xu, Jiehui
    ARTHRITIS & RHEUMATOLOGY, 2018, 70
  • [5] Intelligible Models for HealthCare: Predicting Pneumonia Risk and Hospital 30-day Readmission
    Caruana, Rich
    Lou, Yin
    Gehrke, Johannes
    Koch, Paul
    Sturm, Marc
    Elhadad, Noemie
    KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 1721 - 1730
  • [6] A meta-analysis of hospital 30-day avoidable readmission rates
    van Walraven, Carl
    Jennings, Alison
    Forster, Alan J.
    JOURNAL OF EVALUATION IN CLINICAL PRACTICE, 2012, 18 (06) : 1211 - 1218
  • [7] Prediction of 30-day unplanned hospital readmission through survival analysis
    Pons-Suner, Pedro
    Arnal, Laura
    Signol, Francois
    Mateos, M. Jose Caballero
    Martinez, Bernardo Valdivieso
    Perez-Cortes, Juan-Carlos
    HELIYON, 2023, 9 (10)
  • [8] Assessing racial bias in healthcare predictive models: Practical lessons from an empirical evaluation of 30-day hospital readmission models
    Wang, H. Echo
    Weiner, Jonathan P.
    Saria, Suchi
    Lehmann, Harold
    Kharrazi, Hadi
    JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 156
  • [9] Accuracy of Prospective Predictions of 30-Day Hospital Readmission
    Reddy, Maya
    Schneiders-Rice, Susan
    Pierce, Casey
    Fitzmaurice, Garrett
    Busch, Alisa
    PSYCHIATRIC SERVICES, 2016, 67 (02) : 244 - 247
  • [10] Hospital Performance Measures and 30-day Readmission Rates
    Mihaela S. Stefan
    Penelope S. Pekow
    Wato Nsa
    Aruna Priya
    Lauren E. Miller
    Dale W. Bratzler
    Michael B. Rothberg
    Robert J. Goldberg
    Kristie Baus
    Peter K. Lindenauer
    Journal of General Internal Medicine, 2013, 28 : 377 - 385