Assessing and Mitigating Bias in Medical Artificial Intelligence The Effects of Race and Ethnicity on a Deep Learning Model for ECG Analysis

被引:129
|
作者
Noseworthy, Peter A. [1 ,2 ]
Attia, Zachi, I [1 ]
Brewer, LaPrincess C. [1 ]
Hayes, Sharonne N. [1 ,3 ]
Yao, Xiaoxi [1 ,2 ,4 ]
Kapa, Suraj [1 ]
Friedman, Paul A. [1 ]
Lopez-Jimenez, Francisco [1 ]
机构
[1] Mayo Clin, Dept Cardiovasc Med, 200 1st St SW, Rochester, MN 55905 USA
[2] Mayo Clin, Robert D & Patricia E Kern Ctr Sci Hlth Care Deli, Rochester, MN 55905 USA
[3] Mayo Clin, Off Divers & Inclus, Rochester, MN 55905 USA
[4] Mayo Clin, Div Hlth Care Policy & Res, Dept Hlth Sci Res, Rochester, MN 55905 USA
来源
基金
美国国家卫生研究院;
关键词
artificial intelligence; electrocardiography; humans; machine learning; United States; DYSFUNCTION; SEX;
D O I
10.1161/CIRCEP.119.007988
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: Deep learning algorithms derived in homogeneous populations may be poorly generalizable and have the potential to reflect, perpetuate, and even exacerbate racial/ethnic disparities in health and health care. In this study, we aimed to (1) assess whether the performance of a deep learning algorithm designed to detect low left ventricular ejection fraction using the 12-lead ECG varies by race/ethnicity and to (2) determine whether its performance is determined by the derivation population or by racial variation in the ECG. Methods: We performed a retrospective cohort analysis that included 97 829 patients with paired ECGs and echocardiograms. We tested the model performance by race/ethnicity for convolutional neural network designed to identify patients with a left ventricular ejection fraction <= 35% from the 12-lead ECG. Results: The convolutional neural network that was previously derived in a homogeneous population (derivation cohort, n=44 959; 96.2% non-Hispanic white) demonstrated consistent performance to detect low left ventricular ejection fraction across a range of racial/ethnic subgroups in a separate testing cohort (n=52 870): non-Hispanic white (n=44 524; area under the curve [AUC], 0.931), Asian (n=557; AUC, 0.961), black/African American (n=651; AUC, 0.937), Hispanic/Latino (n=331; AUC, 0.937), and American Indian/Native Alaskan (n=223; AUC, 0.938). In secondary analyses, a separate neural network was able to discern racial subgroup category (black/African American [AUC, 0.84], and white, non-Hispanic [AUC, 0.76] in a 5-class classifier), and a network trained only in non-Hispanic whites from the original derivation cohort performed similarly well across a range of racial/ethnic subgroups in the testing cohort with an AUC of at least 0.930 in all racial/ethnic subgroups. Conclusions: Our study demonstrates that while ECG characteristics vary by race, this did not impact the ability of a convolutional neural network to predict low left ventricular ejection fraction from the ECG. We recommend reporting of performance among diverse ethnic, racial, age, and sex groups for all new artificial intelligence tools to ensure responsible use of artificial intelligence in medicine.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Artificial intelligence-Enabled deep learning model for multimodal biometric fusion
    Byeon, Haewon
    Raina, Vikas
    Sandhu, Mukta
    Shabaz, Mohammad
    Keshta, Ismail
    Soni, Mukesh
    Matrouk, Khaled
    Singh, Pavitar Parkash
    Lakshmi, T. R. Vijaya
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (33) : 80105 - 80128
  • [32] Deep learning approach to unmask hidden salt effects in the era of artificial intelligence
    Manolis, Athanasios J.
    Kallistratos, Manolis S.
    EUROPEAN HEART JOURNAL, 2023, 44 (42) : 4458 - 4460
  • [33] Classification of Continuous ECG Segments - Performance Analysis of a Deep Learning Model
    Barbosa, Luis C. N.
    Lopes, Diogo
    Escrivaes, Ines
    Moreira, Antonio H. J.
    Carvalho, Vitor
    Vilaca, Joao L.
    Morais, Pedro
    2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [34] A Survey on Explainable Artificial Intelligence (XAI) Techniques for Visualizing Deep Learning Models in Medical Imaging
    Bhati, Deepshikha
    Neha, Fnu
    Amiruzzaman, Md
    JOURNAL OF IMAGING, 2024, 10 (10)
  • [35] Artificial intelligence of medical things for disease detection using ensemble deep learning and attention mechanism
    Djenouri, Youcef
    Belhadi, Asma
    Yazidi, Anis
    Srivastava, Gautam
    Lin, Jerry Chun-Wei
    EXPERT SYSTEMS, 2024, 41 (06)
  • [36] A Review of Artificial Intelligence's Neural Networks (Deep Learning) Applications in Medical Diagnosis and Prediction
    Djavanshir, G. Reza
    Chen, Xinrui
    Yang, Wenhao
    IT PROFESSIONAL, 2021, 23 (03) : 58 - 61
  • [37] Endoluminal larynx anatomy model - towards facilitating deep learning and defining standards for medical images evaluation with artificial intelligence algorithms
    Nogal, Piotr
    Buchwald, Mikolaj
    Staskiewicz, Michalina
    Kupinski, Szymon
    Pukacki, Juliusz
    Mazurek, Cezary
    Jackowska, Joanna
    Wierzbicka, Malgorzata
    POLISH JOURNAL OF OTOLARYNGOLOGY, 2022, 76 (05):
  • [38] An Investigation into the Impact of Deep Learning Model Choice on Sex and Race Bias in Cardiac MR Segmentation
    Lee, Tiarna
    Puyol-Anton, Esther
    Ruijsink, Bram
    Aitcheson, Keana
    Shi, Miaojing
    King, Andrew P.
    CLINICAL IMAGE-BASED PROCEDURES, FAIRNESS OF AI IN MEDICAL IMAGING, AND ETHICAL AND PHILOSOPHICAL ISSUES IN MEDICAL IMAGING, CLIP 2023, FAIMI 2023, EPIMI 2023, 2023, 14242 : 215 - 224
  • [39] Deep Learning and Artificial Intelligence Applied to Model Speech and Language in Parkinson's Disease
    Escobar-Grisales, Daniel
    Rios-Urrego, Cristian David
    Orozco-Arroyave, Juan Rafael
    DIAGNOSTICS, 2023, 13 (13)
  • [40] Artificial intelligence model driven by transfer learning for image-based medical diagnosis
    Okuwobi, Idowu Paul
    Ding, Zhixiang
    Wan, Jifeng
    Ding, Shuxue
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (04) : 4601 - 4612