Potential Biases in Machine Learning Algorithms Using Electronic Health Record Data

被引:600
|
作者
Gianfrancesco, Milena A. [1 ]
Tamang, Suzanne [2 ]
Yazdany, Jinoos [1 ]
Schmajuk, Gabriela [1 ,3 ]
机构
[1] Univ Calif San Francisco, Dept Med, Div Rheumatol, 513 Parnassus Ave, San Francisco, CA 94143 USA
[2] Stanford Univ, Ctr Populat Hlth Sci, Palo Alto, CA 94304 USA
[3] Vet Affairs Med Ctr, San Francisco, CA 94121 USA
基金
美国国家卫生研究院; 美国医疗保健研究与质量局;
关键词
CARE; DISEASE; RACE;
D O I
10.1001/jamainternmed.2018.3763
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
A promise of machine learning in health care is the avoidance of biases in diagnosis and treatment; a computer algorithm could objectively synthesize and interpret the data in the medical record. Integration of machine learning with clinical decision support tools, such as computerized alerts or diagnostic support, may offer physicians and others who provide health care targeted and timely information that can improve clinical decisions. Machine learning algorithms, however, may also be subject to biases. The biases include those related to missing data and patients not identified by algorithms, sample size and underestimation, and misclassification and measurement error. There is concern that biases and deficiencies in the data used by machine learning algorithms may contribute to socioeconomic disparities in health care. This Special Communication outlines the potential biases that may be introduced into machine learning-based clinical decision support tools that use electronic health record data and proposes potential solutions to the problems of overreliance on automation, algorithms based on biased data, and algorithms that do not provide information that is clinically meaningful. Existing health care disparities should not be amplified by thoughtless or excessive reliance on machines.
引用
收藏
页码:1544 / 1547
页数:4
相关论文
共 50 条
  • [1] Predictive Modeling Using Artificial Intelligence and Machine Learning Algorithms on Electronic Health Record Data Advantages and Challenges
    Patton, Michael J.
    Liu, Vincent X.
    [J]. CRITICAL CARE CLINICS, 2023, 39 (04) : 647 - 673
  • [2] Using Machine Learning to Identify Health Outcomes from Electronic Health Record Data
    Wong, Jenna
    Murray Horwitz, Mara
    Zhou, Li
    Toh, Sengwee
    [J]. CURRENT EPIDEMIOLOGY REPORTS, 2018, 5 (04) : 331 - 342
  • [3] Using Machine Learning to Identify Health Outcomes from Electronic Health Record Data
    Jenna Wong
    Mara Murray Horwitz
    Li Zhou
    Sengwee Toh
    [J]. Current Epidemiology Reports, 2018, 5 : 331 - 342
  • [4] Classifying Pseudogout Using Machine Learning Approaches With Electronic Health Record Data
    Tedeschi, Sara K.
    Cai, Tianrun
    He, Zeling
    Ahuja, Yuri
    Hong, Chuan
    Yates, Katherine A.
    Dahal, Kumar
    Xu, Chang
    Lyu, Houchen
    Yoshida, Kazuki
    Solomon, Daniel H.
    Cai, Tianxi
    Liao, Katherine P.
    [J]. ARTHRITIS CARE & RESEARCH, 2021, 73 (03) : 442 - 448
  • [5] Classifying Pseudogout Using Machine Learning Approaches with Electronic Health Record Data
    Tedeschi, Sara K.
    Cai, Tianrun
    He, Zeling
    Ahuja, Yuri
    Hong, Chuan
    Yates, Katherine
    Dahal, Kumar
    Xu, Chang
    Lyu, Houchen
    Yoshida, Kazuki
    Solomon, Daniel
    Cai, Tianxi
    Liao, Katherine
    [J]. ARTHRITIS & RHEUMATOLOGY, 2019, 71
  • [6] Identification of postoperative complications using electronic health record data and machine learning
    Bronsert, Michael
    Singh, Abhinav B.
    Henderson, William G.
    Hammermeister, Karl
    Meguid, Robert A.
    Colborn, Kathryn L.
    [J]. AMERICAN JOURNAL OF SURGERY, 2020, 220 (01): : 114 - 119
  • [7] Assessing stroke severity using electronic health record data: a machine learning approach
    Emily Kogan
    Kathryn Twyman
    Jesse Heap
    Dejan Milentijevic
    Jennifer H. Lin
    Mark Alberts
    [J]. BMC Medical Informatics and Decision Making, 20
  • [8] Machine Learning Prognostic Models for Gastrointestinal Bleeding Using Electronic Health Record Data
    Shung, Dennis
    Laine, Loren
    [J]. AMERICAN JOURNAL OF GASTROENTEROLOGY, 2020, 115 (08): : 1199 - 1200
  • [9] Preoperative Prediction of Postoperative Infections Using Machine Learning and Electronic Health Record Data
    Zhuang, Yaxu
    Dyas, Adam
    Meguid, Robert A.
    Henderson, William G.
    Bronsert, Michael
    Madsen, Helen
    Colborn, Kathryn L.
    [J]. ANNALS OF SURGERY, 2024, 279 (04) : 720 - 726
  • [10] Assessing stroke severity using electronic health record data: a machine learning approach
    Kogan, Emily
    Twyman, Kathryn
    Heap, Jesse
    Milentijevic, Dejan
    Lin, Jennifer H.
    Alberts, Mark
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (01)