Evaluation of a Natural Language Processing Approach to Identify Diagnostic Errors and Analysis of Safety Learning System Case Review Data: Retrospective Cohort Study

被引:0
|
作者
Tabaie, Azade [1 ,2 ]
Tran, Alberta [3 ]
Calabria, Tony [3 ]
Bennett, Sonita S. [1 ]
Milicia, Arianna [4 ]
Weintraub, William [5 ,6 ]
Gallagher, William James [6 ,7 ]
Yosaitis, John [6 ,8 ]
Schubel, Laura C. [4 ]
Hill, Mary A. [9 ,10 ]
Smith, Kelly Michelle [9 ,10 ]
Miller, Kristen [4 ,6 ]
机构
[1] MedStar Hlth Res Inst, Ctr Biostat Informat & Data Sci, 3007 Tilden St NW, Washington, DC 20008 USA
[2] Georgetown Univ, Sch Med, Dept Emergency Med, Washington, DC USA
[3] MedStar Hlth Res Inst, Dept Qual & Safety, Washington, DC USA
[4] MedStar Hlth Res Inst, Natl Ctr Human Factors Healthcare, Washington, DC USA
[5] MedStar Hlth Res Inst, Populat Hlth, Washington, DC USA
[6] Georgetown Univ, Sch Med, Washington, DC USA
[7] MedStar Hlth Georgetown Washington Hosp Ctr, Family Med Residency Program, Washington, DC USA
[8] MedStar Inst Innovat, MedStar Simulat Training & Educ Lab SiTEL, Washington, DC USA
[9] Univ Toronto, Inst Hlth Policy Management & Evaluat, Toronto, ON, Canada
[10] Michael Garron Hosp, Toronto, ON, Canada
关键词
diagnostic error; electronic health records; machine learning; natural language processing; NLP; mortality; hospital; risk; lengthof stay; patient harm; diagnostic; EHR;
D O I
10.2196/50935
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Diagnostic errors are an underappreciated cause of preventable mortality in hospitals and pose a risk for severepatient harm and increase hospital length of stay. Objective: This study aims to explore the potential of machine learning and natural language processing techniques in improvingdiagnostic safety surveillance. We conducted a rigorous evaluation of the feasibility and potential to use electronic health recordsclinical notes and existing case review data. Methods: Safety Learning System case review data from 1 large health system composed of 10 hospitals in the mid-Atlanticregion of the United States from February 2016 to September 2021 were analyzed. The case review outcome included opportunitiesfor improvement including diagnostic opportunities for improvement. To supplement case review data, electronic health recordclinical notes were extracted and analyzed. A simple logistic regression model along with 3 forms of logistic regression models(ie, Least Absolute Shrinkage and Selection Operator, Ridge, and Elastic Net) with regularization functions was trained on thisdata to compare classification performances in classifying patients who experienced diagnostic errors during hospitalization.Further, statistical tests were conducted to find significant differences between female and male patients who experienced diagnosticerrors.Results: In total, 126 (7.4%) patients (of 1704) had been identified by case reviewers as having experienced at least 1 diagnosticerror. Patients who had experienced diagnostic error were grouped by sex: 59 (7.1%) of the 830 women and 67 (7.7%) of the 874men. Among the patients who experienced a diagnostic error, female patients were older (median 72, IQR 66-80 vs median 67,IQR 57-76; P=.02), had higher rates of being admitted through general or internal medicine (69.5% vs 47.8%; P=.01), lowerrates of cardiovascular-related admitted diagnosis (11.9% vs 28.4%; P=.02), and lower rates of being admitted through neurology department (2.3% vs 13.4%; P=.04). The Ridge model achieved the highest area under the receiver operating characteristic curve(0.885), specificity (0.797), positive predictive value (PPV; 0.24), and F1-score (0.369) in classifying patients who were at higherrisk of diagnostic errors among hospitalized patients .Conclusions: Our findings demonstrate that natural language processing can be a potential solution to more effectively identifyingand selecting potential diagnostic error cases for review and therefore reducing the case review burden
引用
收藏
页数:13
相关论文
共 17 条
  • [1] The Effects of Natural Language Processing on Big Data Analysis: Sentiment Analysis Case Study
    Khader, Mariam
    Awajan, Arafat
    Al-Naymat, Ghazi
    [J]. 2018 19TH INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2018, : 45 - 51
  • [2] Evaluation of a Natural Language Processing Approach to Identify Social Determinants of Health in Electronic Health Records in a Diverse Community Cohort
    Rouillard, Christopher J.
    Nasser, Mahmoud A.
    Hu, Haihong
    Roblin, Douglas W.
    [J]. MEDICAL CARE, 2022, 60 (03) : 248 - 255
  • [3] Natural Language Processing of Clinical Notes to Identify Mental Illness and Substance Use Among People Living with HIV: Retrospective Cohort Study
    Ridgway, Jessica P.
    Uvin, Arno
    Schmitt, Jessica
    Oliwa, Tomasz
    Almirol, Ellen
    Devlin, Samantha
    Schneider, John
    [J]. JMIR MEDICAL INFORMATICS, 2021, 9 (03)
  • [4] Combining Natural Language Processing and Federated Learning for Consumer Complaint Analysis: A Case Study on Laptops
    Tahsin M.U.
    Shanto M.S.H.
    Rahman R.M.
    [J]. SN Computer Science, 4 (5)
  • [5] Natural language processing and network analysis in patients withdrawing from life-sustaining treatments: a retrospective cohort study
    Tsai, Wei-Chin
    Tsai, Yun-Cheng
    Kuo, Kuang-Cheng
    Cheng, Shao-Yi
    Tsai, Jaw-Shiun
    Chiu, Tai-Yuan
    Huang, Hsien-Liang
    [J]. BMC PALLIATIVE CARE, 2022, 21 (01)
  • [6] Natural language processing and network analysis in patients withdrawing from life-sustaining treatments: a retrospective cohort study
    Wei-Chin Tsai
    Yun-Cheng Tsai
    Kuang-Cheng Kuo
    Shao-Yi Cheng
    Jaw-Shiun Tsai
    Tai-Yuan Chiu
    Hsien-Liang Huang
    [J]. BMC Palliative Care, 21
  • [7] Learning Analytics Applied to Clinical Diagnostic Reasoning Using a Natural Language Processing-Based Virtual Patient Simulator: Case Study
    Furlan, Raffaello
    Gatti, Mauro
    Mene, Roberto
    Shiffer, Dana
    Marchiori, Chiara
    Levra, Alessandro Giaj
    Saturnino, Vincenzo
    Brunetta, Enrico
    Dipaola, Franca
    [J]. JMIR MEDICAL EDUCATION, 2022, 8 (01):
  • [8] A novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study
    Li, Bo
    Wang, Beilei
    Zhuang, Pengjie
    Cao, Hongwei
    Wu, Shengyong
    Tan, Zhendong
    Gao, Suizhi
    Li, Penghao
    Jing, Wei
    Shao, Zhuo
    Zheng, Kailian
    Wu, Lele
    Gao, Bai
    Wang, Yang
    Jiang, Hui
    Guo, Shiwei
    He, Liang
    Yang, Yan
    Jin, Gang
    [J]. INTERNATIONAL JOURNAL OF SURGERY, 2023, 109 (11) : 3476 - 3489
  • [9] An intelligent patent recommender adopting machine learning approach for natural language processing: A case study for smart machinery technology mining
    Trappey, Amy
    Trappey, Charles V.
    Hsieh, Alex
    [J]. TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2021, 164
  • [10] Using natural language processing to extract self-harm and suicidality data from a clinical sample of patients with eating disorders: a retrospective cohort study
    Cliffe, Charlotte
    Seyedsalehi, Aida
    Vardavoulia, Katerina
    Bittar, Andre
    Velupillai, Sumithra
    Shetty, Hitesh
    Schmidt, Ulrike
    Dutta, Rina
    [J]. BMJ OPEN, 2021, 11 (12):