Prediction of intra-abdominal injury using natural language processing of electronic medical record data

被引:1
|
作者
Danna, Giovanna [1 ]
Garg, Ravi [2 ]
Buchheit, Joanna [2 ]
Patel, Radha [1 ]
Zhan, Tiannan [2 ]
Ellyn, Alexander [1 ]
Maqbool, Farhan [2 ]
Yala, Linda [2 ]
Moklyak, Yuriy [2 ]
Frydman, James [2 ]
Kho, Abel [2 ]
Kong, Nan [3 ]
Furmanchuk, Alona [2 ]
Lundberg, Alexander [2 ]
Stey, Anne M. [2 ]
机构
[1] Rosalind Franklin Univ, Chicago Med Sch, Chicago, IL USA
[2] Northwestern Univ, Feinberg Sch Med, Chicago, IL USA
[3] Purdue Univ, Weldon Sch Biomed Engn, W Lafayette, IN USA
基金
美国国家卫生研究院;
关键词
BLUNT TRAUMA; IDENTIFY PATIENTS; DIAGNOSIS; VALIDATION; CHILDREN; DISEASE; RISK; TOOL;
D O I
10.1016/j.surg.2024.05.042
中图分类号
R61 [外科手术学];
学科分类号
摘要
Background: This study aimed to use natural language processing to predict the presence of intraabdominal injury using unstructured data from electronic medical records. Methods: This was a random-sample retrospective observational cohort study leveraging unstructured data from injured patients taken to one of 9 acute care hospitals in an integrated health system between 2015 and 2021. Patients with International Classification of Diseases External Cause of Morbidity codes were identified. History and physical, consult, progress, and radiology report text from the first 8 hours of care were abstracted. Annotator dyads independently annotated encounters' text files to establish ground truth regarding whether intra-abdominal injury occurred. Features were extracted from text using natural language processing techniques, bag of words, and principal component analysis. We tested logistic regression, random forests, and gradient boosting machine to determine accuracy, recall, and precision of natural language processing to predict intra-abdominal injury. Results: A random sample of 7,000 patient encounters of 177,127 was annotated. Only 2,951 had sufficient information to determine whether an intra-abdominal injury was present. Among those, 84 (2.9%) had an intra-abdominal injury. The concordance between annotators was 0.989. Logistic regression of features identified with bag of words and principal component analysis had the best predictive ability, with an area under the receiver operating characteristic curve of 0.9, recall of 0.73, and precision of 0.17. Text features with greatest importance included "abdomen," "pelvis," "spleen," and "hematoma." Conclusion: Natural language processing could be a screening decision support tool, which, if paired with human clinical assessment, can maximize precision of intra-abdominal injury identification.
引用
收藏
页码:577 / 585
页数:9
相关论文
共 50 条
  • [1] Electronic Medical Record Data Mining and Processing Based on Natural Language Processing
    Zhang, Shichen
    PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON MACHINE INTELLIGENCE AND DIGITAL APPLICATIONS, MIDA2024, 2024, : 212 - 217
  • [2] Prediction of severe chest injury using natural language processing from the electronic health record
    Kulshrestha, Sujay
    Dligach, Dmitriy
    Joyce, Cara
    Baker, Marshall S.
    Gonzalez, Richard
    O'Rourke, Ann P.
    Glazer, Joshua M.
    Stey, Anne
    Kruser, Jacqueline M.
    Churpek, Matthew M.
    Afshar, Majid
    INJURY-INTERNATIONAL JOURNAL OF THE CARE OF THE INJURED, 2021, 52 (02): : 205 - 212
  • [3] Cerebrovascular disease case identification in inpatient electronic medical record data using natural language processing
    Pan, Jie
    Zhang, Zilong
    Peters, Steven Ray
    Vatanpour, Shabnam
    Walker, Robin L.
    Lee, Seungwon
    Martin, Elliot A.
    Quan, Hude
    BRAIN INFORMATICS, 2023, 10 (01)
  • [4] Validation of Prediction Models for Critical Care Outcomes Using Natural Language Processing of Electronic Health Record Data
    Marafino, Ben J.
    Park, Miran
    Davies, Jason M.
    Thombley, Robert
    Luft, Harold S.
    Sing, David C.
    Kazi, Dhruv S.
    DeJong, Colette
    Boscardin, W. John
    Dean, Mitzi L.
    Dudley, R. Adams
    JAMA NETWORK OPEN, 2018, 1 (08)
  • [5] Using natural language processing of the electronic medical record for rapid prospective identification of patients
    Pakhomov, Serguei V.
    Weston, Susan
    Jacobsen, Steven
    Chute, Christopher G.
    Meverden, Ryan
    Roger, Veronique
    CIRCULATION, 2006, 113 (21) : E792 - E792
  • [6] Automated Identification of Postoperative Complications Within an Electronic Medical Record Using Natural Language Processing
    Murff, Harvey J.
    FitzHenry, Fern
    Matheny, Michael E.
    Gentry, Nancy
    Kotter, Kristen L.
    Crimin, Kimberly
    Dittus, Robert S.
    Rosen, Amy K.
    Elkin, Peter L.
    Brown, Steven H.
    Speroff, Theodore
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2011, 306 (08): : 848 - 855
  • [7] Using Natural Language Processing and Machine Learning to Identify Opioids in Electronic Health Record Data
    McDermott, Sean P.
    Wasan, Ajay D.
    JOURNAL OF PAIN RESEARCH, 2023, 16 : 2133 - 2140
  • [8] Using natural language processing to identify opioid use disorder in electronic health record data
    Singleton, Jade
    Li, Chengxi
    Akpunonu, Peter D.
    Abner, Erin L.
    Kucharska-Newton, Anna M.
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2023, 170
  • [9] THE USE OF NATURAL LANGUAGE PROCESSING TO EXTRACT DATA FROM PSG SLEEP STUDY REPORTS USING NATIONAL VHA ELECTRONIC MEDICAL RECORD DATA
    Nowakowski, S.
    Razjouyan, J.
    Naik, A. D.
    Agrawal, R.
    Velamuri, K.
    Singh, S.
    Sharafkhaneh, A.
    SLEEP, 2020, 43 : A450 - A451
  • [10] Natural Language Processing to Rapidly Identify Potential Signals for Adverse Events Using Electronic Medical Record Data: Example of Arthralgias and Vedolizumab
    Cai, Tianrun
    Kane-Wanger, Gwendolyn
    Bond, Allison
    Cagan, Andrew
    Murphy, Shawn N.
    Ananthakrishnan, Ashwin
    Liao, Katherine
    ARTHRITIS & RHEUMATOLOGY, 2016, 68