Development and Validation of a Natural Language Processing Tool to Identify Patients Treated for Pneumonia across VA Emergency Departments

被引:30
|
作者
Jones, B. E. [1 ,2 ]
South, B. R. [3 ]
Shao, Y. [3 ]
Lu, C. C. [4 ]
Leng, J. [4 ]
Sauer, B. C. [1 ,4 ]
Gundlapalli, A. V. [1 ,3 ,5 ,6 ]
Samore, M. H. [1 ,3 ]
Zeng, Q. [1 ,7 ]
机构
[1] VA Salt Lake City Hlth Care Syst, IDEAS Ctr Innovat, 500 Foothill Dr Bldg 2, Salt Lake City, UT 84148 USA
[2] Univ Utah, Sch Med, Div Pulm & Crit Care Med, Salt Lake City, UT USA
[3] Univ Utah, Sch Med, Dept Biomed Informat, Salt Lake City, UT USA
[4] Univ Utah, Sch Med, Dept Epidemiol, Salt Lake City, UT USA
[5] Univ Utah, Sch Med, Dept Internal Med Immunol, Salt Lake City, UT USA
[6] VA Salt Lake City Hlth Care Syst, Dept Internal Med, Salt Lake City, UT USA
[7] George Washington Univ, Sch Med & Hlth Sci, Clin Res & Leadership, Washington, DC 20052 USA
来源
APPLIED CLINICAL INFORMATICS | 2018年 / 9卷 / 01期
关键词
pneumonia; natural language processing; decision-making; diagnosis; surveillance; COMMUNITY-ACQUIRED PNEUMONIA; ADULTS; PERFORMANCE; MORTALITY;
D O I
10.1055/s-0038-1626725
中图分类号
R-058 [];
学科分类号
摘要
Background Identifying pneumonia using diagnosis codes alone may be insufficient for research on clinical decision making. Natural language processing (NLP) may enable the inclusion of cases missed by diagnosis codes. Objectives This article (1) develops a NLP tool that identifies the clinical assertion of pneumonia from physician emergency department (ED) notes, and (2) compares classification methods using diagnosis codes versus NLP against a gold standard of manual chart review to identify patients initially treated for pneumonia. Methods Among a national population of ED visits occurring between 2006 and 2012 across the Veterans Affairs health system, we extracted 811 physician documents containing search terms for pneumonia for training, and 100 random documents for validation. Two reviewers annotated span-and document-level classifications of the clinical assertion of pneumonia. An NLP tool using a support vector machine was trained on the enriched documents. We extracted diagnosis codes assigned in the ED and upon hospital discharge and calculated performance characteristics for diagnosis codes, NLP, and NLP plus diagnosis codes against manual review in training and validation sets. Results Among the training documents, 51% contained clinical assertions of pneumonia; in the validation set, 9% were classified with pneumonia, of which 100% contained pneumonia search terms. After enriching with search terms, the NLP system alone demonstrated a recall/sensitivity of 0.72 (training) and 0.55 (validation), and a precision/positive predictive value (PPV) of 0.89 (training) and 0.71 (validation). ED-assigned diagnostic codes demonstrated lower recall/sensitivity (0.48 and 0.44) but higher precision/PPV (0.95 in training, 1.0 in validation); the NLP system identified more "possible-treated" cases than diagnostic coding. An approach combining NLP and ED-assigned diagnostic coding classification achieved the best performance (sensitivity 0.89 and PPV 0.80). Conclusion System-wide application of NLP to clinical text can increase capture of initial diagnostic hypotheses, an important inclusion when studying diagnosis and clinical decision-making under uncertainty.
引用
收藏
页码:122 / 128
页数:7
相关论文
共 50 条
  • [1] Development and Validation of a Natural Language Processing Tool to Identify Injuries in Infants Associated With Abuse
    Tiyyagura, Gunjan
    Asnes, Andrea G.
    Leventhal, John M.
    Shapiro, Eugene D.
    Auerbach, Marc
    Teng, Wei
    Powers, Emily
    Thomas, Amy
    Lindberg, Daniel M.
    McClelland, Justin
    Kutryb, Carol
    Polzin, Thomas
    Daughtridge, Karen
    Sevin, Virginia
    Hsiao, Allen L.
    [J]. ACADEMIC PEDIATRICS, 2022, 22 (06) : 981 - 988
  • [2] The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia
    Rixe, Nancy
    Frisch, Adam
    Wang, Zhendong
    Martin, Judith M.
    Suresh, Srinivasan
    Florin, Todd A.
    Ramgopal, Sriram
    [J]. FRONTIERS IN DIGITAL HEALTH, 2023, 5
  • [3] VALIDATION OF A NATURAL LANGUAGE PROCESSING ALGORITHM TO IDENTIFY COLONIC ADENOMAS ACROSS A HEALTH SYSTEM
    Morgan, David G.
    Chorneyko, Kathy
    Swain, Deepak
    Bowes, Barbara
    Lee, Vicki
    Tinmouth, Jill
    [J]. GASTROENTEROLOGY, 2019, 156 (06) : S56 - S56
  • [4] Natural Language Processing to identify pneumonia from radiology reports
    Dublin, Sascha
    Baldwin, Eric
    Walker, Rod L.
    Christensen, Lee M.
    Haug, Peter J.
    Jackson, Michael L.
    Nelson, Jennifer C.
    Ferraro, Jeffrey
    Carrell, David
    Chapman, Wendy W.
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2013, 22 (08) : 834 - 841
  • [5] Development and validation of an assessment tool for nursing workload in emergency departments
    Iordache, Steluta
    Elseviers, Monique
    De Cock, Rita
    Van Rompaey, Bart
    [J]. JOURNAL OF CLINICAL NURSING, 2020, 29 (5-6) : 794 - 809
  • [6] Development and validation of an admission prediction tool for emergency departments in the Netherlands
    Kraaijvanger, Nicole
    Rijpsma, Douwe
    Roovers, Lian
    van Leeuwen, Henk
    Kaasjager, Karin
    van den Brand, Lillian
    Horstink, Laura
    Edwards, Michael
    [J]. EMERGENCY MEDICINE JOURNAL, 2018, 35 (08) : 464 - 470
  • [7] Development and Validation of a Syndrome Definition to Identify Suspected Nonfatal Heroin-Involved Overdoses Treated in Emergency Departments
    Scholl, Lawrence
    Liu, Stephen
    Vivolo-Kantor, Alana
    Board, Amy
    Stein, Zachary
    Roehler, Douglas R.
    McGlone, Londell
    Hoots, Brooke E.
    Mustaquim, Desiree
    Smith, Herschel
    [J]. JOURNAL OF PUBLIC HEALTH MANAGEMENT AND PRACTICE, 2021, 27 (04): : 369 - 378
  • [8] Building a Natural Language Processing Tool to Identify Patients With High Clinical Suspicion for Kawasaki Disease from Emergency Department Notes
    Doan, Son
    Maehara, Cleo K.
    Chaparro, Juan D.
    Lu, Sisi
    Liu, Ruiling
    Graham, Amanda
    Berry, Erika
    Hsu, Chun-Nan
    Kanegaye, John T.
    Lloyd, David D.
    Ohno-Machado, Lucila
    Burns, Jane C.
    Tremoulet, Adriana H.
    [J]. ACADEMIC EMERGENCY MEDICINE, 2016, 23 (05) : 628 - 636
  • [9] DEVELOPMENT AND VALIDATION OF A NATURAL LANGUAGE PROCESSING SYSTEM TO IDENTIFY INFORMATION ON KEY TRADEOFFS IN PROSTATE CANCER TREATMENT
    Daskivich, Timothy
    Luu, Michael
    Gale, Rebecca
    Khodyakov, Dmitry
    Freedland, Stephen
    Spiegel, Brennan
    [J]. JOURNAL OF UROLOGY, 2024, 211 (05): : E392 - E393
  • [10] VALIDITY OF NATURAL LANGUAGE PROCESSING TO IDENTIFY PATIENTS WITH PROSTATE CANCER
    Thomas, Anil
    Zheng, Chengyi
    Jung, Howard
    Chang, Allen
    Kim, Brian
    Gelfond, Joy
    Slezak, Jeff
    Porter, Kim
    Jacobsen, Steven
    Chien, Gary
    [J]. JOURNAL OF UROLOGY, 2013, 189 (04): : E34 - E34