Extracting COVID-19 diagnoses and symptoms from clinical text: A new annotated corpus and neural event extraction framework

被引:29
|
作者
Lybarger, Kevin [1 ]
Ostendorf, Mari [2 ]
Thompson, Matthew [3 ]
Yetisgen, Meliha [1 ]
机构
[1] Univ Washington, Biomed & Hlth Informat, Box 358047, Seattle, WA 98109 USA
[2] Univ Washington, Dept Elect & Comp Engn, Campus Box 352500 185, Seattle, WA 98195 USA
[3] Univ Washington, Dept Family Med, Box 354696, Seattle, WA 98195 USA
基金
美国国家卫生研究院;
关键词
COVID-19; Coronavirus; Machine learning; Natural language processing; Information extraction; METAMAP;
D O I
10.1016/j.jbi.2021.103761
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Coronavirus disease 2019 (COVID-19) is a global pandemic. Although much has been learned about the novel coronavirus since its emergence, there are many open questions related to tracking its spread, describing symptomology, predicting the severity of infection, and forecasting healthcare utilization. Free-text clinical notes contain critical information for resolving these questions. Data-driven, automatic information extraction models are needed to use this text-encoded information in large-scale studies. This work presents a new clinical corpus, referred to as the COVID-19 Annotated Clinical Text (CACT) Corpus, which comprises 1,472 notes with detailed annotations characterizing COVID-19 diagnoses, testing, and clinical presentation. We introduce a span-based event extraction model that jointly extracts all annotated phenomena, achieving high performance in identifying COVID-19 and symptom events with associated assertion values (0.83-0.97 F1 for events and 0.73-0.79 F1 for assertions). Our span-based event extraction model outperforms an extractor built on MetaMapLite for the identification of symptoms with assertion values. In a secondary use application, we predicted COVID-19 test results using structured patient data (e.g. vital signs and laboratory results) and automatically extracted symptom information, to explore the clinical presentation of COVID-19. Automatically extracted symptoms improve COVID-19 prediction performance, beyond structured data alone.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Optimal feature selection using novel flamingo search algorithm for classification of COVID-19 patients from clinical text
    Mahdi, Amir Yasseen
    Yuhaniz, Siti Sophiayati
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (03) : 5268 - 5297
  • [42] Clinical Characteristics and Mortality in Patients With Cancer and COVID-19 From the Epicenter in New York City
    Jones, B.
    Lehrer, E. J.
    Salgado, L. Resende
    Shafaee, Z.
    Osborn, W.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2021, 111 (03): : E504 - E504
  • [43] Essential Spine Surgery during the COVID-19 Pandemic: A Comprehensive Framework for Clinical Practice from a Specialty Orthopedic Hospital in New York City
    Soffin, Ellen M.
    Reisener, Marie-Jacqueline
    Sama, Andrew A.
    Beckman, James D.
    Liguori, Gregory A.
    Lebl, Darren R.
    Girardi, Federico P.
    Cammisa, Frank P.
    Hughes, Alexander P.
    HSS JOURNAL, 2020, 16 (1_SUPPL) : 29 - 35
  • [44] New Gastrointestinal Symptoms Are Common in Inflammatory Bowel Disease Patients With COVID-19: Data From an International Registry
    Ungaro, Ryan C.
    Agrawal, Manasi
    Brenner, Erica J.
    Zhang, Xian
    Colombel, Jean-Frederic
    Kappelman, Michael D.
    Reinisch, Walter
    INFLAMMATORY BOWEL DISEASES, 2022, 28 (02) : 314 - 317
  • [45] NEW GASTROINTESTINAL SYMPTOMS ARE COMMON IN INFLAMMATORY BOWEL DISEASE PATIENTS WITH COVID-19: DATA FROM AN INTERNATIONAL REGISTRY
    Ungaro, Ryan C.
    Agrawal, Manasi
    Brenner, Erica J.
    Zhang, Xian
    Colombel, Jean Frederic
    Kappelman, Michael
    Reinisch, Walter
    GASTROENTEROLOGY, 2021, 160 (06) : S332 - S333
  • [46] Psilocybin Therapy for Clinicians With Symptoms of Depression From Frontline Care During the COVID-19 Pandemic:A Randomized Clinical Trial
    Back, Anthony L.
    Freeman-Young, Timara K.
    Morgan, Ladybird
    Sethi, Tanmeet
    Baker, Kelsey K.
    Myers, Susanna
    Mcgregor, Bonnie A.
    Harvey, Kalin
    Tai, Marlene
    Kollefrath, Austin
    Thomas, Brandon J.
    Sorta, Dennis
    Kaelen, Mendel
    Kelmendi, Benjamin
    Gooley, Ted A.
    JAMA NETWORK OPEN, 2024, 7 (12)
  • [47] Evaluation of clinical outcomes of patients with mild symptoms of coronavirus disease 2019 (COVID-19) discharged from the emergency department
    Bagi, Hamidreza Morteza
    Soleimanpour, Maryam
    Abdollahi, Fariba
    Soleimanpour, Hassan
    PLOS ONE, 2021, 16 (10):
  • [48] Epidemiological Assessment of COVID-19 Clinical Symptoms and Its Associated Factors from Banten Districts: The Role of Gender Aspects
    Sari, Flori R.
    Suwarsono, Erike A.
    Adhiyanto, Chris
    Habibi, Ahmad Azwar
    Siregar, Alyya Siddiqa
    Ariany, Devy
    Muniroh, Muniroh
    Jauharoh, Siti NurAisyah
    BANGLADESH JOURNAL OF MEDICAL SCIENCE, 2022, 21 (04): : 782 - 787
  • [49] ConceptWAS: A high-throughput method for early identification of COVID-19 presenting symptoms and characteristics from clinical notes
    Zhao, Juan
    Grabowska, Monika E.
    Kerchberger, Vern Eric
    Smith, Joshua C.
    Eken, H. Nur
    Feng, QiPing
    Peterson, Josh F.
    Rosenbloom, S. Trent
    Johnson, Kevin B.
    Wei, Wei-Qi
    JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 117
  • [50] Modelling Aotearoa New Zealand's COVID-19 protection framework and the transition away from the elimination strategy
    Vattiato, Giorgia
    Lustig, Audrey
    Maclaren, Oliver
    Binny, Rachelle N. N.
    Hendy, Shaun C. C.
    Harvey, Emily
    O'Neale, Dion
    Plank, Michael J. J.
    ROYAL SOCIETY OPEN SCIENCE, 2023, 10 (02):