Developing a manually annotated clinical document corpus to identify phenotypic information for inflammatory bowel disease

被引:23
|
作者
South, Brett R. [1 ,2 ,3 ]
Shen, Shuying [1 ,2 ,3 ]
Jones, Makoto [2 ]
Garvin, Jennifer [1 ,2 ]
Samore, Matthew H. [1 ,2 ,3 ]
Chapman, Wendy W. [4 ]
Gundlapalli, Adi V. [1 ,2 ,3 ]
机构
[1] IDEAS Ctr, VA Salt Lake City Hlth Care Syst, Salt Lake City, UT 84148 USA
[2] Univ Utah, Div Clin Epidemiol, Dept Internal Med, Salt Lake City, UT 84148 USA
[3] Univ Utah, Dept Biomed Informat, Salt Lake City, UT 84148 USA
[4] Univ Pittsburgh, Dept Biomed Informat, Pittsburgh, PA USA
来源
BMC BIOINFORMATICS | 2009年 / 10卷
关键词
Inflammatory Bowel Disease; Annotation Schema; Concept Attribute; Annotate Corpus; Clinical Document;
D O I
10.1186/1471-2105-10-S9-S12
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Natural Language Processing (NLP) systems can be used for specific Information Extraction (IE) tasks such as extracting phenotypic data from the electronic medical record (EMR). These data are useful for translational research and are often found only in free text clinical notes. A key required step for IE is the manual annotation of clinical corpora and the creation of a reference standard for (1) training and validation tasks and (2) to focus and clarify NLP system requirements. These tasks are time consuming, expensive, and require considerable effort on the part of human reviewers. Methods: Using a set of clinical documents from the VA EMR for a particular use case of interest we identify specific challenges and present several opportunities for annotation tasks. We demonstrate specific methods using an open source annotation tool, a customized annotation schema, and a corpus of clinical documents for patients known to have a diagnosis of Inflammatory Bowel Disease (IBD). We report clinician annotator agreement at the document, concept, and concept attribute level. We estimate concept yield in terms of annotated concepts within specific note sections and document types. Results: Annotator agreement at the document level for documents that contained concepts of interest for IBD using estimated Kappa statistic (95% CI) was very high at 0.87 (0.82, 0.93). At the concept level, F-measure ranged from 0.61 to 0.83. However, agreement varied greatly at the specific concept attribute level. For this particular use case (IBD), clinical documents producing the highest concept yield per document included GI clinic notes and primary care notes. Within the various types of notes, the highest concept yield was in sections representing patient assessment and history of presenting illness. Ancillary service documents and family history and plan note sections produced the lowest concept yield. Conclusion: Challenges include defining and building appropriate annotation schemas, adequately training clinician annotators, and determining the appropriate level of information to be annotated. Opportunities include narrowing the focus of information extraction to use case specific note types and sections, especially in cases where NLP systems will be used to extract information from large repositories of electronic clinical note documents.
引用
收藏
页数:11
相关论文
共 50 条
  • [11] Inflammatory Bowel Disease, Clinical
    Morrison, G.
    Selby, W. S.
    Hetzel, D. J.
    Gibson, P. R.
    JOURNAL OF GASTROENTEROLOGY AND HEPATOLOGY, 2009, 24 : A311 - A311
  • [12] Clinical and Phenotypic Differences in Inflammatory Bowel Disease Among Arab and Jewish Children in Israel
    Firas Rinawi
    Amit Assa
    Husam Bashir
    Sarit Peleg
    Raanan Shamir
    Digestive Diseases and Sciences, 2017, 62 : 2095 - 2101
  • [13] Clinical and Phenotypic Differences in Inflammatory Bowel Disease Among Arab and Jewish Children in Israel
    Rinawi, Firas
    Assa, Amit
    Bashir, Husam
    Peleg, Sarit
    Shamir, Raanan
    DIGESTIVE DISEASES AND SCIENCES, 2017, 62 (08) : 2095 - 2101
  • [14] ASSOCIATION OF INFLAMMATORY BOWEL DISEASE AND AUTOIMMUNE HEPATITIS: EPIDEMIOLOGY, PHENOTYPIC FEATURES AND CLINICAL SIGNIFICANCE
    Lytvyak, Ellina
    Montano-Loza, Aldo J.
    GASTROENTEROLOGY, 2024, 166 (05) : S1744 - S1744
  • [15] PHYSIOLOGICAL METRICS COLLECTED FROM WEARABLE DEVICES IDENTIFY INFLAMMATORY AND CLINICAL INFLAMMATORY BOWEL DISEASE FLARES
    Hirten, Robert
    Danieletto, Matteo
    Landell, Kyle
    Lyu, Jinyan
    Whang, Jessica
    Zweig, Micol
    Helmus, Drew
    Rodrigues, Jovita
    Bottinger, Erwin
    Suarez-Farinas, Mayte
    Nadkarni, Girish
    Fayad, Zahi
    Keefer, Laurie
    Sands, Bruce
    GASTROENTEROLOGY, 2023, 164 (04) : S28 - S28
  • [16] PHYSIOLOGICAL METRICS COLLECTED FROM WEARABLE DEVICES IDENTIFY INFLAMMATORY AND CLINICAL INFLAMMATORY BOWEL DISEASE FLARES
    Hirten, Robert
    Danieletto, Matteo
    Landell, Kyle
    Lyu, Jinyan
    Whang, Jessica
    Zweig, Micol
    Helmus, Drew
    Rodrigues, Jovita
    Bottinger, Erwin
    Suarez-Farinas, Mayte
    Nadkarni, Girish
    Fayad, Zahi
    Keefer, Laurie
    Sands, Bruce
    INFLAMMATORY BOWEL DISEASES, 2023, 29 : S21 - S22
  • [17] Information Resources and Inflammatory Bowel Disease
    Butcher, Rhys Owain
    Limdi, Jimmy K.
    INFLAMMATORY BOWEL DISEASES, 2011, 17 (08) : E89 - E90
  • [18] Phenotypic Concordance in Familial Inflammatory Bowel Disease (IBD)
    Cabre, Eduard
    Manosa, Miriam
    Garcia-Sanchez, Valle
    Gutierrez, Ana
    Panes, Julian
    Esteve, Maria
    Penalva, Mireia
    Nos, Pilar
    Merino, Olga
    Diaz, Angel Ponferrada
    Gisbert, Javier P.
    Garcia-Planella, Esther
    Cena, Gloria
    Cabriada, Jose Luis
    Montoro, Miguel A.
    Domenech, Eugeni
    GASTROENTEROLOGY, 2011, 140 (05) : S735 - S735
  • [19] Description of Clinical Presentations of Inflammatory Bowel Disease (IBD) in Individuals Who Identify as LGBTQIA
    BouSaba, Joelle
    Ghusn, Wissam
    Abboud, Donna Maria
    Yang, Catherine
    Comstock, Bryce
    Neto, Manuel Braga
    Chedid, Victor
    AMERICAN JOURNAL OF GASTROENTEROLOGY, 2022, 117 (10): : S603 - S604
  • [20] Development of a Clinical Care Pathway to Identify and Treat Malnutrition in Patients with Inflammatory Bowel Disease
    Hwang, Caroline
    Reddy, Swapna
    Issokson, Kelly
    Giguere-Rich, Catherine
    Tinsley, Andrew
    Bray, Harry
    Lum, Donald
    Aguilar, Humberto
    Zisman, Timothy
    Younes, Ziad
    Nguyen, Anne
    Crate, Damara
    Spinrad, Amelia
    Oberai, Ridhima
    Weaver, Alandra
    Siegel, Corey
    Melmed, Gil
    INFLAMMATORY BOWEL DISEASES, 2017, 23 : S23 - S23