Annotating and detecting phenotypic information for chronic obstructive pulmonary disease

被引:7
|
作者
Ju, Meizhi [1 ]
Short, Andrea D. [2 ]
Thompson, Paul [1 ]
Bakerly, Nawar Diar [3 ,4 ]
Gkoutos, Georgios, V [5 ,6 ,7 ,8 ,9 ,10 ]
Tsaprouni, Loukia [11 ]
Ananiadou, Sophia [1 ]
机构
[1] Univ Manchester, Natl Ctr Text Min, Sch Comp Sci, Manchester, Lancs, England
[2] Univ Manchester, Fac Biol Med & Hlth, Manchester, Lancs, England
[3] Univ Manchester, Salford Royal NHS Fdn Trust, Manchester, Lancs, England
[4] Univ Manchester, Sch Hlth Sci, Manchester, Lancs, England
[5] Univ Birmingham, Coll Med & Dent Sci, Ctr Computat Biol, Inst Canc & Genom Sci, Birmingham, W Midlands, England
[6] Univ Hosp Birmingham NHS Fdn Trust, Inst Translat Med, Birmingham, W Midlands, England
[7] MRC Hlth Data Res UK HDR UK, Birmingham, W Midlands, England
[8] NIHR Expt Canc Med Ctr, Birmingham, W Midlands, England
[9] NIHR Surg Reconstruct & Microbiol Res Ctr, Birmingham, W Midlands, England
[10] NIHR Biomed Res Ctr, Birmingham, W Midlands, England
[11] Birmingham City Univ, Ctr Life & Sport Sci, Sch Hlth Sci, Birmingham, W Midlands, England
基金
欧盟地平线“2020”; 美国国家科学基金会;
关键词
chronic obstructive pulmonary disease; text mining; natural language processing; phenotype; information extraction; ELECTRONIC HEALTH RECORD; PERSONALIZED MEDICINE; CLINICAL PHENOTYPES; PRECISION MEDICINE; SYSTEM; COPD; IDENTIFICATION; CORPUS; TEXT; EXTRACTION;
D O I
10.1093/jamiaopen/ooz009
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objectives: Chronic obstructive pulmonary disease (COPD) phenotypes cover a range of lung abnormalities. To allow text mining methods to identify pertinent and potentially complex information about these phenotypes from textual data, we have developed a novel annotated corpus, which we use to train a neural network-based named entity recognizer to detect fine-grained COPD phenotypic information. Materials and methods: Since COPD phenotype descriptions often mention other concepts within them (proteins, treatments, etc.), our corpus annotations include both outermost phenotype descriptions and concepts nested within them. Our neural layered bidirectional long short-term memory conditional random field (BiLSTM-CRF) network firstly recognizes nested mentions, which are fed into subsequent BiLSTM-CRF layers, to help to recognize enclosing phenotype mentions. Results: Our corpus of 30 full papers (available at: http://www.nactem.ac.uk/COPD) is annotated by experts with 27 030 phenotype-related concept mentions, most of which are automatically linked to UMLS Metathesaurus concepts. When trained using the corpus, our BiLSTM-CRF network outperforms other popular approaches in recognizing detailed phenotypic information. Discussion: Information extracted by our method can facilitate efficient location and exploration of detailed information about phenotypes, for example, those specifically concerning reactions to treatments. Conclusion: The importance of our corpus for developing methods to extract fine-grained information about COPD phenotypes is demonstrated through its successful use to train a layered BiLSTM-CRF network to extract phenotypic information at various levels of granularity. The minimal human intervention needed for training should permit ready adaption to extracting phenotypic information about other diseases.
引用
收藏
页码:261 / 271
页数:11
相关论文
共 50 条
  • [1] Phenotypic Heterogeneity of Chronic Obstructive Pulmonary Disease
    Garcia-Aymerich, Judith
    Agusti, Alvar
    Barbera, Joan A.
    Belda, Jose
    Farrero, Eva
    Ferrer, Antoni
    Ferrer, Jaume
    Galdiz, Juan B.
    Gea, Joaquim
    Gomez, Federico P.
    Monso, Eduard
    Morera, Josep
    Roca, Josep
    Sauleda, Jaume
    Anto, Josep M.
    ARCHIVOS DE BRONCONEUMOLOGIA, 2009, 45 (03): : 129 - 138
  • [2] Bronchial asthma and chronic obstructive pulmonary disease - phenotypic similarities
    Fal, Andrzej M.
    Pawlowicz, Robert
    ALERGIA ASTMA IMMUNOLOGIA, 2012, 17 (01): : 11 - 15
  • [3] The Association Of Adiponectin With Phenotypic Parameters In Chronic Obstructive Pulmonary Disease
    Carolan, B. J.
    Williams, A. A.
    Bowler, R. P.
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2013, 187
  • [4] Bronchodilator responsiveness as a phenotypic characteristic of established chronic obstructive pulmonary disease
    Albert, Paul
    Agusti, Alvar
    Edwards, Lisa
    Tal-Singer, Ruth
    Yates, Julie
    Bakke, Per
    Celli, Bartolome R.
    Coxson, Harvey O.
    Crim, Courtney
    Lomas, David A.
    MacNee, William
    Miller, Bruce
    Rennard, Stephen
    Silverman, Edwin K.
    Vestbo, Jorgen
    Wouters, Emiel
    Calverley, Peter
    THORAX, 2012, 67 (08) : 701 - 708
  • [5] Phenotypic predictors of chronic obstructive pulmonary disease exacerbations in the BIOAIR study
    Strandberg, Karin
    Janson, Christer
    Dahlen, Barbro
    Kupczyk, Maciek
    Dahlen, Sven-Erik
    EUROPEAN RESPIRATORY JOURNAL, 2016, 48
  • [6] Phenotypic Profiling of Immune Cells and Their Mediators in Chronic Obstructive Pulmonary Disease
    Sampath, Meghashree
    Bade, Geetanjali
    Guleria, Randeep
    Mohan, Anant
    Sen, Sudip
    Dey, Devanjan
    Talwar, Anjana
    BIOMEDICINES, 2023, 11 (08)
  • [7] Phenotypic Variation in Patients with Chronic Obstructive Pulmonary Disease in Primary Care
    Beekman, Emmylou
    Mesters, Ilse
    Spigt, Mark G.
    van Eerd, Eva A. M.
    Gosselink, Rik
    de Bie, Rob A.
    van Schayck, Onno C. P.
    BIOMED RESEARCH INTERNATIONAL, 2016, 2016
  • [8] Phenotypic characteristics of patients with arterial hypertension and chronic obstructive pulmonary disease
    Agapova, O.
    Dolgusheva, Y.
    Ratova, L.
    Chazova, I.
    Zykov, K.
    EUROPEAN RESPIRATORY JOURNAL, 2012, 40
  • [9] Phenotypic And Functional Properties Of Bronchial Epithelium In Chronic Obstructive Pulmonary Disease
    Ferhani, N.
    Brard, J.
    Grandsaigne, M.
    Taille, C.
    Fabre, A.
    Benadda, S.
    Danel, C.
    Castier, Y.
    Aubier, M.
    Pretolani, M.
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2013, 187
  • [10] Deep Learning Based Algorithms for Detecting Chronic Obstructive Pulmonary Disease
    Melese, Endashaw Amsalu
    Nabaasa, Evarist
    Wondemagegn, Matiwos Tekalign
    Yonasi, Safari
    Negasa, Gemechis Melkamu
    2022 IST-AFRICA CONFERENCE, 2022,