Semantic classification of biomedical concepts using distributional similarity

被引:22
|
作者
Fan, Jung-Wei [1 ]
Friedman, Carol [1 ]
机构
[1] Columbia Univ, Dept Biomed Informat, New York, NY USA
关键词
D O I
10.1197/jamia.M2314
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: To develop an automated, high-throughput, and reproducible method for reclassifying and validating ontological concepts for natural language processing applications. Design: We developed a distributional similarity approach to classify the Unified Medical Language System (UMLS) concepts. Classification models were built for seven broad biomedically relevant semantic classes created by grouping subsets of the UMLS semantic types. We used contextual features based on syntactic properties obtained from two different large corpora and used alpha-skew divergence as the similarity measure. Measurements: The testing sets were automatically generated based on the changes by the National Library of Medicine to the semantic classification of concepts from the UMLS 2005AA to the 2006AA release. Error rates were calculated and a misclassification analysis was performed. Results: The estimated lowest error rates were 0.198 and 0.116 when considering the correct classification to be covered by our top prediction and top 2 predictions, respectively. Conclusion: The results demonstrated that the distributional similarity approach can recommend high level semantic classification suitable for use in natural language processing.
引用
收藏
页码:467 / 477
页数:11
相关论文
共 50 条
  • [11] Using MEDLINE as standard corpus for measuring semantic similarity in the biomedical domain
    Al-Mubaid, Hisharn
    Nguyen, Hoa A.
    [J]. BIBE 2006: SIXTH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2006, : 315 - +
  • [12] Using topic concepts for semantic video shots classification
    Ayache, Stephane
    Quenot, Georges
    Gensel, Jerome
    Satoh, Shin'ichi
    [J]. IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2006, 4071 : 300 - 309
  • [13] The semantic measures library and toolkit: fast computation of semantic similarity and relatedness using biomedical ontologies
    Harispe, Sebastien
    Ranwez, Sylvie
    Janaqi, Stefan
    Montmain, Jacky
    [J]. BIOINFORMATICS, 2014, 30 (05) : 740 - 742
  • [14] Generating Abstraction Networks using Semantic Similarity Measure of Ontology Concepts
    Cirella, David
    Gu, Huanying
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 840 - 843
  • [15] Measures of semantic similarity and relatedness in the biomedical domain
    Pedersen, Ted
    Pakhomov, Serguei V. S.
    Patwardhan, Siddharth
    Chute, Christopher G.
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2007, 40 (03) : 288 - 299
  • [16] Semantic Similarity Analysis for Examination Questions Classification Using WordNet
    Goh, Thing Thing
    Jamaludin, Nor Azliana Akmal
    Mohamed, Hassan
    Ismail, Mohd Nazri
    Chua, Huangshen
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (14):
  • [17] Assessment of semantic similarity of concepts defined in ontology
    Zadeh, Parisa D. Hossein
    Reformat, Marek Z.
    [J]. INFORMATION SCIENCES, 2013, 250 : 21 - 39
  • [18] Fuzzy semantic similarity between ontological concepts
    Song, Ling
    Ma, Jun
    Liu, Hui
    Lian, Li
    Zhang, Dongmei
    [J]. ADVANCES AND INNOVATIONS IN SYSTEMS, COMPUTING SCIENCES AND SOFTWARE ENGINEERING, 2007, : 275 - 280
  • [19] Computing Semantic Similarity of Concepts in Knowledge Graphs
    Zhu, Ganggao
    Iglesias, Carlos A.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (01) : 72 - 85
  • [20] A unified framework for semantic similarity computation of concepts
    Yuncheng Jiang
    [J]. Multimedia Tools and Applications, 2021, 80 : 32335 - 32378