Semantic classification of biomedical concepts using distributional similarity

被引:22
|
作者
Fan, Jung-Wei [1 ]
Friedman, Carol [1 ]
机构
[1] Columbia Univ, Dept Biomed Informat, New York, NY USA
关键词
D O I
10.1197/jamia.M2314
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: To develop an automated, high-throughput, and reproducible method for reclassifying and validating ontological concepts for natural language processing applications. Design: We developed a distributional similarity approach to classify the Unified Medical Language System (UMLS) concepts. Classification models were built for seven broad biomedically relevant semantic classes created by grouping subsets of the UMLS semantic types. We used contextual features based on syntactic properties obtained from two different large corpora and used alpha-skew divergence as the similarity measure. Measurements: The testing sets were automatically generated based on the changes by the National Library of Medicine to the semantic classification of concepts from the UMLS 2005AA to the 2006AA release. Error rates were calculated and a misclassification analysis was performed. Results: The estimated lowest error rates were 0.198 and 0.116 when considering the correct classification to be covered by our top prediction and top 2 predictions, respectively. Conclusion: The results demonstrated that the distributional similarity approach can recommend high level semantic classification suitable for use in natural language processing.
引用
收藏
页码:467 / 477
页数:11
相关论文
共 50 条
  • [1] Computing semantic similarity between biomedical concepts using new information content approach
    Ben Aouicha, Mohamed
    Taieb, Mohamed Ali Hadj
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 59 : 258 - 275
  • [2] Association measures for estimating semantic similarity and relatedness between biomedical concepts
    Henry, Sam
    McQuilkin, Alex
    McInnes, Bridget T.
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2019, 93 : 1 - 10
  • [3] Measuring Semantic Similarity Between Biomedical Concepts Within Multiple Ontologies
    Al-Mubaid, Hisham
    Nguyen, Hoa A.
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2009, 39 (04): : 389 - 398
  • [4] Similarity is closeness: Using distributional semantic spaces to model similarity in visual and linguistic metaphors
    Bolognesi, Marianna
    Aina, Laura
    [J]. CORPUS LINGUISTICS AND LINGUISTIC THEORY, 2019, 15 (01) : 101 - 137
  • [5] Semantic Similarity in Biomedical Ontologies
    Pesquita, Catia
    Faria, Daniel
    Falcao, Andre O.
    Lord, Phillip
    Couto, Francisco M.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (07)
  • [6] Supervised Biomedical Semantic Similarity
    Sousa, Rita. T. T.
    Silva, Sara
    Pesquita, Catia
    [J]. IEEE ACCESS, 2023, 11 : 60635 - 60645
  • [7] Semantic Annotation of Unstructured Documents Using Concepts Similarity
    Pech, Fernando
    Martinez, Alicia
    Estrada, Hugo
    Hernandez, Yasmin
    [J]. SCIENTIFIC PROGRAMMING, 2017, 2017
  • [8] Chinese SNS Blog Classification Using Semantic Similarity
    Shi Chenye
    Li Jianhua
    Chen Jieyuan
    Chen Xiuzhen
    [J]. 2013 FIFTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL ASPECTS OF SOCIAL NETWORKS (CASON), 2013, : 1 - 6
  • [9] Predicting the relevance of distributional semantic similarity with contextual information
    Muller, Philippe
    Fabre, Ccile
    Adam, Clmentine
    [J]. PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 479 - 488
  • [10] Mining protein interactions from biomedical literature using semantic similarity
    Schmitt, Charles
    Cox, Steven
    Christopherson, Laura
    Scott, Erick
    Firrincieli, Stephen
    Baker, Nancy
    Tutubalina, Elena
    Tropsha, Alexander
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 253