OntoSem: an Ontology Semantic Representation Methodology for Biomedical Domain

被引:3
|
作者
Zhao, Lingling [1 ]
Wang, Junjie [1 ]
Cheng, Liang [2 ]
Wang, Chunyu [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin, Peoples R China
[2] Harbin Med Univ, NHC & CAMS Key Lab Mol Probe & Targeted Theranost, Coll Bioinformat Sci & Technol, Harbin, Peoples R China
基金
中国国家自然科学基金;
关键词
ontology semantic representation; BERT; Word2Vec; deep learning; semantic similarity; PROTEIN-PROTEIN INTERACTIONS; GENE ONTOLOGY; PREDICTION;
D O I
10.1109/BIBM49941.2020.9313128
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Ontologies are essential description tools for biomedical concepts and entities, supporting biomedical fundamental research such as semantic similarity analysis, protein-protein interaction prediction and so on. An increasing amount of ontology-like domain knowledge is published in scientific publications, meanwhile, advanced natural language processing (NLP) techniques have been widespread to extract information from text resources automatically, both of which facilitate the exploration of the semantic representation of biomedical ontologies. We propose a novel distributional semantic representation methodology based on the combination of two pre-trained and domain-specific word embedding tools, the non-contextualized Word2Vec and the context-dependent NCBI-blueBERT, to enhance the encoding ability for biomedical ontologies. Furthermore, we utilize a randomly initialized bidirectional LSTM to project the obtained word vector sequence to a fixed-length sentence vector, facilitating a flexible and uniform way for the computation of downstream tasks. We evaluate our method in two categories of tasks: the similarity access of ontology terms, and the ontology annotationbased protein-protein interaction classification. Experimental results demonstrate that our method provides encouraging results compared to the baselines in all tests. Our approach offers promising opportunities for representing ontologies semantics and in turn characterizing entities including proteins in biomedical research.
引用
收藏
页码:523 / 527
页数:5
相关论文
共 50 条
  • [1] A Semantic Representation of the Knowledge Management Enablers Domain: The aKMEOnt Ontology
    Sabri, Mohammad
    Odeh, Mohammed
    Saad, Mohammed
    [J]. PROCEEDINGS OF THE 18TH EUROPEAN CONFERENCE ON KNOWLEDGE MANAGEMENT (ECKM 2017), VOLS 1 AND 2, 2017, : 1196 - 1204
  • [2] New ontology-based semantic similarity measure for the biomedical domain
    Nguyen, Hoa A.
    Al-Mubaid, Hisham
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, 2006, : 623 - +
  • [3] Methodology for Biomedical Ontology Matching
    Vatascinova, Jana
    [J]. SEMANTIC WEB: ESWC 2019 SATELLITE EVENTS, 2019, 11762 : 242 - 250
  • [4] A Methodology for Biomedical Ontology Reuse
    Zulkarnain, Nur Zareen
    Meziane, Farid
    Crofts, Gillian
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2016, 2016, 9612 : 3 - 14
  • [5] DartWiki: A Semantic Wiki for Ontology-Based Knowledge Integration in the Biomedical Domain
    Yu, Tong
    Chen, Huajun
    Mi, Jinhua
    Gu, Peiqin
    Wu, Ting
    Pan, Jeff Z.
    [J]. CURRENT BIOINFORMATICS, 2012, 7 (03) : 278 - 288
  • [6] A framework for unifying ontology-based semantic similarity measures: A study in the biomedical domain
    Harispe, Sebastien
    Sanchez, David
    Ranwez, Sylvie
    Janaqi, Stefan
    Montmain, Jacky
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 48 : 38 - 53
  • [7] A Cross-Domain Ontology Semantic Representation Based on NCBI-BlueBERT Embedding
    Zhao Lingling
    Wang Junjie
    Wang Chunyu
    Guo Maozu
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2022, 31 (05) : 860 - 869
  • [8] A Cross-Domain Ontology Semantic Representation Based on NCBI-BlueBERT Embedding
    ZHAO Lingling
    WANG Junjie
    WANG Chunyu
    GUO Maozu
    [J]. Chinese Journal of Electronics, 2022, 31 (05) : 860 - 869
  • [9] Exploring ontology metrics in the biomedical domain
    Manouselis, N.
    Sicilia, M. A.
    Rodriguez, D.
    [J]. ICCS 2010 - INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, PROCEEDINGS, 2010, 1 (01): : 2313 - 2321
  • [10] Semantic similarity estimation in the biomedical domain: An ontology-based information-theoretic perspective
    Sanchez, David
    Batet, Montserrat
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2011, 44 (05) : 749 - 759