A Scalable Embedding Based Neural Network Method for Discovering Knowledge From Biomedical Literature

被引:8
|
作者
Sang, Shengtian [1 ]
Liu, Xiaoxia [2 ]
Chen, Xiaoyu [3 ]
Zhao, Di [3 ]
机构
[1] Stanford Univ, Dept Med, Stanford, CA 94305 USA
[2] Dalian Maritime Univ, Informat Sci & Technol Coll, Dalian 116026, Peoples R China
[3] Dalian Univ Technol, Coll Comp Sci & Technol, Dalian 116026, Peoples R China
关键词
Unified modeling language; Biological system modeling; Diseases; Drugs; Semantics; Deep learning; Task analysis; Literature-based discovery; knowledge graph; bidirectional recurrent neural network; drug discovery; FISH-OIL; PREDICTION; TEXT;
D O I
10.1109/TCBB.2020.3003947
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Nowadays, the amount of biomedical literatures is growing at an explosive speed, and much useful knowledge is yet undiscovered in the literature. Classical information retrieval techniques allow to access explicit information from a given collection of information, but are not able to recognize implicit connections. Literature-based discovery (LBD) is characterized by uncovering hidden associations in non-interacting literature. It could significantly support scientific research by identifying new connections between biomedical entities. However, most of the existing approaches to LBD are not scalable and may not be sufficient to detect complex associations in non-directly-connected literature. In this article, we present a model which incorporates biomedical knowledge graph, graph embedding, and deep learning methods for literature-based discovery. First, the relations between biomedical entities are extracted from biomedical abstracts and then a knowledge graph is constructed by using these obtained relations. Second, the graph embedding technologies are applied to convert the entities and relations in the knowledge graph into a low-dimensional vector space. Third, a bidirectional Long Short-Term Memory (BLSTM) network is trained based on the entity associations represented by the pre-trained graph embeddings. Finally, the learned model is used for open and closed literature-based discovery tasks. The experimental results show that our method could not only effectively discover hidden associations between entities, but also reveal the corresponding mechanism of interactions. It suggests that incorporating knowledge graph and deep learning methods is an effective way for capturing the underlying complex associations between entities hidden in the literature.
引用
收藏
页码:1294 / 1301
页数:8
相关论文
共 50 条
  • [21] Multisource hierarchical neural network for knowledge graph embedding
    Jiang, Dan
    Wang, Ronggui
    Xue, Lixia
    Yang, Juan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [22] Relation-aware deep neural network enables more efficient biomedical knowledge acquisition from massive literature
    Song, Chenyang
    Zeng, Zheni
    Tian, Changyao
    Li, Kuai
    Yao, Yuan
    Zheng, Suncong
    Liu, Zhiyuan
    Sun, Maosong
    [J]. AI Open, 2024, 5 : 104 - 114
  • [23] A method to establish a neural network based on the whole knowledge
    Wang, YZ
    Yu, MS
    Yu, J
    [J]. 2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 1364 - 1369
  • [24] Character level and word level embedding with bidirectional LSTM - Dynamic recurrent neural network for biomedical named entity recognition from literature
    Gajendran, Sudhakaran
    Manjula, D.
    Sugumaran, Vijayan
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 112
  • [25] Discovering breast cancer drug candidates from biomedical literature
    Li, Jiao
    Zhu, Xiaoyan
    Chen, Jake Yue
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2010, 4 (03) : 241 - 255
  • [26] Chemical-protein interaction extraction from biomedical literature: a hierarchical recurrent convolutional neural network method
    Sun, Cong
    Yang, Zhihao
    Wang, Lei
    Zhang, Yin
    Lin, Hongfei
    Wang, Jian
    Yang, Liang
    Xu, Kan
    Zhang, Yijia
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2019, 22 (02) : 113 - 130
  • [27] SCE: Scalable Network Embedding from Sparsest Cut
    Zhang, Shengzhong
    Huang, Zengfeng
    Zhou, Haicang
    Zhou, Ziang
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 257 - 265
  • [28] Mining hidden knowledge: embedding models of cause-effect relationships curated from the biomedical literature
    Kraemer, Andreas
    Green, Jeff
    Billaud, Jean-Noel
    Pasare, Nicoleta Andreea
    Jones, Martin
    Tugendreich, Stuart
    Arighi, Cecilia
    [J]. BIOINFORMATICS ADVANCES, 2022, 2 (01):
  • [29] Medical knowledge embedding based on recursive neural network for multi-disease diagnosis
    Jiang, Jingchi
    Wang, Huanzheng
    Xie, Jing
    Guo, Xitong
    Guan, Yi
    Yu, Qiubin
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2020, 103 (103)
  • [30] A Framework To Build A Causal Knowledge Graph for Chronic Diseases and Cancers By Discovering Semantic Associations from Biomedical Literature
    Daowd, Ali
    Barrett, Michael
    Abidi, Samina
    Abidi, Syed Sibte Raza
    [J]. 2021 IEEE 9TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2021), 2021, : 13 - 22