A Scalable Embedding Based Neural Network Method for Discovering Knowledge From Biomedical Literature

Cited by: 8

Authors:

Sang, Shengtian [1]

Liu, Xiaoxia [2]

Chen, Xiaoyu [3]

Zhao, Di [3]

Affiliations:

[1] Stanford Univ, Dept Med, Stanford, CA 94305 USA

[2] Dalian Maritime Univ, Informat Sci & Technol Coll, Dalian 116026, Peoples R China

[3] Dalian Univ Technol, Coll Comp Sci & Technol, Dalian 116026, Peoples R China

Source:

IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS | 2022 / Vol. 19 / No. 03

Keywords:

Unified modeling language; Biological system modeling; Diseases; Drugs; Semantics; Deep learning; Task analysis; Literature-based discovery; knowledge graph; bidirectional recurrent neural network; drug discovery; FISH-OIL; PREDICTION; TEXT;

DOI:

10.1109/TCBB.2020.3003947

Chinese Library Classification (CLC) number:

Q5 [Biochemistry];

Discipline classification codes:

071010 ; 081704 ;

Abstract:

The biomedical literature is growing at an explosive rate, and much useful knowledge in it remains undiscovered. Classical information retrieval techniques provide access to explicit information in a given collection but cannot recognize implicit connections. Literature-based discovery (LBD) aims to uncover hidden associations across non-interacting literatures, and it could significantly support scientific research by identifying new connections between biomedical entities. However, most existing LBD approaches are not scalable and may not be sufficient to detect complex associations in literature that is not directly connected. In this article, we present a model that combines a biomedical knowledge graph, graph embedding, and deep learning methods for literature-based discovery. First, relations between biomedical entities are extracted from biomedical abstracts, and a knowledge graph is constructed from these relations. Second, graph embedding techniques are applied to map the entities and relations in the knowledge graph into a low-dimensional vector space. Third, a bidirectional Long Short-Term Memory (BLSTM) network is trained on the entity associations represented by the pre-trained graph embeddings. Finally, the learned model is used for open and closed literature-based discovery tasks. The experimental results show that our method not only effectively discovers hidden associations between entities but also reveals the corresponding mechanisms of interaction. This suggests that combining knowledge graphs with deep learning is an effective way to capture the complex associations between entities hidden in the literature.
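The open and closed discovery tasks named in the abstract can be illustrated with a minimal sketch over a toy knowledge graph. This is not the paper's embedding and BLSTM model; it is the simple two-hop (A-B-C) path search that such models build on, shown only to make the task definitions concrete. The triples, relation names, and function names (`build_graph`, `open_discovery`, `closed_discovery`) are hypothetical and chosen for illustration.

```python
from collections import defaultdict

# Toy (head, relation, tail) triples. Illustrative only, not data from the paper.
TRIPLES = [
    ("fish_oil", "reduces", "blood_viscosity"),
    ("fish_oil", "inhibits", "platelet_aggregation"),
    ("blood_viscosity", "associated_with", "raynaud_disease"),
    ("platelet_aggregation", "associated_with", "raynaud_disease"),
    ("aspirin", "inhibits", "platelet_aggregation"),
]


def build_graph(triples):
    """Index triples as head -> list of (relation, tail) edges."""
    graph = defaultdict(list)
    for head, relation, tail in triples:
        graph[head].append((relation, tail))
    return graph


def open_discovery(graph, source):
    """Open discovery: find targets C reachable from `source` through one
    intermediate B, excluding targets already directly linked to `source`.
    Returns {C: [(A, rel_ab, B, rel_bc, C), ...]}."""
    direct = {tail for _, tail in graph.get(source, [])}
    candidates = defaultdict(list)
    for rel_ab, b in graph.get(source, []):
        for rel_bc, c in graph.get(b, []):
            if c != source and c not in direct:
                candidates[c].append((source, rel_ab, b, rel_bc, c))
    return dict(candidates)


def closed_discovery(graph, source, target):
    """Closed discovery: given both endpoints, list the linking A-B-C paths."""
    return open_discovery(graph, source).get(target, [])


graph = build_graph(TRIPLES)
paths = closed_discovery(graph, "fish_oil", "raynaud_disease")
for path in paths:
    print(" -> ".join(path))
```

In open discovery only the starting entity is given and candidate targets are ranked; in closed discovery both endpoints are given and the intermediate paths serve as a candidate explanation of the mechanism. A learned model, as described in the abstract, would score such paths from their embeddings rather than enumerate them exhaustively.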


Pages: 1294-1301

Page count: 8
