Feature assisted stacked attentive shortest dependency path based Bi-LSTM model for protein-protein interaction

被引:47
|
作者
Yadav, Shweta [1 ]
Ekbal, Asif [1 ]
Saha, Sriparna [1 ]
Kumar, Ankit [1 ]
Bhattacharyya, Pushpak [1 ]
机构
[1] Indian Inst Technol Patna, Dept Comp Sci & Engn, Patna, Bihar, India
关键词
Relation extraction; Protein-protein interaction; Bi-directional long short term memory(Bi-LSTM); Stacked attention; Deep learning; Shortest dependency path; Support vector machine; INTERACTION EXTRACTION; INFORMATION; NETWORK; NAMES;
D O I
10.1016/j.knosys.2018.11.020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge about protein-protein interactions is essential for understanding the biological processes such as metabolic pathways, DNA replication, and transcription etc. However, a majority of the existing Protein-Protein Interaction (PPI) systems are dependent primarily on the scientific literature, which is not yet accessible as a structured database. Thus, efficient information extraction systems are required for identifying PPI information from the large collection of biomedical texts. In this paper, we present a novel method based on attentive deep recurrent neural network, which combines multiple levels of representations exploiting word sequences and dependency path related information to identify protein-protein interaction (PPI) information from the text. We use the stacked attentive bi-directional long short term memory (Bi-LSTM) as our recurrent neural network to solve the PPI identification problem. This model leverages joint modeling of proteins and relations in a single unified framework, which is named as the 'Attentive Shortest Dependency Path LSTM' (Att-sdpLSTM) model. Experimentation of the proposed technique was conducted on five popular benchmark PPI datasets, namely AiMed, Biolnfer, HPRD50, IEPA, and LLL The evaluation shows the F1-score values of 93.29%, 81.68%, 78.73%, 76.25%, & 83.92% on AiMed, Biolnfer, HPRD50, IEPA, and LLL dataset, respectively. Comparisons with the existing systems show that our proposed approach attains state-of-the-art performance. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:18 / 29
页数:12
相关论文
共 50 条
  • [1] Shortest path-based analysis of protein-protein interaction networks
    Li, Min
    Chen, Jianer
    Wang, Jianxin
    Gaojishu Tongxin/Chinese High Technology Letters, 2009, 19 (01): : 89 - 94
  • [2] A Shortest Dependency Path Based Convolutional Neural Network for Protein-Protein Relation Extraction
    Hua, Lei
    Quan, Chanqin
    BIOMED RESEARCH INTERNATIONAL, 2016, 2016
  • [3] Exploiting Dependency Information for Feature-based Protein-Protein Interaction Extraction
    Liu, Bing
    Qian, Longhua
    Zhou, Guodong
    Zhu, Qiaoming
    2010 INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT (CCCM2010), VOL II, 2010, : 99 - 102
  • [4] Exploiting Dependency Information for Feature-Based Protein-Protein Interaction Extraction
    Liu, Bing
    Qian, Longhua
    Zhou, Guodong
    Zhu, Qiaoming
    PROCEEDINGS OF THE 2011 INTERNATIONAL CONFERENCE ON INFORMATICS, CYBERNETICS, AND COMPUTER ENGINEERING (ICCE2011), VOL 2: INFORMATION SYSTEMS AND COMPUTER ENGINEERING, 2011, 111 : 267 - 272
  • [5] Exploiting dependency information for feature-based protein-protein interaction extraction
    Liu B.
    Qian L.
    Zhou G.
    Zhu Q.
    Advances in Intelligent and Soft Computing, 2011, 111 : 267 - 272
  • [6] The Approximability of Shortest Path-Based Graph Orientations of Protein-Protein Interaction Networks
    Blokh, Dima
    Segev, Danny
    Sharan, Roded
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2013, 20 (12) : 945 - 957
  • [7] Identification of retinoblastoma related genes with shortest path in a protein-protein interaction network
    Li, Bi-Qing
    Zhang, Jian
    Huang, Tao
    Zhang, Lei
    Cai, Yu-Dong
    BIOCHIMIE, 2012, 94 (09) : 1910 - 1917
  • [8] Protein-protein interaction prediction using a hybrid feature representation and a stacked generalization scheme
    Chen, Kuan-Hsi
    Wang, Tsai-Feng
    Hu, Yuh-Jyh
    BMC BIOINFORMATICS, 2019, 20 (1)
  • [9] Protein-protein interaction prediction using a hybrid feature representation and a stacked generalization scheme
    Kuan-Hsi Chen
    Tsai-Feng Wang
    Yuh-Jyh Hu
    BMC Bioinformatics, 20
  • [10] A Bi-LSTM Based Ensemble Algorithm for Prediction of Protein Secondary Structure
    Hu, Hailong
    Li, Zhong
    Elofsson, Arne
    Xie, Shangxin
    APPLIED SCIENCES-BASEL, 2019, 9 (17):