Feature assisted stacked attentive shortest dependency path based Bi-LSTM model for protein-protein interaction

Cited by: 47
Authors
Yadav, Shweta [1]
Ekbal, Asif [1]
Saha, Sriparna [1]
Kumar, Ankit [1]
Bhattacharyya, Pushpak [1]
Affiliations
[1] Indian Inst Technol Patna, Dept Comp Sci & Engn, Patna, Bihar, India
Keywords
Relation extraction; Protein-protein interaction; Bi-directional long short term memory (Bi-LSTM); Stacked attention; Deep learning; Shortest dependency path; Support vector machine; INTERACTION EXTRACTION; INFORMATION; NETWORK; NAMES
DOI
10.1016/j.knosys.2018.11.020
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Knowledge about protein-protein interactions is essential for understanding biological processes such as metabolic pathways, DNA replication, and transcription. However, most existing protein-protein interaction (PPI) systems depend primarily on the scientific literature, which is not yet accessible as a structured database. Thus, efficient information extraction systems are required for identifying PPI information from the large collection of biomedical texts. In this paper, we present a novel method based on an attentive deep recurrent neural network, which combines multiple levels of representation, exploiting word sequences and dependency path related information, to identify PPI information from text. We use a stacked attentive bi-directional long short term memory (Bi-LSTM) network as our recurrent neural network to solve the PPI identification problem. This model jointly models proteins and relations in a single unified framework, which we name the 'Attentive Shortest Dependency Path LSTM' (Att-sdpLSTM) model. We evaluated the proposed technique on five popular benchmark PPI datasets, namely AiMed, BioInfer, HPRD50, IEPA, and LLL. The evaluation shows F1-scores of 93.29%, 81.68%, 78.73%, 76.25%, and 83.92% on the AiMed, BioInfer, HPRD50, IEPA, and LLL datasets, respectively. Comparisons with existing systems show that our proposed approach attains state-of-the-art performance. (C) 2018 Elsevier B.V. All rights reserved.
Pages: 18-29
Number of pages: 12
Related papers
50 in total
  • [31] A Least Square Method Based Model for Identifying Protein Complexes in Protein-Protein Interaction Network
    Dai, Qiguo
    Guo, Maozu
    Guo, Yingjie
    Liu, Xiaoyan
    Liu, Yang
    Teng, Zhixia
    BIOMED RESEARCH INTERNATIONAL, 2014, 2014
  • [32] Shortest Path Analyses in the Protein-Protein Interaction Network of NGAL (Neutrophil Gelatinase-associated Lipocalin) Overexpression in Esophageal Squamous Cell Carcinoma
    Du, Ze-Peng
    Wu, Bing-Li
    Wang, Shao-Hong
    Shen, Jin-Hui
    Lin, Xuan-Hao
    Zheng, Chun-Peng
    Wu, Zhi-Yong
    Qiu, Xiao-Yang
    Zhan, Xiao-Fen
    Xu, Li-Yan
    Li, En-Min
    ASIAN PACIFIC JOURNAL OF CANCER PREVENTION, 2014, 15 (16) : 6899 - 6904
  • [33] CF-PPI: Centroid based new feature extraction approach for Protein-Protein Interaction Prediction
    Sahni, Gunjan
    Mewara, Bhawna
    Lalwani, Soniya
    Kumar, Rajesh
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2023, 35 (07) : 1037 - 1057
  • [34] Developing Computational Model to Predict Protein-Protein Interaction Sites Based on the XGBoost Algorithm
    Deng, Aijun
    Zhang, Huan
    Wang, Wenyan
    Zhang, Jun
    Fan, Dingdong
    Chen, Peng
    Wang, Bing
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2020, 21 (07)
  • [35] An improved approach to infer protein-protein interaction based on a hierarchical vector space model
    Jiongmin Zhang
    Ke Jia
    Jinmeng Jia
    Ying Qian
    BMC Bioinformatics, 19
  • [36] Evolving protein-protein interaction networks: A model based on duplication and mutation at different rates
    Sun, Jin-Tu
    Ao, Bin
    Zhang, Sheng
    Bing, Zhitong
    Yang, Lei
    JOURNAL OF THEORETICAL BIOLOGY, 2014, 350 : 32 - 36
  • [37] An improved approach to infer protein-protein interaction based on a hierarchical vector space model
    Zhang, Jiongmin
    Jia, Ke
    Jia, Jinmeng
    Qian, Ying
    BMC BIOINFORMATICS, 2018, 19
  • [38] Rolling bearing degradation stage division and RUL prediction based on recursive exponential slow feature analysis and Bi-LSTM model
    Li, Xinliang
    Zhang, Wan
    Ding, Yu
    Cai, Jun
    Yan, Xiaoan
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2025, 259
  • [39] Mining for genes related to choroidal neovascularization based on the shortest path algorithm and protein interaction information
    Zhang, Jian
    Suo, Yan
    Zhang, Yu-Hang
    Zhang, Qing
    Chen, XiJia
    Xu, Xun
    Lu, WenCong
    BIOCHIMICA ET BIOPHYSICA ACTA-GENERAL SUBJECTS, 2016, 1860 (11) : 2740 - 2749
  • [40] Quantitative comparison of protein-protein interaction interface using physicochemical feature-based descriptors of surface patches
    Shin, Woong-Hee
    Kumazawa, Keiko
    Imai, Kenichiro
    Hirokawa, Takatsugu
    Kihara, Daisuke
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2023, 10