Feature assisted stacked attentive shortest dependency path based Bi-LSTM model for protein-protein interaction

被引:47
|
作者
Yadav, Shweta [1 ]
Ekbal, Asif [1 ]
Saha, Sriparna [1 ]
Kumar, Ankit [1 ]
Bhattacharyya, Pushpak [1 ]
机构
[1] Indian Inst Technol Patna, Dept Comp Sci & Engn, Patna, Bihar, India
关键词
Relation extraction; Protein-protein interaction; Bi-directional long short term memory(Bi-LSTM); Stacked attention; Deep learning; Shortest dependency path; Support vector machine; INTERACTION EXTRACTION; INFORMATION; NETWORK; NAMES;
D O I
10.1016/j.knosys.2018.11.020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge about protein-protein interactions is essential for understanding the biological processes such as metabolic pathways, DNA replication, and transcription etc. However, a majority of the existing Protein-Protein Interaction (PPI) systems are dependent primarily on the scientific literature, which is not yet accessible as a structured database. Thus, efficient information extraction systems are required for identifying PPI information from the large collection of biomedical texts. In this paper, we present a novel method based on attentive deep recurrent neural network, which combines multiple levels of representations exploiting word sequences and dependency path related information to identify protein-protein interaction (PPI) information from the text. We use the stacked attentive bi-directional long short term memory (Bi-LSTM) as our recurrent neural network to solve the PPI identification problem. This model leverages joint modeling of proteins and relations in a single unified framework, which is named as the 'Attentive Shortest Dependency Path LSTM' (Att-sdpLSTM) model. Experimentation of the proposed technique was conducted on five popular benchmark PPI datasets, namely AiMed, Biolnfer, HPRD50, IEPA, and LLL The evaluation shows the F1-score values of 93.29%, 81.68%, 78.73%, 76.25%, & 83.92% on AiMed, Biolnfer, HPRD50, IEPA, and LLL dataset, respectively. Comparisons with the existing systems show that our proposed approach attains state-of-the-art performance. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:18 / 29
页数:12
相关论文
共 50 条
  • [41] A correlation coefficient-based feature selection approach for virus-host protein-protein interaction prediction
    Ibrahim, Ahmed Hassan
    Karabulut, Onur Can
    Karpuzcu, Betuel Asiye
    Turk, Erdem
    Suzek, Baris Ethem
    PLOS ONE, 2023, 18 (05):
  • [42] An Extended Feature Representation Technique for Predicting Sequenced-based Host-pathogen Protein-protein Interaction
    Emmanuel, Jerry
    Isewon, Itunuoluwa
    Olasehinde, Grace
    Oyelade, Jelili
    CURRENT BIOINFORMATICS, 2025, 20 (03) : 229 - 245
  • [43] An uncertain model-based approach for identifying dynamic protein complexes in uncertain protein-protein interaction networks
    Yijia Zhang
    Hongfei Lin
    Zhihao Yang
    Jian Wang
    Yiwei Liu
    BMC Genomics, 18
  • [44] An uncertain model-based approach for identifying dynamic protein complexes in uncertain protein-protein interaction networks
    Zhang, Yijia
    Lin, Hongfei
    Yang, Zhihao
    Wang, Jian
    Liu, Yiwei
    BMC GENOMICS, 2017, 18
  • [45] Protein Complexes Discovery Based on Protein-Protein Interaction Data via a Regularized Sparse Generative Network Model
    Zhang, Xiao-Fei
    Dai, Dao-Qing
    Li, Xiao-Xin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (03) : 857 - 870
  • [46] LPBERT: A Protein-Protein Interaction Prediction Method Based on a Pre-Trained Language Model
    Hu, An
    Kuang, Linai
    Yang, Dinghai
    APPLIED SCIENCES-BASEL, 2025, 15 (06):
  • [47] MULTIMODAL PRE-TRAINING MODEL FOR SEQUENCE-BASED PREDICTION OF PROTEIN-PROTEIN INTERACTION
    Xue, Yang
    Liu, Zijing
    Fang, Xiaomin
    Wang, Fan
    MACHINE LEARNING IN COMPUTATIONAL BIOLOGY, VOL 165, 2021, 165 : 34 - 46
  • [48] Graph-BERT and language model-based framework for protein-protein interaction identification
    Jha, Kanchan
    Karmakar, Sourav
    Saha, Sriparna
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [49] An efficient epileptic seizure detection based on tunable Q-wavelet transform and DCVAE-stacked Bi-LSTM model using electroencephalogram
    Sivasaravanababu, S.
    Prabhu, V
    Parthasarathy, V
    Mahendran, Rakesh Kumar
    EUROPEAN PHYSICAL JOURNAL-SPECIAL TOPICS, 2022, 231 (11-12): : 2425 - 2437
  • [50] An efficient epileptic seizure detection based on tunable Q-wavelet transform and DCVAE-stacked Bi-LSTM model using electroencephalogram
    S. Sivasaravanababu
    V. Prabhu
    V. Parthasarathy
    Rakesh Kumar Mahendran
    The European Physical Journal Special Topics, 2022, 231 : 2425 - 2437