MM-StackEns: A new deep multimodal stacked generalization approach for protein-protein interaction prediction

被引:12
|
作者
Albu, Alexandra-Ioana [1 ]
Bocicor, Maria-Iuliana [1 ]
Czibula, Gabriela [1 ]
机构
[1] Babes Bolyai Univ, Dept Comp Sci, 1 Mihail Kogalniceanu St, Cluj Napoca 400084, Romania
关键词
Neural networks; Protein-protein interaction prediction; Feature fusion; Contextualized word embeddings; Graph neural networks; SEMANTIC SIMILARITY; NEURAL-NETWORKS; CLASSIFIER; INFERENCE; SETS;
D O I
10.1016/j.compbiomed.2022.106526
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Accurate in-silico identification of protein-protein interactions (PPIs) is a long-standing problem in biology, with important implications in protein function prediction and drug design. Current computational approaches predominantly use a single data modality for describing protein pairs, which may not fully capture the characteristics relevant for identifying PPIs. Another limitation of existing methods is their poor generalization to proteins outside the training graph. In this paper, we aim to address these shortcomings by proposing a new ensemble approach for PPI prediction, which learns information from two modalities, corresponding to pairs of sequences and to the graph formed by the training proteins and their interactions. Our approach uses a siamese neural network to process sequence information, while graph attention networks are employed for the network view. For capturing the relationships between the proteins in a pair, we design a new feature fusion module, based on computing the distance between the distributions corresponding to the two proteins. The prediction is made using a stacked generalization procedure, in which the final classifier is represented by a Logistic Regression model trained on the scores predicted by the sequence and graph models. Additionally, we show that protein sequence embeddings obtained using pretrained language models can significantly improve the generalization of PPI methods. The experimental results demonstrate the good performance of our approach, which surpasses all the related work on two Yeast data sets, while outperforming the majority of literature approaches on two Human data sets and on independent multi-species data sets.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] NOXclass: prediction of protein-protein interaction types
    Zhu, HB
    Domingues, FS
    Sommer, I
    Lengauer, T
    BMC BIOINFORMATICS, 2006, 7
  • [22] MFC-PPI: protein-protein interaction prediction with multimodal feature fusion and contrastive learning
    Zhang, Zhixin
    Zhang, Qunhao
    Xiao, Jun
    Ding, Shanyang
    Li, Zhen
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (04):
  • [23] NOXclass: Prediction of protein-protein interaction types
    Max-Planck-Institut für Informatik, Stuhlsatzenhausweg 85, 66123 Saarbrücken, Germany
    BMC Bioinform., 2006,
  • [24] MULTIMODAL PRE-TRAINING MODEL FOR SEQUENCE-BASED PREDICTION OF PROTEIN-PROTEIN INTERACTION
    Xue, Yang
    Liu, Zijing
    Fang, Xiaomin
    Wang, Fan
    MACHINE LEARNING IN COMPUTATIONAL BIOLOGY, VOL 165, 2021, 165 : 34 - 46
  • [25] NOXclass: prediction of protein-protein interaction types
    Hongbo Zhu
    Francisco S Domingues
    Ingolf Sommer
    Thomas Lengauer
    BMC Bioinformatics, 7 (1)
  • [26] Construction and prediction of protein-protein interaction maps
    Schächter, V
    BIOINFORMATICS AND GENOME ANALYSIS, 2002, 38 : 191 - 220
  • [27] Protein-Protein Interaction Prediction: Recent Advances
    Shatnawi, Maad
    2017 28TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2017, : 69 - 73
  • [28] Protein-Protein Interaction: Prediction, Design, and Modulation
    Zhang Chang-Sheng
    Lai Lu-Hua
    ACTA PHYSICO-CHIMICA SINICA, 2012, 28 (10) : 2363 - 2380
  • [29] Discrimination and Prediction of Protein-Protein Binding Affinity Using Deep Learning Approach
    Nikam, Rahul
    Yugandhar, K.
    Gromiha, M. Michael
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, PT II, 2018, 10955 : 809 - 815
  • [30] CF-PPI: Centroid based new feature extraction approach for Protein-Protein Interaction Prediction
    Sahni, Gunjan
    Mewara, Bhawna
    Lalwani, Soniya
    Kumar, Rajesh
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2023, 35 (07) : 1037 - 1057