MM-StackEns: A new deep multimodal stacked generalization approach for protein-protein interaction prediction

被引:12
|
作者
Albu, Alexandra-Ioana [1 ]
Bocicor, Maria-Iuliana [1 ]
Czibula, Gabriela [1 ]
机构
[1] Babes Bolyai Univ, Dept Comp Sci, 1 Mihail Kogalniceanu St, Cluj Napoca 400084, Romania
关键词
Neural networks; Protein-protein interaction prediction; Feature fusion; Contextualized word embeddings; Graph neural networks; SEMANTIC SIMILARITY; NEURAL-NETWORKS; CLASSIFIER; INFERENCE; SETS;
D O I
10.1016/j.compbiomed.2022.106526
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Accurate in-silico identification of protein-protein interactions (PPIs) is a long-standing problem in biology, with important implications in protein function prediction and drug design. Current computational approaches predominantly use a single data modality for describing protein pairs, which may not fully capture the characteristics relevant for identifying PPIs. Another limitation of existing methods is their poor generalization to proteins outside the training graph. In this paper, we aim to address these shortcomings by proposing a new ensemble approach for PPI prediction, which learns information from two modalities, corresponding to pairs of sequences and to the graph formed by the training proteins and their interactions. Our approach uses a siamese neural network to process sequence information, while graph attention networks are employed for the network view. For capturing the relationships between the proteins in a pair, we design a new feature fusion module, based on computing the distance between the distributions corresponding to the two proteins. The prediction is made using a stacked generalization procedure, in which the final classifier is represented by a Logistic Regression model trained on the scores predicted by the sequence and graph models. Additionally, we show that protein sequence embeddings obtained using pretrained language models can significantly improve the generalization of PPI methods. The experimental results demonstrate the good performance of our approach, which surpasses all the related work on two Yeast data sets, while outperforming the majority of literature approaches on two Human data sets and on independent multi-species data sets.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Protein-protein interaction prediction using a hybrid feature representation and a stacked generalization scheme
    Chen, Kuan-Hsi
    Wang, Tsai-Feng
    Hu, Yuh-Jyh
    BMC BIOINFORMATICS, 2019, 20 (1)
  • [2] Protein-protein interaction prediction using a hybrid feature representation and a stacked generalization scheme
    Kuan-Hsi Chen
    Tsai-Feng Wang
    Yuh-Jyh Hu
    BMC Bioinformatics, 20
  • [3] Multimodal Deep Representation Learning for Protein-Protein Interaction Networks
    Zhang, Da
    Kabuka, Mansur R.
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 595 - 602
  • [4] Revolutionizing protein-protein interaction prediction with deep learning
    Zhang, Jing
    Durham, Jesse
    Cong, Qian
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2024, 85
  • [5] Deep learning frameworks for protein-protein interaction prediction
    Hu, Xiaotian
    Feng, Cong
    Ling, Tianyi
    Chen, Ming
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 3223 - 3233
  • [6] Protein-protein interaction prediction with deep learning: A comprehensive review
    Soleymani, Farzan
    Paquet, Eric
    Viktor, Herna
    Michalowski, Wojtek
    Spinello, Davide
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 5316 - 5341
  • [7] AutoPPI: An Ensemble of Deep Autoencoders for Protein-Protein Interaction Prediction
    Czibula, Gabriela
    Albu, Alexandra-Ioana
    Bocicor, Maria Iuliana
    Chira, Camelia
    ENTROPY, 2021, 23 (06)
  • [8] A mixture of feature experts approach for protein-protein interaction prediction
    Yanjun Qi
    Judith Klein-Seetharaman
    Ziv Bar-Joseph
    BMC Bioinformatics, 8
  • [9] A mixture of feature experts approach for protein-protein interaction prediction
    Qi, Yanjun
    Klein-Seetharaman, Judith
    Bar-Joseph, Ziv
    BMC BIOINFORMATICS, 2007, 8 (Suppl 10)
  • [10] Novel Domain Identification Approach for Protein-protein Interaction Prediction
    Shatnawi, Maad
    Zaki, Nazar M.
    2015 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2015, : 145 - 152