Identification of Protein Complexes by Integrating Protein Abundance and Interaction Features Using a Deep Learning Strategy

被引:3
|
作者
Li, Bohui [1 ,2 ]
Altelaar, Maarten [1 ,2 ,3 ]
van Breukelen, Bas [1 ,2 ]
机构
[1] Biomol Mass Spectrometry & Prote, Padualaan 8, NL-3584 CH Utrecht, Netherlands
[2] Univ Utrecht, Utrecht Inst Pharmaceut Sci UIPS, Univ weg 99, NL-3584 CG Utrecht, Netherlands
[3] Netherlands Canc Inst, Mass Spectrometry & Prote Facil, NL-1066 CX Amsterdam, Netherlands
关键词
human protein-protein interaction; protein complexes; deep learning; data integration; proteomics; mass spectrometry; NUCLEAR IMPORT; NETWORK; DATABASE; RECRUITMENT; DYNAMICS; MCM10; PRC2;
D O I
10.3390/ijms24097884
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Many essential cellular functions are carried out by multi-protein complexes that can be characterized by their protein-protein interactions. The interactions between protein subunits are critically dependent on the strengths of their interactions and their cellular abundances, both of which span orders of magnitude. Despite many efforts devoted to the global discovery of protein complexes by integrating large-scale protein abundance and interaction features, there is still room for improvement. Here, we integrated >7000 quantitative proteomic samples with three published affinity purification/co-fractionation mass spectrometry datasets into a deep learning framework to predict protein-protein interactions (PPIs), followed by the identification of protein complexes using a two-stage clustering strategy. Our deep-learning-technique-based classifier significantly outperformed recently published machine learning prediction models and in the process captured 5010 complexes containing over 9000 unique proteins. The vast majority of proteins in our predicted complexes exhibited low or no tissue specificity, which is an indication that the observed complexes tend to be ubiquitously expressed throughout all cell types and tissues. Interestingly, our combined approach increased the model sensitivity for low abundant proteins, which amongst other things allowed us to detect the interaction of MCM10, which connects to the replicative helicase complex via the MCM6 protein. The integration of protein abundances and their interaction features using a deep learning approach provided a comprehensive map of protein-protein interactions and a unique perspective on possible novel protein complexes.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Identification of protein complexes from multi-relationship protein interaction networks
    Li, Xueyong
    Wang, Jianxin
    Zhao, Bihai
    Wu, Fang-Xiang
    Pan, Yi
    HUMAN GENOMICS, 2016, 10
  • [42] Protein design using deep learning
    Baker, David
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2023, 299 (03) : S164 - S164
  • [43] Accurate refinement of docked protein complexes using evolutionary information and deep learning
    Akbal-Delibas, Bahar
    Farhoodi, Roshanak
    Pomplun, Marc
    Haspel, Nurit
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2016, 14 (03)
  • [44] Protein-protein interaction identification using a hybrid model
    Niu, Yun
    Wang, Yuwei
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2015, 64 (03) : 185 - 193
  • [45] Drug Protein Interaction Extraction Using SciBERT Based Deep Learning Model
    GabAllah, Nada
    Rafea, Ahmed
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INNOVATIONS IN COMPUTING RESEARCH (ICR'22), 2022, 1431 : 157 - 165
  • [46] Protein Features Identification for Machine Learning-Based Prediction of Protein-Protein Interactions
    Raza, Khalid
    INFORMATION, COMMUNICATION AND COMPUTING TECHNOLOGY, 2017, 750 : 305 - 317
  • [47] Semi-supervised Learning Protein Complexes from Protein Interaction Networks
    Shi, Lei
    Zhang, Aidong
    2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2010, : 247 - 252
  • [48] Predicting Protein Complexes in Protein Interaction Networks: A Supervised Learning Based Method
    Yu, Fengying
    Yang, Zhihao
    Tang, Nan
    Wu, Jiajin
    Lin, Hongfei
    Wang, Jian
    2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,
  • [49] An integration of deep learning with feature embedding for protein-protein interaction prediction
    Yao, Yu
    Du, Xiuquan
    Diao, Yanyu
    Zhu, Huaixu
    PEERJ, 2019, 7
  • [50] SENSDeep: An Ensemble Deep Learning Method for Protein–Protein Interaction Sites Prediction
    Engin Aybey
    Özgür Gümüş
    Interdisciplinary Sciences: Computational Life Sciences, 2023, 15 : 55 - 87