Identification of Protein Complexes by Integrating Protein Abundance and Interaction Features Using a Deep Learning Strategy

被引:3
|
作者
Li, Bohui [1 ,2 ]
Altelaar, Maarten [1 ,2 ,3 ]
van Breukelen, Bas [1 ,2 ]
机构
[1] Biomol Mass Spectrometry & Prote, Padualaan 8, NL-3584 CH Utrecht, Netherlands
[2] Univ Utrecht, Utrecht Inst Pharmaceut Sci UIPS, Univ weg 99, NL-3584 CG Utrecht, Netherlands
[3] Netherlands Canc Inst, Mass Spectrometry & Prote Facil, NL-1066 CX Amsterdam, Netherlands
关键词
human protein-protein interaction; protein complexes; deep learning; data integration; proteomics; mass spectrometry; NUCLEAR IMPORT; NETWORK; DATABASE; RECRUITMENT; DYNAMICS; MCM10; PRC2;
D O I
10.3390/ijms24097884
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Many essential cellular functions are carried out by multi-protein complexes that can be characterized by their protein-protein interactions. The interactions between protein subunits are critically dependent on the strengths of their interactions and their cellular abundances, both of which span orders of magnitude. Despite many efforts devoted to the global discovery of protein complexes by integrating large-scale protein abundance and interaction features, there is still room for improvement. Here, we integrated >7000 quantitative proteomic samples with three published affinity purification/co-fractionation mass spectrometry datasets into a deep learning framework to predict protein-protein interactions (PPIs), followed by the identification of protein complexes using a two-stage clustering strategy. Our deep-learning-technique-based classifier significantly outperformed recently published machine learning prediction models and in the process captured 5010 complexes containing over 9000 unique proteins. The vast majority of proteins in our predicted complexes exhibited low or no tissue specificity, which is an indication that the observed complexes tend to be ubiquitously expressed throughout all cell types and tissues. Interestingly, our combined approach increased the model sensitivity for low abundant proteins, which amongst other things allowed us to detect the interaction of MCM10, which connects to the replicative helicase complex via the MCM6 protein. The integration of protein abundances and their interaction features using a deep learning approach provided a comprehensive map of protein-protein interactions and a unique perspective on possible novel protein complexes.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Deep Contextual Representation Learning for Identifying Essential Proteins via Integrating Multisource Protein Features
    LI Weihua
    LIU Wenyang
    GUO Yanbu
    WANG Bingyi
    QING Hua
    Chinese Journal of Electronics, 2023, 32 (04) : 868 - 881
  • [22] Sequence-based prediction of protein protein interaction using a deep-learning algorithm
    Tanlin Sun
    Bo Zhou
    Luhua Lai
    Jianfeng Pei
    BMC Bioinformatics, 18
  • [23] Sequence-based prediction of protein protein interaction using a deep-learning algorithm
    Sun, Tanlin
    Zhou, Bo
    Lai, Luhua
    Pei, Jianfeng
    BMC BIOINFORMATICS, 2017, 18
  • [24] Structure-Based Approaches for Protein-Protein Interaction Prediction Using Machine Learning and Deep Learning
    Kiouri, Despoina P.
    Batsis, Georgios C.
    Chasapis, Christos T.
    BIOMOLECULES, 2025, 15 (01)
  • [25] A New Strategy for Protein Interface Identification Using Manifold Learning Method
    Wang, Bing
    Huang, De-Shuang
    Jiang, Changjun
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2014, 13 (02) : 118 - 123
  • [26] Class-Balanced Protein Interaction Site Prediction Using Global and Local Features with XGBoost and Deep Learning
    Bharath C. Kulkarni
    Bommineni Shiva Sai
    Varun Kolagad
    Nagamma Patil
    Prajna Bhat
    SN Computer Science, 6 (2)
  • [27] Key protein identification by integrating protein complex information and multi-biological features
    Han, Yongyin
    Liu, Maolin
    Wang, Zhixiao
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (10) : 18191 - 18206
  • [28] Deep Learning Deciphers Protein–RNA Interaction
    Ming Li
    Genomics,Proteomics & Bioinformatics, 2019, (05) : 475 - 477
  • [29] Label-free identification of protein aggregates using deep learning
    Ibrahim K.A.
    Grußmayer K.S.
    Riguet N.
    Feletti L.
    Lashuel H.A.
    Radenovic A.
    Nature Communications, 14 (1)
  • [30] Identification of sub-Golgi protein localization by use of deep representation learning features
    Lv, Zhibin
    Wang, Pingping
    Zou, Quan
    Jiang, Qinghua
    BIOINFORMATICS, 2021, 36 (24) : 5600 - 5609