An Ensemble Learning Framework for Detecting Protein Complexes From PPI Networks

被引:5
|
作者
Wang, Rongquan [1 ]
Ma, Huimin [1 ]
Wang, Caixia [2 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing, Peoples R China
[2] China Foreign Affairs Univ, Sch Int Econ, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
protein complexes; protein-protein interaction networks; graph clustering algorithms; ensemble learning; network embedding; biological information; FUNCTIONAL MODULES; IDENTIFICATION; ALGORITHM; ANNOTATION; DISCOVERY; MIPS;
D O I
10.3389/fgene.2022.839949
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Detecting protein complexes is one of the keys to understanding cellular organization and processes principles. With high-throughput experiments and computing science development, it has become possible to detect protein complexes by computational methods. However, most computational methods are based on either unsupervised learning or supervised learning. Unsupervised learning-based methods do not need training datasets, but they can only detect one or several topological protein complexes. Supervised learning-based methods can detect protein complexes with different topological structures. However, they are usually based on a type of training model, and the generalization of a single model is poor. Therefore, we propose an Ensemble Learning Framework for Detecting Protein Complexes (ELF-DPC) within protein-protein interaction (PPI) networks to address these challenges. The ELF-DPC first constructs the weighted PPI network by combining topological and biological information. Second, it mines protein complex cores using the protein complex core mining strategy we designed. Third, it obtains an ensemble learning model by integrating structural modularity and a trained voting regressor model. Finally, it extends the protein complex cores and forms protein complexes by a graph heuristic search strategy. The experimental results demonstrate that ELF-DPC performs better than the twelve state-of-the-art approaches. Moreover, functional enrichment analysis illustrated that ELF-DPC could detect biologically meaningful protein complexes. The code/dataset is available for free download from https://github.com/RongquanWang/ELF-DPC.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Spectral Clustering for Detecting Protein Complexes in PPI Networks
    Qin, Guimin
    Gao, Lin
    [J]. 2009 FOURTH INTERNATIONAL CONFERENCE ON BIO-INSPIRED COMPUTING: THEORIES AND APPLICATIONS, PROCEEDINGS, 2009, : 175 - 182
  • [2] Detecting overlapping protein complexes in PPI networks based on robustness
    Wang, Shuliang
    Wu, Fang
    [J]. PROTEOME SCIENCE, 2013, 11
  • [3] Detecting overlapping protein complexes in PPI networks based on robustness
    Shuliang Wang
    Fang Wu
    [J]. Proteome Science, 11
  • [4] Spectral clustering for detecting protein complexes in protein-protein interaction (PPI) networks
    Qin, Guimin
    Gao, Lin
    [J]. MATHEMATICAL AND COMPUTER MODELLING, 2010, 52 (11-12) : 2066 - 2074
  • [5] CPredictor 4.0: effectively detecting protein complexes in weighted dynamic PPI networks
    Shi, Yunjia
    Yao, Heng
    Guan, Jihong
    Zhou, Shuigeng
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2018, 20 (04) : 303 - 319
  • [6] CPredictor3.0: detecting protein complexes from PPI networks with expression data and functional annotations
    Xu, Ying
    Zhou, Jiaogen
    Zhou, Shuigeng
    Guan, Jihong
    [J]. BMC SYSTEMS BIOLOGY, 2017, 11
  • [7] Protein Complexes Prediction via Positive and Unlabeled Learning of the PPI networks
    Zhao, Jichao
    Liang, Xun
    Wang, Yi
    Xu, Zhiming
    Liu, Yu
    [J]. 2016 13TH INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT, 2016,
  • [8] EnPC: An Ensemble Clustering Framework for Detecting Protein Complexes in Protein-Protein Interaction Network
    Dai, Qiguo
    Duan, Xiaodong
    Guo, Maozu
    Guo, Yingjie
    [J]. CURRENT PROTEOMICS, 2016, 13 (02) : 143 - 150
  • [9] Identifying protein complexes and functional modules-from static PPI networks to dynamic PPI networks
    Chen, Bolin
    Fan, Weiwei
    Liu, Juan
    Wu, Fang-Xiang
    [J]. BRIEFINGS IN BIOINFORMATICS, 2014, 15 (02) : 177 - 194
  • [10] Unsupervised methods for finding protein complexes from PPI networks
    Sharma P.
    Ahmed H.A.
    Roy S.
    Bhattacharyya D.K.
    [J]. Network Modeling Analysis in Health Informatics and Bioinformatics, 2015, 4 (1)