An Ensemble Learning Framework for Detecting Protein Complexes From PPI Networks

Cited by: 6
Authors
Wang, Rongquan [1]
Ma, Huimin [1]
Wang, Caixia [2]
Affiliations
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing, Peoples R China
[2] China Foreign Affairs Univ, Sch Int Econ, Beijing, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
protein complexes; protein-protein interaction networks; graph clustering algorithms; ensemble learning; network embedding; biological information; FUNCTIONAL MODULES; IDENTIFICATION; ALGORITHM; ANNOTATION; DISCOVERY; MIPS;
DOI
10.3389/fgene.2022.839949
Chinese Library Classification
Q3 [Genetics];
Subject classification codes
071007 ; 090102 ;
Abstract
Detecting protein complexes is one of the keys to understanding the principles of cellular organization and cellular processes. With advances in high-throughput experiments and computing science, it has become possible to detect protein complexes by computational methods. However, most computational methods are based on either unsupervised learning or supervised learning. Unsupervised learning-based methods do not need training datasets, but they can detect protein complexes with only one or a few topological structures. Supervised learning-based methods can detect protein complexes with different topological structures. However, they usually rely on a single type of trained model, and a single model generalizes poorly. Therefore, we propose an Ensemble Learning Framework for Detecting Protein Complexes (ELF-DPC) within protein-protein interaction (PPI) networks to address these challenges. ELF-DPC first constructs a weighted PPI network by combining topological and biological information. Second, it mines protein complex cores using a protein complex core mining strategy that we designed. Third, it obtains an ensemble learning model by integrating structural modularity and a trained voting regressor model. Finally, it extends the protein complex cores and forms protein complexes by a graph heuristic search strategy. The experimental results demonstrate that ELF-DPC outperforms twelve state-of-the-art approaches. Moreover, functional enrichment analysis shows that ELF-DPC can detect biologically meaningful protein complexes. The code and datasets are freely available at https://github.com/RongquanWang/ELF-DPC.
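The final step of the abstract (extending a complex core by a graph heuristic search) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the ELF-DPC code: the names `weighted_density` and `extend_core`, the toy edge weights, the threshold, and above all the scoring function. The paper scores candidates with an ensemble of structural modularity and a trained voting regressor; here a plain weighted density stands in for that model.

```python
from typing import Callable, Dict, FrozenSet, Set

Edge = FrozenSet[str]
Weights = Dict[Edge, float]


def weighted_density(nodes: Set[str], weights: Weights) -> float:
    """Sum of internal edge weights divided by the number of node pairs.

    Hypothetical stand-in for the paper's ensemble scoring model.
    """
    n = len(nodes)
    if n < 2:
        return 0.0
    internal = sum(w for edge, w in weights.items() if edge <= nodes)
    return internal / (n * (n - 1) / 2)


def boundary(nodes: Set[str], weights: Weights) -> Set[str]:
    """Proteins outside `nodes` that share at least one edge with it."""
    out: Set[str] = set()
    for edge in weights:
        if len(edge & nodes) == 1:
            out |= edge - nodes
    return out


def extend_core(
    core: Set[str],
    weights: Weights,
    threshold: float,
    score: Callable[[Set[str], Weights], float] = weighted_density,
) -> Set[str]:
    """Greedily add the best-scoring neighbor while the score stays >= threshold."""
    complex_ = set(core)
    while True:
        candidates = sorted(boundary(complex_, weights))  # sorted for determinism
        if not candidates:
            return complex_
        best = max(candidates, key=lambda p: score(complex_ | {p}, weights))
        if score(complex_ | {best}, weights) < threshold:
            return complex_
        complex_.add(best)


# Toy weighted PPI network (weights are made up for the example).
W: Weights = {
    frozenset({"A", "B"}): 0.9,
    frozenset({"A", "C"}): 0.8,
    frozenset({"B", "C"}): 0.85,
    frozenset({"C", "D"}): 0.7,
    frozenset({"B", "D"}): 0.6,
    frozenset({"D", "E"}): 0.1,
}

result = extend_core({"A", "B", "C"}, W, threshold=0.5)
print(sorted(result))
```

In this toy network, protein D is attached to the core {A, B, C} because the extended set keeps a weighted density above the threshold, while the weakly connected E is left out. Swapping in a different `score` callable is where a learned model, such as the ensemble described in the abstract, would plug in.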
Pages: 22
Related papers
50 in total
  • [41] Detecting overlapping protein complexes in protein-protein interaction networks
    Nepusz T.
    Yu H.
    Paccanaro A.
    Nature Methods, 2012, 9 (5) : 471 - 472
  • [42] Prediction of problematic complexes from PPI networks: sparse, embedded, and small complexes
    Yong, Chern Han
    Wong, Limsoon
    BIOLOGY DIRECT, 2015, 10
  • [44] Detecting protein complexes and functional modules from protein interaction networks: A graph entropy approach
    Kenley, Edward Casey
    Cho, Young-Rae
    PROTEOMICS, 2011, 11 (19) : 3835 - 3844
  • [45] A partially shared joint clustering framework for detecting protein complexes from multiple state-specific signed interaction networks
    Zhan, Youlin
    Liu, Jiahan
    Wu, Min
    Tan, Chris Soon Heng
    Li, Xiaoli
    Ou-Yang, Le
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 159
  • [46] Discovering overlapped protein complexes from weighted PPI networks by removing inter-module hubs
    Maddi, A. M. A.
    Eslahchi, Ch.
    SCIENTIFIC REPORTS, 2017, 7
  • [47] Genetic Algorithm with Ensemble Learning for Detecting Community Structure in Complex Networks
    He, Dongxiao
    Wang, Zhe
    Yang, Bin
    Zhou, Chunguang
    ICCIT: 2009 FOURTH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND CONVERGENCE INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2009, : 702 - 707
  • [49] AN ONLINE ENSEMBLE LEARNING MODEL FOR DETECTING ATTACKS IN WIRELESS SENSOR NETWORKS
    Tabbaa, Hiba
    Ifzarne, Samir
    Hafidi, Imad
    COMPUTING AND INFORMATICS, 2023, 42 (04) : 1013 - 1036
  • [50] Identification of protein complexes and functional modules in E. coli PPI networks
    Ping Kong
    Gang Huang
    Wei Liu
    BMC Microbiology, 20