An ensemble framework for clustering protein-protein interaction networks

Cited by: 130
Authors
Asur, Sitaram [1]
Ucar, Duygu [1]
Parthasarathy, Srinivasan [1]
Affiliations
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
DOI
10.1093/bioinformatics/btm212
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Motivation: Protein-Protein Interaction (PPI) networks are believed to be important sources of information related to biological processes and complex metabolic functions of the cell. The presence of biologically relevant functional modules in these networks has been theorized by many researchers. However, the application of traditional clustering algorithms for extracting these modules has not been successful, largely due to the presence of noisy false-positive interactions as well as specific topological challenges in the network.
Results: In this article, we propose an ensemble clustering framework to address this problem. For base clustering, we introduce two topology-based distance metrics to counteract the effects of noise. We develop a PCA-based consensus clustering technique, designed to reduce the dimensionality of the consensus problem and yield informative clusters. We also develop a soft consensus clustering variant to assign multifaceted proteins to multiple functional groups. We conduct an empirical evaluation of different consensus techniques using topology-based, information-theoretic and domain-specific validation metrics and show that our approaches can provide significant benefits over other state-of-the-art approaches. Our analysis of the consensus clusters obtained demonstrates that ensemble clustering can (a) produce improved, biologically significant functional groupings; and (b) facilitate soft clustering by discovering multiple functional associations for proteins.
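The general idea of combining several base clusterings into one consensus can be illustrated with a small sketch. This is not the PCA-based consensus technique the abstract describes; it swaps in a simpler co-association (pairwise agreement) scheme, and the function names, the 0.5 threshold, and the toy label vectors are all assumptions made for illustration only.

```python
from itertools import combinations


def co_association(base_clusterings, n_items):
    """Fraction of base clusterings that put each pair of items in the same cluster."""
    counts = [[0.0] * n_items for _ in range(n_items)]
    for labels in base_clusterings:
        for i, j in combinations(range(n_items), 2):
            if labels[i] == labels[j]:
                counts[i][j] += 1
                counts[j][i] += 1
    m = len(base_clusterings)
    return [[c / m for c in row] for row in counts]


def consensus_clusters(base_clusterings, n_items, threshold=0.5):
    """Merge items whose pairwise agreement exceeds `threshold` (connected components)."""
    agree = co_association(base_clusterings, n_items)
    parent = list(range(n_items))

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, j in combinations(range(n_items), 2):
        if agree[i][j] > threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(n_items):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Hypothetical base clusterings of six proteins (indices 0..5).
# Cluster labels need not match across clusterings; only co-membership matters.
base = [
    [0, 0, 0, 1, 1, 1],
    [0, 0, 1, 1, 1, 1],
    [1, 1, 1, 0, 0, 0],
]
clusters = consensus_clusters(base, n_items=6)
print(clusters)
```

In this toy input, protein 2 is grouped with proteins 0 and 1 by two of the three base clusterings, so the majority vote keeps it with them. A soft variant could instead report the agreement fractions directly rather than thresholding them.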
Pages: I29-I40
Page count: 12