An ensemble framework for clustering protein-protein interaction networks

被引:130
|
作者
Asur, Sitaram [1 ]
Ucar, Duygu [1 ]
Parthasarathy, Srinivasan [1 ]
机构
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
关键词
D O I
10.1093/bioinformatics/btm212
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Protein - Protein Interaction (PPI) networks are believed to be important sources of information related to biological processes and complex metabolic functions of the cell. The presence of biologically relevant functional modules in these networks has been theorized by many researchers. However, the application of traditional clustering algorithms for extracting these modules has not been successful, largely due to the presence of noisy false positive interactions as well as specific topological challenges in the network. Results: In this article, we propose an ensemble clustering framework to address this problem. For base clustering, we introduce two topology-based distance metrics to counteract the effects of noise. We develop a PCA-based consensus clustering technique, designed to reduce the dimensionality of the consensus problem and yield informative clusters. We also develop a soft consensus clustering variant to assign multifaceted proteins to multiple functional groups. We conduct an empirical evaluation of different consensus techniques using topology-based, information theoretic and domain-specific validation metrics and show that our approaches can provide significant benefits over other state-of-the-art approaches. Our analysis of the consensus clusters obtained demonstrates that ensemble clustering can (a) produce improved biologically significant functional groupings; and (b) facilitate soft clustering by discovering multiple functional associations for proteins.
引用
下载
收藏
页码:I29 / I40
页数:12
相关论文
共 50 条
  • [31] Efficient and accurate identification of protein complexes from protein-protein interaction networks based on the clustering coefficient
    Omranian, Sara
    Angeleska, Angela
    Nikoloski, Zoran
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 5255 - 5263
  • [32] Protein complex identification through Markov clustering with firefly algorithm on dynamic protein-protein interaction networks
    Lei, Xiujuan
    Wang, Fei
    Wu, Fang-Xiang
    Zhang, Aidong
    Pedrycz, Witold
    INFORMATION SCIENCES, 2016, 329 : 303 - 316
  • [33] Detecting Overlapping Protein Complexes in Dynamic Protein-Protein Interaction Networks by Developing a Fuzzy Clustering Algorithm
    Yin, Ruiping
    Li, Kan
    Zhang, Guangquan
    Lu, Jie
    2017 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2017,
  • [34] Hub Promiscuity in Protein-Protein Interaction Networks
    Patil, Ashwini
    Kinoshita, Kengo
    Nakamura, Haruki
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2010, 11 (04) : 1930 - 1943
  • [35] Uncovering the structure of protein-protein interaction networks
    Przulj, N.
    Corneil, D.
    Jurisica, I.
    MOLECULAR & CELLULAR PROTEOMICS, 2005, 4 (08) : S54 - S54
  • [36] Unified Alignment of Protein-Protein Interaction Networks
    Malod-Dognin, Noel
    Ban, Kristina
    Przulj, Natasa
    SCIENTIFIC REPORTS, 2017, 7
  • [37] Interdependent Patterns in Protein-Protein Interaction Networks
    Sun, Peng Gang
    Quan, Yining
    Miao, Qiguang
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2020, 7 (04): : 3257 - 3265
  • [38] Unified Alignment of Protein-Protein Interaction Networks
    Noël Malod-Dognin
    Kristina Ban
    Nataša Pržulj
    Scientific Reports, 7
  • [39] Evolution of protein-protein interaction networks in yeast
    Schoenrock, Andrew
    Burnside, Daniel
    Moteshareie, Houman
    Pitre, Sylvain
    Hooshyar, Mohsen
    Green, James R.
    Golshani, Ashkan
    Dehne, Frank
    Wong, Alex
    PLOS ONE, 2017, 12 (03):
  • [40] Communities Analysis in Protein-protein Interaction Networks
    Li, Kan
    Pang, Yin
    2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,