A Max-Flow-Based Approach to the Identification of Protein Complexes Using Protein Interaction and Microarray Data

Cited by: 40
Authors
Feng, Jianxing [1]
Jiang, Rui [2]
Jiang, Tao [3]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Tsinghua Univ, MOE Key Lab Bioinformat, Bioinformat Div, TNLIST,Dept Automat, Beijing 100084, Peoples R China
[3] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92521 USA
Funding
US National Science Foundation
Keywords
Protein complex; protein-protein interaction network; microarray; dense subgraph; maximum network flow; efficient algorithm; FUNCTIONAL MODULES; INTERACTION NETWORKS; EXPRESSION PROFILES; YEAST; ALGORITHM;
DOI
10.1109/TCBB.2010.78
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
The emergence of high-throughput technologies has produced abundant protein-protein interaction (PPI) data and microarray gene expression profiles, providing a great opportunity to identify novel protein complexes using computational methods. By combining these two types of data, we propose a novel Graph Fragmentation Algorithm (GFA) for protein complex identification. Adapted from a classical max-flow algorithm for finding the (weighted) densest subgraphs, GFA first finds large (weighted) dense subgraphs in a protein-protein interaction network, and then breaks each such subgraph into fragments iteratively, weighting its nodes according to their corresponding log-fold changes in the microarray data, until the fragment subgraphs are sufficiently small. Our tests on three widely used protein-protein interaction data sets, and comparisons with several recent methods for protein complex identification, demonstrate the strong performance of our method in predicting novel protein complexes in terms of its specificity and efficiency. Given the high specificity (or precision) that our method has achieved, we conjecture that our prediction results imply more than 200 novel protein complexes.
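The abstract does not say which classical max-flow algorithm GFA adapts. As a rough illustration of the general idea only, the sketch below finds an unweighted densest subgraph (maximizing edges per node) using the min-cut construction commonly attributed to Goldberg. It is not GFA: it omits the node weighting by log-fold change and the fragmentation step. The function names and the toy graph are hypothetical, and the binary search of the textbook version is swapped for a Dinkelbach-style update of the density guess.

```python
from collections import deque
from fractions import Fraction


def min_cut_source_side(cap, s, t):
    """Run Edmonds-Karp on the residual capacities `cap` (modified in place)
    and return the set of nodes still reachable from s once no augmenting
    path remains, i.e. the source side of a minimum cut."""
    while True:
        parent = {s: None}
        queue = deque([s])
        while queue and t not in parent:
            u = queue.popleft()
            for v, c in cap[u].items():
                if c > 0 and v not in parent:
                    parent[v] = u
                    queue.append(v)
        if t not in parent:
            return set(parent)
        # Walk back from t to s, then push the bottleneck capacity.
        path, v = [], t
        while parent[v] is not None:
            path.append((parent[v], v))
            v = parent[v]
        push = min(cap[u][v] for u, v in path)
        for u, v in path:
            cap[u][v] -= push
            cap[v][u] = cap[v].get(u, 0) + push


def densest_subgraph(edges):
    """Return (node set, density) maximizing |E(S)| / |S|.
    Assumes a simple undirected graph: no self-loops, no duplicate edges."""
    nodes = list({v for e in edges for v in e})
    deg = {v: 0 for v in nodes}
    for u, v in edges:
        deg[u] += 1
        deg[v] += 1
    m, n = len(edges), len(nodes)
    best, g = set(nodes), Fraction(m, n)  # start from the whole graph
    s, t = ("source",), ("sink",)         # tuples avoid clashing with node names
    while True:
        # Capacities are scaled by the denominator q of g = p/q so they stay integers.
        p, q = g.numerator, g.denominator
        cap = {x: {} for x in nodes + [s, t]}
        for v in nodes:
            cap[s][v] = m * q
            cap[v][t] = m * q + 2 * p - q * deg[v]
        for u, v in edges:
            cap[u][v] = q
            cap[v][u] = q
        side = min_cut_source_side(cap, s, t) - {s}
        inside = sum(1 for u, v in edges if u in side and v in side)
        # Stop when the cut no longer exposes a strictly denser subgraph.
        if not side or Fraction(inside, len(side)) <= g:
            return best, g
        best, g = side, Fraction(inside, len(side))


# Toy example: a 4-clique {a, b, c, d} with a tail d-e-f attached.
toy_edges = [("a", "b"), ("a", "c"), ("a", "d"), ("b", "c"), ("b", "d"),
             ("c", "d"), ("d", "e"), ("e", "f")]
dense_nodes, density = densest_subgraph(toy_edges)
```

On the toy graph the 4-clique has 6 edges on 4 nodes (density 3/2), which beats the whole graph (8 edges on 6 nodes), so the clique is returned. Exact rational densities are used so that the stopping test involves no floating-point tolerance.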
Pages: 621-634
Page count: 14