Modifying the DPClus algorithm for identifying protein complexes based on new topological structures

被引:194
|
作者
Li, Min [1 ]
Chen, Jian-er [1 ,2 ]
Wang, Jian-xin [1 ]
Hu, Bin [1 ]
Chen, Gang [1 ]
机构
[1] Cent S Univ, Sch Informat Sci & Engn, Changsha 410083, Hunan, Peoples R China
[2] Texas A&M Univ, Dept Comp Sci, College Stn, TX 77843 USA
基金
美国国家科学基金会;
关键词
D O I
10.1186/1471-2105-9-398
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Identification of protein complexes is crucial for understanding principles of cellular organization and functions. As the size of protein-protein interaction set increases, a general trend is to represent the interactions as a network and to develop effective algorithms to detect significant complexes in such networks. Results: Based on the study of known complexes in protein networks, this paper proposes a new topological structure for protein complexes, which is a combination of subgraph diameter (or average vertex distance) and subgraph density. Following the approach of that of the previously proposed clustering algorithm DPClus which expands clusters starting from seeded vertices, we present a clustering algorithm IPCA based on the new topological structure for identifying complexes in large protein interaction networks. The algorithm IPCA is applied to the protein interaction network of Sacchromyces cerevisiae and identifies many well known complexes. Experimental results show that the algorithm IPCA recalls more known complexes than previously proposed clustering algorithms, including DPClus, CFinder, LCMA, MCODE, RNSC and STM. Conclusion: The proposed algorithm based on the new topological structure makes it possible to identify dense subgraphs in protein interaction networks, many of which correspond to known protein complexes. The algorithm is robust to the known high rate of false positives and false negatives in data from high-throughout interaction techniques. The program is available at http://netlab.csu.edu.cn/bioinformatics/limin/ IPCA.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Detecting overlapping protein complexes based on a generative model with functional and topological properties
    Zhang, Xiao-Fei
    Dai, Dao-Qing
    Le Ou-Yang
    Yan, Hong
    BMC BIOINFORMATICS, 2014, 15
  • [42] A New Sequential Forward Feature Selection (SFFS) Algorithm for Mining Best Topological and Biological Features to Predict Protein Complexes from Protein-Protein Interaction Networks (PPINs)
    Younis, Haseeb
    Anwar, Muhammad Waqas
    Khan, Muhammad Usman Ghani
    Sikandar, Aisha
    Bajwa, Usama Ijaz
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2021, 13 (03) : 371 - 388
  • [43] Overlapping Protein Complexes Detection Based on Multi-level Topological Similarities
    Wang, Wenkang
    Meng, Xiangmao
    Xiang, Ju
    Li, Min
    BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2021, 2021, 13064 : 215 - 226
  • [44] Detecting overlapping protein complexes based on a generative model with functional and topological properties
    Xiao-Fei Zhang
    Dao-Qing Dai
    Le Ou-Yang
    Hong Yan
    BMC Bioinformatics, 15
  • [45] A modifying algorithm of the topological VLSI layer by dummy filling features based on modeling the chemical-mechanical planarization
    Amirkhanov A.V.
    Gladkykh A.A.
    Makarchuk V.V.
    Stolyarov A.A.
    Shakhnov V.A.
    Russian Microelectronics, 2014, 43 (01) : 72 - 79
  • [46] UDoNC: An Algorithm for Identifying Essential Proteins Based on Protein Domains and Protein-Protein Interaction Networks
    Peng, Wei
    Wang, Jianxin
    Cheng, Yingjiao
    Lu, Yu
    Wu, Fangxiang
    Pan, Yi
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (02) : 276 - 288
  • [47] Structure prediction of protein complexes by an NMR-based protein docking algorithm
    Kohlbacher, O
    Burchardt, A
    Moll, A
    Hildebrandt, A
    Bayer, P
    Lenhof, HP
    JOURNAL OF BIOMOLECULAR NMR, 2001, 20 (01) : 15 - 21
  • [48] Structure prediction of protein complexes by an NMR-based protein docking algorithm
    Oliver Kohlbacher
    Andreas Burchardt
    Andreas Moll
    Andreas Hildebrandt
    Peter Bayer
    Hans-Peter Lenhof
    Journal of Biomolecular NMR, 2001, 20 : 15 - 21
  • [49] STRUCTURE PREDICTION ALGORITHM FOR PROTEIN COMPLEXES BASED ON GENE ONTOLOGY
    Hadarovich, Anna Yu
    Anishchenko, Ivan, V
    Kundrotas, Petras
    Vakser, Ilya
    Tuzikov, Alexander, V
    DOKLADY NATSIONALNOI AKADEMII NAUK BELARUSI, 2020, 64 (02): : 150 - 158
  • [50] Identification of protein complexes algorithm based on random walk model
    Dong Xuantong
    Lin Zhijie
    Ren Yuan
    2014 2ND INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2014, : 383 - 388