A novel method for identifying disease associated protein complexes based on functional similarity protein complex networks

被引:13
|
作者
Le, Duc-Hau [1 ]
机构
[1] Water Resources Univ, Sch Comp Sci & Engn, Hanoi, Vietnam
关键词
Disease protein complex; Functional similarity protein complex network; Neighborhood-based algorithm; Prostate cancer; GENE ONTOLOGY; CANCER; IDENTIFICATION; PREDICT;
D O I
10.1186/s13015-015-0044-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Protein complexes formed by non-covalent interaction among proteins play important roles in cellular functions. Computational and purification methods have been used to identify many protein complexes and their cellular functions. However, their roles in terms of causing disease have not been well discovered yet. There exist only a few studies for the identification of disease-associated protein complexes. However, they mostly utilize complicated heterogeneous networks which are constructed based on an out-of-date database of phenotype similarity network collected from literature. In addition, they only apply for diseases for which tissue-specific data exist. Methods: In this study, we propose a method to identify novel disease-protein complex associations. First, we introduce a framework to construct functional similarity protein complex networks where two protein complexes are functionally connected by either shared protein elements, shared annotating GO terms or based on protein interactions between elements in each protein complex. Second, we propose a simple but effective neighborhood-based algorithm, which yields a local similarity measure, to rank disease candidate protein complexes. Results: Comparing the predictive performance of our proposed algorithm with that of two state-of-the-art network propagation algorithms including one we used in our previous study, we found that it performed statistically significantly better than that of these two algorithms for all the constructed functional similarity protein complex networks. In addition, it ran about 32 times faster than these two algorithms. Moreover, our proposed method always achieved high performance in terms of AUC values irrespective of the ways to construct the functional similarity protein complex networks and the used algorithms. The performance of our method was also higher than that reported in some existing methods which were based on complicated heterogeneous networks. Finally, we also tested our method with prostate cancer and selected the top 100 highly ranked candidate protein complexes. Interestingly, 69 of them were evidenced since at least one of their protein elements are known to be associated with prostate cancer. Conclusions: Our proposed method, including the framework to construct functional similarity protein complex networks and the neighborhood-based algorithm on these networks, could be used for identification of novel disease-protein complex associations.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] A novel method for identifying disease associated protein complexes based on functional similarity protein complex networks
    Duc-Hau Le
    [J]. Algorithms for Molecular Biology, 10
  • [2] A novel protein complex identifying method based on key protein (PCIM)
    Zhao, Junmin
    Zhang, Jingpu
    Ma, Yuanyuan
    Yang, Bin
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2020, 23 (04) : 343 - 359
  • [3] Filtering Gene Ontology semantic similarity for identifying protein complexes in large protein interaction networks
    Jian Wang
    Dong Xie
    Hongfei Lin
    Zhihao Yang
    Yijia Zhang
    [J]. Proteome Science, 10
  • [4] Filtering Gene Ontology semantic similarity for identifying protein complexes in large protein interaction networks
    Wang, Jian
    Xie, Dong
    Lin, Hongfei
    Yang, Zhihao
    Zhang, Yijia
    [J]. PROTEOME SCIENCE, 2012, 10
  • [5] Identifying Protein Complexes from PPI Networks Using GO Semantic Similarity
    Wang, Jian
    Xie, Dong
    Lin, Hongfei
    Yang, Zhihao
    Zhang, Yijia
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM 2011), 2011, : 582 - 585
  • [6] Identifying functions of protein complexes based on topology similarity with random forest
    Li, Zhan-Chao
    Lai, Yan-Hua
    Chen, Li-Li
    Xie, Yun
    Dai, Zong
    Zou, Xiao-Yong
    [J]. MOLECULAR BIOSYSTEMS, 2014, 10 (03) : 514 - 525
  • [7] A Core-Attach Based Method for Identifying Protein Complexes in Dynamic PPI Networks
    Luo, Jiawei
    Liu, Chengchen
    Hoang Tu Nguyen
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART II, 2015, 9078 : 228 - 239
  • [8] Identifying Protein Complexes in Dynamic Protein-Protein Interaction Networks Based on Cuckoo Search Algorithm
    Zhao, Jie
    Lei, Xiujuan
    Wu, Fang-Xiang
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 1288 - 1295
  • [9] Identifying protein complexes based on node embeddings obtained from protein-protein interaction networks
    Xiaoxia Liu
    Zhihao Yang
    Shengtian Sang
    Ziwei Zhou
    Lei Wang
    Yin Zhang
    Hongfei Lin
    Jian Wang
    Bo Xu
    [J]. BMC Bioinformatics, 19
  • [10] A novel method to predict protein complexes based on Gene Ontology in PPI networks
    [J]. Luo, J. (luojiawei@hnu.edu.cn), 1600, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (09):