A novel essential protein identification method based on PPI networks and gene expression data

被引:35
|
作者
Zhong, Jiancheng [1 ,2 ]
Tang, Chao [1 ]
Peng, Wei [3 ]
Xie, Minzhu [1 ]
Sun, Yusui [1 ]
Tang, Qiang [4 ]
Xiao, Qiu [1 ]
Yang, Jiahong [1 ]
机构
[1] Hunan Normal Univ, Sch Informat Sci & Engn, Changsha 410081, Peoples R China
[2] Hunan Prov Key Lab Intelligent Comp & Language In, Hunan Prov Key Lab Bioinformat, Sch Comp Sci & Engn, Changsha 410083, Peoples R China
[3] Kunming Univ Sci & Technol, Coll Informat Engn & Automat, Kunming 650500, Yunnan, Peoples R China
[4] Hunan Normal Univ, Coll Engn & Design, Changsha 410081, Peoples R China
基金
中国国家自然科学基金;
关键词
Essential proteins; The PPI networks; Jaccard similarity index; Edge clustering coefficient; SUBCELLULAR-LOCALIZATION; GENOME; CENTRALITY; PREDICTION; ANNOTATION; ORTHOLOGY; DELETION;
D O I
10.1186/s12859-021-04175-8
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundSome proposed methods for identifying essential proteins have better results by using biological information. Gene expression data is generally used to identify essential proteins. However, gene expression data is prone to fluctuations, which may affect the accuracy of essential protein identification. Therefore, we propose an essential protein identification method based on gene expression and the PPI network data to calculate the similarity of "active" and "inactive" state of gene expression in a cluster of the PPI network. Our experiments show that the method can improve the accuracy in predicting essential proteins.ResultsIn this paper, we propose a new measure named JDC, which is based on the PPI network data and gene expression data. The JDC method offers a dynamic threshold method to binarize gene expression data. After that, it combines the degree centrality and Jaccard similarity index to calculate the JDC score for each protein in the PPI network. We benchmark the JDC method on four organisms respectively, and evaluate our method by using ROC analysis, modular analysis, jackknife analysis, overlapping analysis, top analysis, and accuracy analysis. The results show that the performance of JDC is better than DC, IC, EC, SC, BC, CC, NC, PeC, and WDC. We compare JDC with both NF-PIN and TS-PIN methods, which predict essential proteins through active PPI networks constructed from dynamic gene expression.ConclusionsWe demonstrate that the new centrality measure, JDC, is more efficient than state-of-the-art prediction methods with same input. The main ideas behind JDC are as follows: (1) Essential proteins are generally densely connected clusters in the PPI network. (2) Binarizing gene expression data can screen out fluctuations in gene expression profiles. (3) The essentiality of the protein depends on the similarity of "active" and "inactive" state of gene expression in a cluster of the PPI network.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] United Complex Centrality for Identification of Essential Proteins from PPI Networks
    Li, Min
    Lu, Yu
    Niu, Zhibei
    Wu, Fang-Xiang
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2017, 14 (02) : 370 - 380
  • [22] NAIGO: An Improved Method to Align PPI Networks Based on Gene Ontology and Graphlets
    Zhu, Lijuan
    Zhang, Ju
    Zhang, Yi
    Lang, Jidong
    Xiang, Ju
    Bai, Xiaogang
    Yan, Na
    Tian, Geng
    Zhang, Huajun
    Yang, Jialiang
    [J]. FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2020, 8
  • [23] Protein functional module identification method combining topological features and gene expression data
    Zihao Zhao
    Wenjun Xu
    Aiwen Chen
    Yueyue Han
    Shengrong Xia
    ChuLei Xiang
    Chao Wang
    Jun Jiao
    Hui Wang
    Xiaohui Yuan
    Lichuan Gu
    [J]. BMC Genomics, 22
  • [24] Protein functional module identification method combining topological features and gene expression data
    Zhao, Zihao
    Xu, Wenjun
    Chen, Aiwen
    Han, Yueyue
    Xia, Shengrong
    Xiang, ChuLei
    Wang, Chao
    Jiao, Jun
    Wang, Hui
    Yuan, Xiaohui
    Gu, Lichuan
    [J]. BMC GENOMICS, 2021, 22 (01)
  • [25] Identification of modules of hepatic encephalopathy based on protein-protein network and gene expression data
    Wu, Hao
    Liu, Miao
    Zhuang, Jiajun
    [J]. EXPERIMENTAL AND THERAPEUTIC MEDICINE, 2018, 15 (05) : 4344 - 4348
  • [26] A core-attachment based method to detect protein complexes in PPI networks
    Min Wu
    Xiaoli Li
    Chee-Keong Kwoh
    See-Kiong Ng
    [J]. BMC Bioinformatics, 10
  • [27] A core-attachment based method to detect protein complexes in PPI networks
    Wu, Min
    Li, Xiaoli
    Kwoh, Chee-Keong
    Ng, See-Kiong
    [J]. BMC BIOINFORMATICS, 2009, 10
  • [28] Predicting essential proteins by integrating orthology, gene expressions, and PPI networks
    Zhang, Xue
    Xiao, Wangxin
    Hu, Xihao
    [J]. PLOS ONE, 2018, 13 (04):
  • [29] A Novel Essential Protein Identification Algorithm Based on the Integration of Local Network Topology and Gene Ontology
    Luo Jiawei
    Liu Shunmin
    [J]. JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2014, 11 (03) : 619 - 624
  • [30] A method for predicting essential proteins using gene expression data
    Patil, Soumya B.
    Sekhar, S. R. Mani
    Siddesh, G. M.
    Manvi, Sunilkumar S.
    [J]. PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES FOR SMART NATION (SMARTTECHCON), 2017, : 1278 - 1281