An iteration model for identifying essential proteins by combining comprehensive PPI network with biological information

被引:5
|
作者
Li, Shiyuan [1 ,2 ]
Zhang, Zhen [3 ]
Li, Xueyong [1 ,2 ]
Tan, Yihong [1 ,2 ]
Wang, Lei [1 ,2 ]
Chen, Zhiping [1 ,2 ]
机构
[1] Changsha Univ, Coll Comp Engn & Appl Math, Changsha 410022, Peoples R China
[2] Changsha Univ, Hunan Prov Key Lab Ind Internet Technol & Secur, Changsha 410022, Peoples R China
[3] Changsha Univ, Coll Elect Informat & Elect Engn, Changsha 410022, Peoples R China
基金
中国国家自然科学基金;
关键词
Essential proteins; Orthologous proteins; Multiplex biological networks; Subcellular localization information; SUBCELLULAR-LOCALIZATION; GENE ONTOLOGY; CENTRALITY; IDENTIFICATION; ANNOTATION; COMPLEXES; DATABASE; TOOL;
D O I
10.1186/s12859-021-04300-7
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background Essential proteins have great impacts on cell survival and development, and played important roles in disease analysis and new drug design. However, since it is inefficient and costly to identify essential proteins by using biological experiments, then there is an urgent need for automated and accurate detection methods. In recent years, the recognition of essential proteins in protein interaction networks (PPI) has become a research hotspot, and many computational models for predicting essential proteins have been proposed successively. Results In order to achieve higher prediction performance, in this paper, a new prediction model called TGSO is proposed. In TGSO, a protein aggregation degree network is constructed first by adopting the node density measurement method for complex networks. And simultaneously, a protein co-expression interactive network is constructed by combining the gene expression information with the network connectivity, and a protein co-localization interaction network is constructed based on the subcellular localization data. And then, through integrating these three kinds of newly constructed networks, a comprehensive protein-protein interaction network will be obtained. Finally, based on the homology information, scores can be calculated out iteratively for different proteins, which can be utilized to estimate the importance of proteins effectively. Moreover, in order to evaluate the identification performance of TGSO, we have compared TGSO with 13 different latest competitive methods based on three kinds of yeast databases. And experimental results show that TGSO can achieve identification accuracies of 94%, 82% and 72% out of the top 1%, 5% and 10% candidate proteins respectively, which are to some degree superior to these state-of-the-art competitive models. Conclusions We constructed a comprehensive interactive network based on multi-source data to reduce the noise and errors in the initial PPI, and combined with iterative methods to improve the accuracy of necessary protein prediction, and means that TGSO may be conducive to the future development of essential protein recognition as well.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] An iteration model for identifying essential proteins by combining comprehensive PPI network with biological information
    Shiyuan Li
    Zhen Zhang
    Xueyong Li
    Yihong Tan
    Lei Wang
    Zhiping Chen
    [J]. BMC Bioinformatics, 22
  • [2] Identification of Essential Proteins by Using Complexes and Biological Information on Dynamic PPI Network
    Liu, Wei
    Ma, Liangyu
    Chen, Ling
    [J]. 2019 SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2019, : 127 - 132
  • [3] An Iteration Method for Identifying Yeast Essential Proteins From Weighted PPI Network Based on Topological and Functional Features of Proteins
    Li, Shiyuan
    Chen, Zhiping
    He, Xin
    Zhang, Zhen
    Pei, Tingrui
    Tan, Yihong
    Wang, Lei
    [J]. IEEE ACCESS, 2020, 8 : 90792 - 90804
  • [4] Identifying Essential Proteins in Dynamic PPI Network with Improved FOA
    Lei, X.
    Wang, S.
    Pan, L.
    [J]. INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2018, 13 (03) : 365 - 382
  • [5] An iteration method for identifying yeast essential proteins from heterogeneous network
    Bihai Zhao
    Yulin Zhao
    Xiaoxia Zhang
    Zhihong Zhang
    Fan Zhang
    Lei Wang
    [J]. BMC Bioinformatics, 20
  • [6] An iteration method for identifying yeast essential proteins from heterogeneous network
    Zhao, Bihai
    Zhao, Yulin
    Zhang, Xiaoxia
    Zhang, Zhihong
    Zhang, Fan
    Wang, Lei
    [J]. BMC BIOINFORMATICS, 2019, 20 (1)
  • [7] A new computational strategy for identifying essential proteins based on network topological properties and biological information
    Qin, Chao
    Sun, Yongqi
    Dong, Yadong
    [J]. PLOS ONE, 2017, 12 (07):
  • [8] Prediction of Essential Proteins by Integration of PPI Network Topology and Protein Complexes Information
    Ren, Jun
    Wang, Jianxin
    Li, Min
    Wang, Huan
    Liu, Binbin
    [J]. BIOINFORMATICS RESEARCH AND APPLICATIONS, 2011, 6674 : 12 - 24
  • [9] A deep learning framework for identifying essential proteins based on multiple biological information
    Yi Yue
    Chen Ye
    Pei-Yun Peng
    Hui-Xin Zhai
    Iftikhar Ahmad
    Chuan Xia
    Yun-Zhi Wu
    You-Hua Zhang
    [J]. BMC Bioinformatics, 23
  • [10] A deep learning framework for identifying essential proteins based on multiple biological information
    Yue, Yi
    Ye, Chen
    Peng, Pei-Yun
    Zhai, Hui-Xin
    Ahmad, Iftikhar
    Xia, Chuan
    Wu, Yun-Zhi
    Zhang, You-Hua
    [J]. BMC BIOINFORMATICS, 2022, 23 (01)