An iteration method for identifying yeast essential proteins from heterogeneous network

被引:25
|
作者
Zhao, Bihai [1 ,3 ]
Zhao, Yulin [1 ]
Zhang, Xiaoxia [1 ]
Zhang, Zhihong [1 ]
Zhang, Fan [1 ]
Wang, Lei [1 ,2 ]
机构
[1] Changsha Univ, Coll Comp Engn & Appl Math, Changsha 410022, Hunan, Peoples R China
[2] Xiangtan Univ, Coll Informat Engn, Xiangtan 411105, Hunan, Peoples R China
[3] Changsha Univ, Dept Biol & Environm Engn, Hunan Prov Key Lab Nutr & Qual Control Aquat Anim, Changsha 410022, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Heterogeneous network; Protein-protein interaction; Essential proteins; CENTRALITY; INTEGRATION; IDENTIFICATION; PREDICTION; DATABASE;
D O I
10.1186/s12859-019-2930-2
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundEssential proteins are distinctly important for an organism's survival and development and crucial to disease analysis and drug design as well. Large-scale protein-protein interaction (PPI) data sets exist in Saccharomyces cerevisiae, which provides us with a valuable opportunity to predict identify essential proteins from PPI networks. Many network topology-based computational methods have been designed to detect essential proteins. However, these methods are limited by the completeness of available PPI data. To break out of these restraints, some computational methods have been proposed by integrating PPI networks and multi-source biological data. Despite the progress in the research of multiple data fusion, it is still challenging to improve the prediction accuracy of the computational methods.ResultsIn this paper, we design a novel iterative model for essential proteins prediction, named Randomly Walking in the Heterogeneous Network (RWHN). In RWHN, a weighted protein-protein interaction network and a domain-domain association network are constructed according to the original PPI network and the known protein-domain association network, firstly. And then, we establish a new heterogeneous matrix by combining the two constructed networks with the protein-domain association network. Based on the heterogeneous matrix, a transition probability matrix is established by normalized operation. Finally, an improved PageRank algorithm is adopted on the heterogeneous network for essential proteins prediction. In order to eliminate the influence of the false negative, information on orthologous proteins and the subcellular localization information of proteins are integrated to initialize the score vector of proteins. In RWHN, the topology, conservative and functional features of essential proteins are all taken into account in the prediction process. The experimental results show that RWHN obviously exceeds in predicting essential proteins ten other competing methods.ConclusionsWe demonstrated that integrating multi-source data into a heterogeneous network can preserve the complex relationship among multiple biological data and improve the prediction accuracy of essential proteins. RWHN, our proposed method, is effective for the prediction of essential proteins.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] An iteration method for identifying yeast essential proteins from heterogeneous network
    Bihai Zhao
    Yulin Zhao
    Xiaoxia Zhang
    Zhihong Zhang
    Fan Zhang
    Lei Wang
    [J]. BMC Bioinformatics, 20
  • [2] An Iteration Method for Identifying Yeast Essential Proteins From Weighted PPI Network Based on Topological and Functional Features of Proteins
    Li, Shiyuan
    Chen, Zhiping
    He, Xin
    Zhang, Zhen
    Pei, Tingrui
    Tan, Yihong
    Wang, Lei
    [J]. IEEE ACCESS, 2020, 8 : 90792 - 90804
  • [3] An iteration model for identifying essential proteins by combining comprehensive PPI network with biological information
    Shiyuan Li
    Zhen Zhang
    Xueyong Li
    Yihong Tan
    Lei Wang
    Zhiping Chen
    [J]. BMC Bioinformatics, 22
  • [4] An iteration model for identifying essential proteins by combining comprehensive PPI network with biological information
    Li, Shiyuan
    Zhang, Zhen
    Li, Xueyong
    Tan, Yihong
    Wang, Lei
    Chen, Zhiping
    [J]. BMC BIOINFORMATICS, 2021, 22 (01)
  • [5] A local average connectivity-based method for identifying essential proteins from the network level
    Li, Min
    Wang, Jianxin
    Chen, Xiang
    Wang, Huan
    Pan, Yi
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2011, 35 (03) : 143 - 150
  • [6] Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network
    He, Xin
    Kuang, Linai
    Chen, Zhiping
    Tan, Yihong
    Wang, Lei
    [J]. FRONTIERS IN GENETICS, 2021, 12
  • [8] A New Method for Identifying Essential Proteins Based on Network Topology Properties and Protein Complexes
    Qin, Chao
    Sun, Yongqi
    Dong, Yadong
    [J]. PLOS ONE, 2016, 11 (08):
  • [9] Refining Protein Interaction Network for Identifying Essential Proteins
    Zhang, Houwang
    Feng, Zhenan
    Wu, Chong
    [J]. CURRENT BIOINFORMATICS, 2023, 18 (03) : 255 - 265
  • [10] Identifying Essential Proteins in Dynamic PPI Network with Improved FOA
    Lei, X.
    Wang, S.
    Pan, L.
    [J]. INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2018, 13 (03) : 365 - 382