United Neighborhood Closeness Centrality and Orthology for Predicting Essential Proteins

被引:36
|
作者
Li, Gaoshi [1 ,2 ]
Li, Min [1 ]
Wang, Jianxin [1 ]
Li, Yaohang [3 ]
Pan, Yi [4 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Peoples R China
[2] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin 541004, Guangxi, Peoples R China
[3] Old Dominion Univ, Dept Comp Sci, Norfolk, VA USA
[4] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30302 USA
基金
中国国家自然科学基金;
关键词
Neighborhood closeness centrality; essential proteins; PPI network; orthologous; EPOC; IDENTIFYING ESSENTIAL PROTEINS; INTERACTION NETWORK; ESSENTIAL GENES; DATABASE; IDENTIFICATION; DETERMINANTS; ANNOTATION; COMPLEXES; ALGORITHM;
D O I
10.1109/TCBB.2018.2889978
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Identifying essential proteins plays an important role in disease study, drug design, and understanding the minimal requirement for cellular life. Computational methods for essential proteins discovery overcome the disadvantages of biological experimental methods that are often time-consuming, expensive, and inefficient. The topological features of protein-protein interaction (PPI) networks are often used to design computational prediction methods, such as Degree Centrality (DC), Betweenness Centrality (BC), Closeness Centrality (CC), Subgraph Centrality (SC), Eigenvector Centrality (EC), Information Centrality (IC), and Neighborhood Centrality (NC). However, the prediction accuracies of these individual methods still have space to be improved. Studies show that additional information, such as orthologous relations, helps discover essential proteins. Many researchers have proposed different methods by combining multiple information sources to gain improvement of prediction accuracy. In this study, we find that essential proteins appear in triangular structure in PPI network significantly more often than nonessential ones. Based on this phenomenon, we propose a novel pure centrality measure, so-called Neighborhood Closeness Centrality (NCC). Accordingly, we develop a new combination model, Extended Pareto Optimality Consensus model, named EPOC, to fuse NCC and Orthology information and a novel essential proteins identification method, NCCO, is fully proposed. Compared with seven existing classic centrality methods (DC, BC, IC, CC, SC, EC, and NC) and three consensus methods (PeC, ION, and CSC), our results on S.cerevisiae and E.coli datasets show that NCCO has clear advantages. As a consensus method, EPOC also yields better performance than the random walk model.
引用
收藏
页码:1451 / 1458
页数:8
相关论文
共 42 条
  • [1] Predicting essential proteins based on subcellular localization, orthology and PPI networks
    Gaoshi Li
    Min Li
    Jianxin Wang
    Jingli Wu
    Fang-Xiang Wu
    Yi Pan
    [J]. BMC Bioinformatics, 17
  • [2] Predicting essential proteins based on subcellular localization, orthology and PPI networks
    Li, Gaoshi
    Li, Min
    Wang, Jianxin
    Wu, Jingli
    Wu, Fang-Xiang
    Pan, Yi
    [J]. BMC BIOINFORMATICS, 2016, 17
  • [3] Predicting Essential Proteins Based on Weighted Degree Centrality
    Tang, Xiwei
    Wang, Jianxin
    Zhong, Jiancheng
    Pan, Yi
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2014, 11 (02) : 407 - 418
  • [4] Predicting essential proteins by integrating orthology, gene expressions, and PPI networks
    Zhang, Xue
    Xiao, Wangxin
    Hu, Xihao
    [J]. PLOS ONE, 2018, 13 (04):
  • [5] Essential proteins discovery based on dominance relationship and neighborhood similarity centrality
    Li, Gaoshi
    Luo, Xinlong
    Hu, Zhipeng
    Wu, Jingli
    Peng, Wei
    Liu, Jiafei
    Zhu, Xiaoshu
    [J]. HEALTH INFORMATION SCIENCE AND SYSTEMS, 2023, 11 (01)
  • [6] Essential proteins discovery based on dominance relationship and neighborhood similarity centrality
    Gaoshi Li
    Xinlong Luo
    Zhipeng Hu
    Jingli Wu
    Wei Peng
    Jiafei Liu
    Xiaoshu Zhu
    [J]. Health Information Science and Systems, 11
  • [7] A Novel Method for Predicting Essential Proteins Based on Subcellular Localization, Orthology and PPI Networks
    Li, Gaoshi
    Li, Min
    Wang, Jianxin
    Pan, Yi
    [J]. BIOINFORMATICS RESEARCH AND APPLICATIONS (ISBRA 2015), 2015, 9096 : 427 - 428
  • [8] United Complex Centrality for Identification of Essential Proteins from PPI Networks
    Li, Min
    Lu, Yu
    Niu, Zhibei
    Wu, Fang-Xiang
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2017, 14 (02) : 370 - 380
  • [9] Iteration method for predicting essential proteins based on orthology and protein-protein interaction networks
    Peng, Wei
    Wang, Jianxin
    Wang, Weiping
    Liu, Qing
    Wu, Fang-Xiang
    Pan, Yi
    [J]. BMC SYSTEMS BIOLOGY, 2012, 6
  • [10] Predicting Essential Proteins Based on Second-Order Neighborhood Information and Information Entropy
    Zhao, Jie
    Lei, Xiujuan
    [J]. IEEE ACCESS, 2019, 7 : 136012 - 136022