Identification of Protein Pupylation Sites Using Bi-Profile Bayes Feature Extraction and Ensemble Learning

被引:10
|
作者
Zhao, Xiaowei [1 ,2 ]
Zhang, Jian [1 ]
Ning, Qiao [1 ]
Sun, Pingping [1 ]
Ma, Zhiqiang [1 ]
Yin, Minghao [2 ]
机构
[1] Northeast Normal Univ, Coll Comp Sci & Informat Technol, Changchun 130117, Peoples R China
[2] Northeast Normal Univ, Key Lab Intelligent Informat Proc Jilin Univ, Changchun 130117, Peoples R China
关键词
COMPUTATIONAL IDENTIFICATION; CD-HIT; PREDICTION; UBIQUITIN; PUP; UBIQUITYLATION; CLASSIFIER; ALGORITHM; PLOC;
D O I
10.1155/2013/283129
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Pupylation, one of the most important posttranslational modifications of proteins, typically takes place when prokaryotic ubiquitin-like protein (Pup) is attached to specific lysine residues on a target protein. Identification of pupylation substrates and their corresponding sites will facilitate the understanding of the molecular mechanism of pupylation. Comparing with the labor-intensive and time-consuming experiment approaches, computational prediction of pupylation sites is much desirable for their convenience and fast speed. In this study, a new bioinformatics tool named EnsemblePup was developed that used an ensemble of support vector machine classifiers to predict pupylation sites. The highlight of EnsemblePup was to utilize the Bi-profile Bayes feature extraction as the encoding scheme. The performance of EnsemblePup was measured with a sensitivity of 79.49%, a specificity of 82.35%, an accuracy of 85.43%, and a Matthews correlation coefficient of 0.617 using the 5-fold cross validation on the training dataset. When compared with other existing methods on a benchmark dataset, the EnsemblePup provided better predictive performance, with a sensitivity of 80.00%, a specificity of 83.33%, an accuracy of 82.00%, and a Matthews correlation coefficient of 0.629. The experimental results suggested that EnsemblePup presented here might be useful to identify and annotate potential pupylation sites in proteins of interest. A web server for predicting pupylation sites was developed.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Computational Identification of Protein Methylation Sites through Bi-Profile Bayes Feature Extraction
    Shao, Jianlin
    Xu, Dong
    Tsai, Sau-Na
    Wang, Yifei
    Ngai, Sai-Ming
    [J]. PLOS ONE, 2009, 4 (03):
  • [2] Identification of Cancerlectins by Split Bi-profile Bayes Feature Extraction
    Zuo, Yun
    Jia, Cangzhi
    Li, Taoying
    Chen, Yan
    [J]. CURRENT PROTEOMICS, 2018, 15 (03) : 196 - 200
  • [3] Predicting lysine glycation sites using bi-profile bayes feature extraction
    Ju, Zhe
    Sun, Juhe
    Li, Yanjie
    Wang, Li
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2017, 71 : 98 - 103
  • [4] Prediction of mitochondrial proteins of malaria parasite using bi-profile Bayes feature extraction
    Jia, Cangzhi
    Liu, Tian
    Chang, Alan K.
    Zhai, Yingying
    [J]. BIOCHIMIE, 2011, 93 (04) : 778 - 782
  • [5] Predicting lysine lipoylation sites using bi-profile bayes feature extraction and fuzzy support vector machine algorithm
    Ju, Zhe
    Wang, Shi-Yun
    [J]. ANALYTICAL BIOCHEMISTRY, 2018, 561 : 11 - 17
  • [6] OH-PRED: prediction of protein hydroxylation sites by incorporating adapted normal distribution bi-profile Bayes feature extraction and physicochemical properties of amino acids
    Jia, Cang-Zhi
    He, Wen-Ying
    Yao, Yu-Hua
    [J]. JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2017, 35 (04): : 829 - 835
  • [7] Predicting lysine lipoylation sites using bi-profile bayes feature extraction and fuzzy support vector machine algorithm ( vol 561, pg 11, 2024)
    Ju, Zhe
    Wang, Shi-Yun
    [J]. ANALYTICAL BIOCHEMISTRY, 2024, 695
  • [8] Identify Lysine Neddylation Sites Using Bi-profile Bayes Feature Extraction via the Chou's 5-steps Rule and General Pseudo Components
    Ju, Zhe
    Wang, Shi-Yun
    [J]. CURRENT GENOMICS, 2019, 20 (08) : 592 - 601
  • [9] CIPPN: computational identification of protein pupylation sites by using neural network
    Bao, Wenzheng
    You, Zhu-Hong
    Huang, De-Shuang
    [J]. ONCOTARGET, 2017, 8 (65) : 108867 - 108879
  • [10] Prediction of Protein S-Nitrosylation Sites Based on Adapted Normal Distribution Bi-Profile Bayes and Chou's Pseudo Amino Acid Composition
    Jia, Cangzhi
    Lin, Xin
    Wang, Zhiping
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2014, 15 (06) : 10410 - 10423