OH-PRED: prediction of protein hydroxylation sites by incorporating adapted normal distribution bi-profile Bayes feature extraction and physicochemical properties of amino acids

被引:17
|
作者
Jia, Cang-Zhi [1 ]
He, Wen-Ying [1 ]
Yao, Yu-Hua [2 ]
机构
[1] Dalian Maritime Univ, Dept Math, 1 Linghai Rd, Dalian 116026, Peoples R China
[2] Zhejiang Sci Tech Univ, Coll Life Sci, Xiasha Higher Educ Zone, 5 Second Ave, Hangzhou 310018, Zhejiang, Peoples R China
来源
关键词
protein hydroxylation; physicochemical properties; ANBPB; SVM; HYDROXYPROLINE; IDENTIFICATION; INFORMATION; AAINDEX; DOMAIN;
D O I
10.1080/07391102.2016.1163294
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Hydroxylation of proline or lysine residues in proteins is a common post-translational modification event, and such modifications are found in many physiological and pathological processes. Nonetheless, the exact molecular mechanism of hydroxylation remains under investigation. Because experimental identification of hydroxylation is time-consuming and expensive, bioinformatics tools with high accuracy represent desirable alternatives for large-scale rapid identification of protein hydroxylation sites. In view of this, we developed a supporter vector machine-based tool, OH-PRED, for the prediction of protein hydroxylation sites using the adapted normal distribution bi-profile Bayes feature extraction in combination with the physicochemical property indexes of the amino acids. In a jackknife cross validation, OH-PRED yields an accuracy of 91.88% and a Matthew's correlation coefficient (MCC) of 0.838 for the prediction of hydroxyproline sites, and yields an accuracy of 97.42% and a MCC of 0.949 for the prediction of hydroxylysine sites. These results demonstrate that OH-PRED increased significantly the prediction accuracy of hydroxyproline and hydroxylysine sites by 7.37 and 14.09%, respectively, when compared with the latest predictor PredHydroxy. In independent tests, OH-PRED also outperforms previously published methods.
引用
收藏
页码:829 / 835
页数:7
相关论文
共 3 条
  • [1] Prediction of Protein S-Nitrosylation Sites Based on Adapted Normal Distribution Bi-Profile Bayes and Chou's Pseudo Amino Acid Composition
    Jia, Cangzhi
    Lin, Xin
    Wang, Zhiping
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2014, 15 (06) : 10410 - 10423
  • [2] Computational Identification of Protein Methylation Sites through Bi-Profile Bayes Feature Extraction
    Shao, Jianlin
    Xu, Dong
    Tsai, Sau-Na
    Wang, Yifei
    Ngai, Sai-Ming
    [J]. PLOS ONE, 2009, 4 (03):
  • [3] Identification of Protein Pupylation Sites Using Bi-Profile Bayes Feature Extraction and Ensemble Learning
    Zhao, Xiaowei
    Zhang, Jian
    Ning, Qiao
    Sun, Pingping
    Ma, Zhiqiang
    Yin, Minghao
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2013, 2013