Accurate prediction of protein structural classes by incorporating PSSS and PSSM into Chou's general PseAAC

被引:32
|
作者
Zhang, Shengli [1 ]
机构
[1] Xidian Univ, Sch Math & Stat, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
Protein structural class prediction; Secondary structure sequences; Support vector machine; Transition probability matrix; Pseudo-position specific scoring matrix; AMINO-ACID-COMPOSITION; SEQUENCE-BASED PREDICTOR; SECONDARY STRUCTURE PREDICTION; LOW-SIMILARITY SEQUENCES; PSI-BLAST; INFORMATION; FORM; CLASSIFIER; HOMOLOGY; ATTRIBUTES;
D O I
10.1016/j.chemolab.2015.01.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Determination of protein structural class using a fast and suitable computational method has become a hot issue in protein science. Prediction of protein structural class for low-similarity sequences remains a challenge problem. In this study, a 111-dimensional feature vector is constructed to predict protein structural classes. Among the 111 features, 100 features based on pseudo-position specific scoring matrix (PsePSSM) are selected to reflect the evolutionary information and the sequence-order information, and the other 11 rational features based on predicted protein secondary structure sequences (PSSS) are designed in the previous works. To evaluate the performance of the proposed method (named by PSSS-PsePSSM), jackknife cross-validation tests are performed on three widely used benchmark datasets: 1189, 25PDB and 640. Our method achieves competitive performance on prediction accuracies, especially for the overall prediction accuracies for datasets 1189, 25PDB and 640, which reach 86.6%, 89.5% and 81.0%, respectively. The PSSS-PsePSSM algorithm also outperforms other existing methods, indicating that our proposed method is a cost-effective computational tool for protein structural class prediction. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:28 / 35
页数:8
相关论文
共 50 条