共 4 条
Identify Gram-negative bacterial secreted protein types by incorporating different modes of PSSM into Chou's general PseAAC via Kullback-Leibler divergence
被引:23
|作者:
Liang, Yunyun
[1
]
Zhang, Shengli
[2
]
机构:
[1] Xian Polytech Univ, Sch Sci, Xian 710048, Shaanxi, Peoples R China
[2] Xidian Univ, Sch Math & Stat, Xian 710071, Shaanxi, Peoples R China
基金:
中国国家自然科学基金;
关键词:
Secreted proteins;
Position-specific scoring matrix;
Correlation analysis;
Nonnegative matrix factorization;
Support vector machine;
PREDICT SUBCELLULAR-LOCALIZATION;
PSEUDO NUCLEOTIDE COMPOSITION;
AMINO-ACID-COMPOSITION;
ENSEMBLE CLASSIFIER;
STRUCTURAL CLASSES;
WEB SERVER;
SITES;
RNA;
IDENTIFICATION;
INFORMATION;
D O I:
10.1016/j.jtbi.2018.05.035
中图分类号:
Q [生物科学];
学科分类号:
07 ;
0710 ;
09 ;
摘要:
Gram-negative bacterial secreted proteins are crucial for bacterial pathogenesis by making bacteria interact with their environments. Therefore, identification of bacterial secreted proteins becomes a significant process for the research of various diseases and the corresponding drugs. In this paper, we develop a feature design model named ACCP-KL-NMF by fusing PSSM-based auto-cross correlation analysis for features extraction and nonnegative matrix factorization algorithm based on Kullback-Leibler divergence for dimensionality reduction. Hence, a 150-dimensional feature vector is constructed on the training set. Then support vector machine is adopted as the classifier, and the most objective jackknife test is chosen for evaluating the accuracy. The ACCP-KL-NMF model yields the approving performance of the overall accuracy on the test set, and also outperforms the other three existing models. The numerical experimental results show that our model is effective and reliable for identification of Gram-negative bacterial secreted protein types. Moreover, it is anticipated that the proposed model could be beneficial for other biology sequence in future research. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:22 / 29
页数:8
相关论文