Identify Gram-negative bacterial secreted protein types by incorporating different modes of PSSM into Chou's general PseAAC via Kullback-Leibler divergence

被引:23
|
作者
Liang, Yunyun [1 ]
Zhang, Shengli [2 ]
机构
[1] Xian Polytech Univ, Sch Sci, Xian 710048, Shaanxi, Peoples R China
[2] Xidian Univ, Sch Math & Stat, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Secreted proteins; Position-specific scoring matrix; Correlation analysis; Nonnegative matrix factorization; Support vector machine; PREDICT SUBCELLULAR-LOCALIZATION; PSEUDO NUCLEOTIDE COMPOSITION; AMINO-ACID-COMPOSITION; ENSEMBLE CLASSIFIER; STRUCTURAL CLASSES; WEB SERVER; SITES; RNA; IDENTIFICATION; INFORMATION;
D O I
10.1016/j.jtbi.2018.05.035
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Gram-negative bacterial secreted proteins are crucial for bacterial pathogenesis by making bacteria interact with their environments. Therefore, identification of bacterial secreted proteins becomes a significant process for the research of various diseases and the corresponding drugs. In this paper, we develop a feature design model named ACCP-KL-NMF by fusing PSSM-based auto-cross correlation analysis for features extraction and nonnegative matrix factorization algorithm based on Kullback-Leibler divergence for dimensionality reduction. Hence, a 150-dimensional feature vector is constructed on the training set. Then support vector machine is adopted as the classifier, and the most objective jackknife test is chosen for evaluating the accuracy. The ACCP-KL-NMF model yields the approving performance of the overall accuracy on the test set, and also outperforms the other three existing models. The numerical experimental results show that our model is effective and reliable for identification of Gram-negative bacterial secreted protein types. Moreover, it is anticipated that the proposed model could be beneficial for other biology sequence in future research. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:22 / 29
页数:8
相关论文
共 4 条
  • [1] Gram-positive and Gram-negative protein subcellular localization by incorporating evolutionary-based descriptors into Chou's general PseAAC
    Dehzangi, Abdollah
    Heffernan, Rhys
    Sharma, Alok
    Lyons, James
    Paliwal, Kuldip
    Sattar, Abdul
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2015, 364 : 284 - 294
  • [2] Predict Gram-Positive and Gram-Negative Subcellular Localization via Incorporating Evolutionary Information and Physicochemical Features Into Chou's General PseAAC
    Sharma, Ronesh
    Dehzangi, Abdollah
    Lyons, James
    Paliwal, Kuldip
    Tsunoda, Tatsuhiko
    Sharma, Alok
    [J]. IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2015, 14 (08) : 915 - 926
  • [3] An ensemble method for multi-type Gram-negative bacterial secreted protein prediction by integrating different PSSM-based features
    Kong, L.
    Zhang, L.
    [J]. SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2019, 30 (03) : 181 - 194
  • [4] Unb-DPC: Identify mycobacterial membrane protein types by incorporating un-biased dipeptide composition into Chou's general PseAAC
    Khan, Muslim
    Hayat, Maqsood
    Khan, Sher Afzal
    Iqbal, Nadeem
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2017, 415 : 13 - 19