Predict Gram-Positive and Gram-Negative Subcellular Localization via Incorporating Evolutionary Information and Physicochemical Features Into Chou's General PseAAC

被引:58
|
作者
Sharma, Ronesh [1 ,2 ]
Dehzangi, Abdollah [3 ]
Lyons, James [4 ]
Paliwal, Kuldip [4 ]
Tsunoda, Tatsuhiko [5 ,6 ,7 ]
Sharma, Alok [8 ,9 ]
机构
[1] Fiji Natl Univ, Sch Elect & Elect Engn, Suva, Fiji
[2] Univ S Pacific, Sch Phys & Engn, Suva, Fiji
[3] Univ Iowa, Med Res Ctr, Dept Psychiat, Iowa City, IA 52242 USA
[4] Griffith Univ, Sch Engn, Brisbane, Qld 4111, Australia
[5] RIKEN, Ctr Integrat Med Sci, Yokohama, Kanagawa 2300045, Japan
[6] JST, CREST, Yokohama, Kanagawa 2300045, Japan
[7] Tokyo Med & Dent Univ, Med Res Inst, Tokyo 1138510, Japan
[8] Griffith Univ, Inst Integrated & Intelligent Syst, Brisbane, Qld 4111, Australia
[9] RIKEN, Yokohama, Kanagawa 2300045, Japan
关键词
Evolutionary-based features; normalized PSSM; AMINO-ACID-COMPOSITION; LINEAR DISCRIMINANT-ANALYSIS; SEQUENCE-BASED PREDICTOR; MULTI-LABEL CLASSIFIER; PROTEIN STRUCTURAL CLASS; SUPPORT VECTOR MACHINES; TOP-DOWN APPROACH; ENSEMBLE CLASSIFIER; LEARNING CLASSIFIER; SINGLE;
D O I
10.1109/TNB.2015.2500186
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In this study, we used structural and evolutionary based features to represent the sequences of gram-positive and gram-negative subcellular localizations. To do this, we proposed a normalization method to construct a normalize Position Specific Scoring Matrix (PSSM) using the information from original PSSM. To investigate the effectiveness of the proposed method we compute feature vectors from normalize PSSM and by applying support vector machine (SVM) and naive Bayes classifier, respectively, we compared achieved results with the previously reported results. We also computed features from original PSSM and normalized PSSM and compared their results. The archived results show enhancement in gram-positive and gram-negative subcellular localizations. Evaluating localization for each feature, our results indicate that employing SVM and concatenating features (amino acid composition feature, Dubchak feature (physicochemical-based features), normalized PSSM based auto-covariance feature and normalized PSSM based bigram feature) have higher accuracy while employing naive Bayes classifier with normalized PSSM based auto-covariance feature proves to have high sensitivity for both benchmarks. Our reported results in terms of overall locative accuracy is 84.8% and overall absolute accuracy is 85.16% for gram-positive dataset; and, for gram-negative dataset, overall locative accuracy is 85.4% and overall absolute accuracy is 86.3%.
引用
收藏
页码:915 / 926
页数:12
相关论文
共 50 条
  • [1] Gram-positive and Gram-negative protein subcellular localization by incorporating evolutionary-based descriptors into Chou's general PseAAC
    Dehzangi, Abdollah
    Heffernan, Rhys
    Sharma, Alok
    Lyons, James
    Paliwal, Kuldip
    Sattar, Abdul
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2015, 364 : 284 - 294
  • [2] Gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features
    Dehzangi, Abdollah
    Sohrabi, Sohrab
    Heffernan, Rhys
    Sharma, Alok
    Lyons, James
    Paliwal, Kuldip
    Sattar, Abdul
    [J]. BMC BIOINFORMATICS, 2015, 16
  • [3] Gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features
    Abdollah Dehzangi
    Sohrab Sohrabi
    Rhys Heffernan
    Alok Sharma
    James Lyons
    Kuldip Paliwal
    Abdul Sattar
    [J]. BMC Bioinformatics, 16
  • [4] Gram-Positive and Gram-Negative Subcellular Localization Using Rotation Forest and Physicochemical-Based Features
    Dehzangi, Abdollah
    Heffernan, Rhys
    Lyons, James
    Sharma, Alok
    Paliwal, Kuldip
    Sattar, Abdul
    [J]. PATTERN RECOGNITION IN BIOINFORMATICS, PRIB 2014, 2014, 8626 : 112 - 113
  • [5] Identification of protein subcellular localization via integrating evolutionary and physicochemical information into Chou's general PseAAC
    Shen, Yinan
    Tang, Jijun
    Guo, Fei
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2019, 462 : 230 - 239
  • [6] pLoc-mGneg: Predict subcellular localization of Gram-negative bacterial proteins by deep gene ontology learning via general PseAAC
    Cheng, Xiang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. GENOMICS, 2018, 110 (04) : 231 - 239
  • [7] Gram-LocEN: Interpretable prediction of subcellular multi-localization of Gram-positive and Gram-negative bacterial proteins
    Wan, Shibiao
    Mak, Man-Wai
    Kung, Sun-Yuan
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2017, 162 : 1 - 9
  • [8] Use of Chou's 5-steps rule to predict the subcellular localization of gram-negative and gram-positive bacterial proteins by multilabel learning based on gene ontology annotation and profile alignment
    Bouziane, Hafida
    Chouarfia, Abdallah
    [J]. JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2021, 18 (01) : 51 - 79
  • [9] Identify Gram-negative bacterial secreted protein types by incorporating different modes of PSSM into Chou's general PseAAC via Kullback-Leibler divergence
    Liang, Yunyun
    Zhang, Shengli
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2018, 454 : 22 - 29
  • [10] ELECTRON MICROSCOPIC FEATURES OF GRAM-NEGATIVE + GRAM-POSITIVE BACTERIA EMBEDDED IN PHOSPHOTUNGSTATE
    ZWILLENBERG, LO
    [J]. ANTONIE VAN LEEUWENHOEK JOURNAL OF MICROBIOLOGY AND SEROLOGY, 1964, 30 (02): : 154 - &