Application of Intelligent Techniques for Classification of Bacteria Using Protein Sequence-Derived Features

被引:5
|
作者
Banerjee, Amit Kumar [1 ]
Ravi, Vadlamani [2 ]
Murty, U. S. N. [1 ]
Sengupta, Neelava [1 ]
Karuna, Batepatti [1 ]
机构
[1] Indian Inst Chem Technol CSIR, Div Biol, Bioinformat Grp, Hyderabad, Andhra Pradesh, India
[2] Inst Dev & Res Banking Technol IDBRT, Hyderabad, Andhra Pradesh, India
关键词
Histidine kinase; Classification; Datamining; Physicochemical property; Support vector machine; Radial basis function; MACHINE-LEARNING APPROACH; HISTIDINE KINASE; PHYSICOCHEMICAL PROPERTIES;
D O I
10.1007/s12010-013-0268-1
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Standard molecular experimental methodologies and mathematical procedures often fail to answer many phylogeny and classification related issues. Modern artificial intelligent-based techniques, such as radial basis function, genetic algorithm, artificial neural network, and support vector machines are of ample potential in this regard. Reliance on a large number of essential parameters will aid in enhanced robustness, reliability, and better accuracy as opposed to single molecular parameter. This study was conducted with dataset of computed protein physicochemical properties belonging to 20 different bacterial genera. A total of 57 sequential and structural parameters derived from protein sequences were considered for the initial classification. Feature selection based techniques were employed to find out the most important features influencing the dataset. Various amino acids, hydrophobicity, relative sulfur percentage, and codon number were selected as important parameters during the study. Comparative analyses were performed applying RapidMiner data mining platform. Support vector machine proved to be the best method with maximum accuracy of more than 91 %.
引用
收藏
页码:1263 / 1281
页数:19
相关论文
共 50 条
  • [1] Application of Intelligent Techniques for Classification of Bacteria Using Protein Sequence-Derived Features
    Amit Kumar Banerjee
    Vadlamani Ravi
    U. S. N. Murty
    Neelava Sengupta
    Batepatti Karuna
    [J]. Applied Biochemistry and Biotechnology, 2013, 170 : 1263 - 1281
  • [2] An improved classification of G-protein-coupled receptors using sequence-derived features
    Peng, Zhen-Ling
    Yang, Jian-Yi
    Chen, Xin
    [J]. BMC BIOINFORMATICS, 2010, 11
  • [3] An improved classification of G-protein-coupled receptors using sequence-derived features
    Zhen-Ling Peng
    Jian-Yi Yang
    Xin Chen
    [J]. BMC Bioinformatics, 11
  • [4] A statistical model for improved membrane protein expression using sequence-derived features
    Saladi, Shyam M.
    Javed, Nauman
    Muller, Axel
    Clemons, William M., Jr.
    [J]. JOURNAL OF BIOLOGICAL CHEMISTRY, 2018, 293 (13) : 4913 - 4927
  • [5] Protein fold recognition using sequence-derived predictions
    Fischer, D
    Eisenberg, D
    [J]. PROTEIN SCIENCE, 1996, 5 (05) : 947 - 955
  • [6] Predicting Membrane Protein Expression in Yeast from Sequence-Derived Features
    Schulte, Samuel J.
    Saladi, Shyam
    Clemons, William M.
    [J]. BIOPHYSICAL JOURNAL, 2017, 112 (03) : 355A - 356A
  • [7] Prediction of Bacterial sRNAs Using Sequence-Derived Features and Machine Learning
    Jha, Tony
    Mendel, Jovinna
    Cho, Hyuk
    Choudhary, Madhusudan
    [J]. BIOINFORMATICS AND BIOLOGY INSIGHTS, 2022, 16
  • [8] Prediction of Bacterial sRNAs Using Sequence-Derived Features and Machine Learning
    Jha, Tony
    Mendel, Jovinna
    Cho, Hyuk
    Choudhary, Madhusudan
    [J]. BIOINFORMATICS AND BIOLOGY INSIGHTS, 2022, 16
  • [9] An Evaluation of Machine Learning Approaches for the Prediction of Essential Genes in Eukaryotes Using Protein Sequence-Derived Features
    Campos, Tulio L.
    Korhonen, Pasi K.
    Gasser, Robin B.
    Young, Neil D.
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2019, 17 : 785 - 796
  • [10] Transmembrane region prediction by using sequence-derived features and machine learning methods
    Yan, Renxiang
    Wang, Xiaofeng
    Huang, Lanqing
    Tian, Yarong
    Cai, Weiwen
    [J]. RSC ADVANCES, 2017, 7 (46) : 29200 - 29211