Improving enzyme regulatory protein classification by means of SVM-RFE feature selection

被引:17
|
作者
Fernandez-Lozano, Carlos [1 ]
Fernandez-Blanco, Enrique [1 ]
Dave, Kirtan [2 ]
Pedreira, Nieves [1 ]
Gestal, Marcos [1 ]
Dorado, Julian [1 ]
Munteanu, Cristian R. [1 ]
机构
[1] Univ A Coruna, Dept Informat & Commun Technol, Fac Comp Sci, La Coruna 15071, Spain
[2] Sardar Patel Univ, GH Patel PG Dept Comp Sci & Technol, Vallabh Vidyanagar 388120, Gujarat, India
关键词
SUPPORT VECTOR MACHINES; COMPUTATIONAL CHEMISTRY; WEB SERVER; B INHIBITORS; MARCH-INSIDE; QSAR MODELS; 3D; RECOGNITION; DISCOVERY; DRUGS;
D O I
10.1039/c3mb70489k
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Enzyme regulation proteins are very important due to their involvement in many biological processes that sustain life. The complexity of these proteins, the impossibility of identifying direct quantification molecular properties associated with the regulation of enzymatic activities, and their structural diversity creates the necessity for new theoretical methods that can predict the enzyme regulatory function of new proteins. The current work presents the first classification model that predicts protein enzyme regulators using the Markov mean properties. These protein descriptors encode the topological information of the amino acid into contact networks based on amino acid distances and physicochemical properties. MInD-Prot software calculated these molecular descriptors for 2415 protein chains (350 enzyme regulators) using five atom physicochemical properties (Mulliken electronegativity, Kang-Jhon polarizability, vdW area, atom contribution to P) and the protein 3D regions. The best classification models to predict enzyme regulators have been obtained with machine learning algorithms from Weka using 18 features. K-star has been demonstrated to be the most accurate algorithm for this protein function classification. Wrapper Subset Evaluator and SVM-RFE approaches were used to perform a feature subset selection with the best results obtained from SVM-RFE. Classification performance employing all the available features can be reached using only the 8 most relevant features selected by SVM-RFE. Thus, the current work has demonstrated the possibility of predicting new molecular targets involved in enzyme regulation using fast theoretical algorithms.
引用
收藏
页码:1063 / 1071
页数:9
相关论文
共 50 条
  • [41] MSVM-RFE: extensions of SVM-RFE for multiclass gene selection on DNA microarray data
    Zhou, Xin
    Tuck, David P.
    BIOINFORMATICS, 2007, 23 (09) : 1106 - 1114
  • [42] Granular SVM-RFE gene selection algorithm for reliable prostate cancer classification on microarray expression data
    Tang, YC
    Zhang, YQ
    Huang, Z
    Hu, XH
    BIBE 2005: 5TH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, 2005, : 290 - 293
  • [43] AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM
    Sejong Yoon
    Saejoon Kim
    BMC Medical Informatics and Decision Making, 9
  • [44] AdaBoost-Based Multiple SVM-RFE for Classification of Mammograms in DDSM
    Yoon, Sejong
    Kim, Saejoon
    2008 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS, PROCEEDINGS, 2008, : 75 - 82
  • [45] AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM
    Yoon, Sejong
    Kim, Saejoon
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2009, 9
  • [46] Improved Automatic Filtering Algorithm for Imbalanced Classification based on SVM-RFE
    Li, Xiaoqiang
    Shao, Qing
    Wang, Jingjing
    2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,
  • [47] Support Vector Machine - Recursive Feature Elimination (SVM-RFE) for Selection of MicroRNA Expression Features of Breast Cancer
    Adorada, Amazona
    Permatasari, Ratih
    Wirawan, Panji Wisnu
    Wibowo, Adi
    Sujiwo, Adi
    2018 2ND INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS), 2018, : 165 - 168
  • [48] Feature reduction using SVM-RFE technique to detect autism spectrum disorder
    Mohan, Priya
    Paramasivam, Ilango
    EVOLUTIONARY INTELLIGENCE, 2021, 14 (02) : 989 - 997
  • [49] Selecting Feature Subsets Based on SVM-RFE and the Overlapping Ratio with Applications in Bioinformatics
    Lin, Xiaohui
    Li, Chao
    Zhang, Yanhui
    Su, Benzhe
    Fan, Meng
    Wei, Hai
    MOLECULES, 2018, 23 (01):
  • [50] Feature reduction using SVM-RFE technique to detect autism spectrum disorder
    Priya Mohan
    Ilango Paramasivam
    Evolutionary Intelligence, 2021, 14 : 989 - 997