Improving enzyme regulatory protein classification by means of SVM-RFE feature selection

被引:17
|
作者
Fernandez-Lozano, Carlos [1 ]
Fernandez-Blanco, Enrique [1 ]
Dave, Kirtan [2 ]
Pedreira, Nieves [1 ]
Gestal, Marcos [1 ]
Dorado, Julian [1 ]
Munteanu, Cristian R. [1 ]
机构
[1] Univ A Coruna, Dept Informat & Commun Technol, Fac Comp Sci, La Coruna 15071, Spain
[2] Sardar Patel Univ, GH Patel PG Dept Comp Sci & Technol, Vallabh Vidyanagar 388120, Gujarat, India
关键词
SUPPORT VECTOR MACHINES; COMPUTATIONAL CHEMISTRY; WEB SERVER; B INHIBITORS; MARCH-INSIDE; QSAR MODELS; 3D; RECOGNITION; DISCOVERY; DRUGS;
D O I
10.1039/c3mb70489k
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Enzyme regulation proteins are very important due to their involvement in many biological processes that sustain life. The complexity of these proteins, the impossibility of identifying direct quantification molecular properties associated with the regulation of enzymatic activities, and their structural diversity creates the necessity for new theoretical methods that can predict the enzyme regulatory function of new proteins. The current work presents the first classification model that predicts protein enzyme regulators using the Markov mean properties. These protein descriptors encode the topological information of the amino acid into contact networks based on amino acid distances and physicochemical properties. MInD-Prot software calculated these molecular descriptors for 2415 protein chains (350 enzyme regulators) using five atom physicochemical properties (Mulliken electronegativity, Kang-Jhon polarizability, vdW area, atom contribution to P) and the protein 3D regions. The best classification models to predict enzyme regulators have been obtained with machine learning algorithms from Weka using 18 features. K-star has been demonstrated to be the most accurate algorithm for this protein function classification. Wrapper Subset Evaluator and SVM-RFE approaches were used to perform a feature subset selection with the best results obtained from SVM-RFE. Classification performance employing all the available features can be reached using only the 8 most relevant features selected by SVM-RFE. Thus, the current work has demonstrated the possibility of predicting new molecular targets involved in enzyme regulation using fast theoretical algorithms.
引用
收藏
页码:1063 / 1071
页数:9
相关论文
共 50 条
  • [31] Absolute cosine-based SVM-RFE feature selection method for prostate histopathological grading
    Sahran, Shahnorbanun
    Albashish, Dheeb
    Abdullah, Azizi
    Abd Shukor, Nordashima
    Pauzi, Suria Hayati Md
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2018, 87 : 78 - 90
  • [32] On feature selection and blast furnace temperature tendency prediction in hot metal based on SVM-RFE
    Wang, Yi-Kang
    Liu, Xue-Yi
    Zhang, Bao-Lin
    2018 AUSTRALIAN & NEW ZEALAND CONTROL CONFERENCE (ANZCC), 2018, : 371 - 376
  • [33] Improving the Performance of SVM-RFE to Select Genes in Microarray Data
    Yuanyuan Ding
    Dawn Wilkins
    BMC Bioinformatics, 7
  • [34] Hepatitis Detection using Random Forest based on SVM-RFE (Recursive Feature Elimination) Feature Selection and SMOTE
    Krisnabayu, Rifky Yunus
    Ridok, Achmad
    Budi, Agung Setia
    PROCEEDINGS OF 2021 INTERNATIONAL CONFERENCE ON SUSTAINABLE INFORMATION ENGINEERING AND TECHNOLOGY, SIET 2021, 2021, : 151 - 156
  • [35] Improving the performance of SVM-RFE to select genes in microarray data
    Ding, Yuanyuan
    Wilkins, Dawn
    BMC BIOINFORMATICS, 2006, 7 (Suppl 2)
  • [36] ECoG classification based on band power normalization and SVM-RFE
    Liu, Chong
    Zhao, Haibin
    Li, Chunsheng
    Wang, Hong
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2011, 32 (03): : 534 - 539
  • [37] An Improved SVM-RFE Based on F-Statistic and mPDC for Gene Selection in Cancer Classification
    Luo, Kangyang
    Wang, Guoqiang
    Li, Qian
    Tao, Jiyuan
    IEEE ACCESS, 2019, 7 : 147617 - 147628
  • [38] SVM-RFE-ED: A Novel SVM-RFE based on Energy Distance for Gene Selection and Cancer Diagnosis
    Medjahed, Seyyid Ahmed
    Ouali, Mohammed
    COMPUTACION Y SISTEMAS, 2018, 22 (02): : 675 - 683
  • [39] Arabic Named Entity Recognition on Social Media based on feature selection techniques using SVM-RFE
    Ali, Brahim Ait Ben
    Mihi, Soukaina
    Bazi, Ismail El
    Laachfoubi, Nahil
    2020 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS), 2020,
  • [40] Feature selection and analysis of single lateral damper fault based on SVM-RFE with correlation bias reduction
    Tang Daochao
    Jin Weidong
    Qin Na
    Li Hui
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 3830 - 3835