Selecting Feature Subsets Based on SVM-RFE and the Overlapping Ratio with Applications in Bioinformatics

Cited by: 79
Authors
Lin, Xiaohui [1 ]
Li, Chao [1 ]
Zhang, Yanhui [1 ]
Su, Benzhe [1 ]
Fan, Meng [1 ]
Wei, Hai [1 ]
Affiliations
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 116024, Peoples R China
Source
MOLECULES | 2018 / Vol. 23 / Issue 01
Funding
National Natural Science Foundation of China;
Keywords
SVM-RFE; overlapping degree; feature selection; GENE SELECTION; CANCER CLASSIFICATION; BIOMARKER DISCOVERY; TUMOR; PREDICTION; SYSTEM;
DOI
10.3390/molecules23010052
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Subject classification codes
071010 ; 081704 ;
Abstract
Feature selection is an important topic in bioinformatics. Identifying informative features from complex, high-dimensional biological data is critical in disease study, drug development, and related areas. Support vector machine-recursive feature elimination (SVM-RFE) is an efficient feature selection technique that has proven effective in many applications. It ranks features by the order in which they are recursively eliminated by an SVM. In this study, we propose a method, SVM-RFE-OA, which combines the classification accuracy rate and the average overlapping ratio of the samples to determine the number of features to select from the SVM-RFE feature ranking. In addition, to measure the feature weights more accurately, we propose a modified SVM-RFE-OA (M-SVM-RFE-OA) algorithm that temporarily screens out the samples lying in a heavily overlapping area in each iteration. Experiments on eight public biological datasets show that the discriminative ability of a feature subset can be measured more accurately by combining the classification accuracy rate with the average overlapping degree of the samples than by using the classification accuracy rate alone, and that shielding the samples in the overlapping area makes the calculation of the feature weights more stable and accurate. The methods proposed in this study can also be used with other RFE techniques to identify potential biomarkers from big biological data.
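The baseline ranking step that the abstract builds on can be illustrated with a minimal sketch. It covers only plain SVM-RFE (train a linear SVM, drop the feature with the smallest squared weight, repeat); the overlapping ratio and the sample screening of SVM-RFE-OA and M-SVM-RFE-OA are not defined in the abstract and are therefore not reproduced here. The function names `train_linear_svm` and `svm_rfe_ranking`, the Pegasos-style solver, and the toy data are all assumptions made for illustration, not the authors' implementation.

```python
import random


def train_linear_svm(X, y, epochs=200, lam=0.01, seed=0):
    """Fit a linear SVM without bias by Pegasos-style stochastic subgradient descent.

    This solver is a stand-in chosen to keep the example dependency-free.
    """
    rng = random.Random(seed)
    w = [0.0] * len(X[0])
    t = 0
    for _ in range(epochs):
        order = list(range(len(X)))
        rng.shuffle(order)
        for i in order:
            t += 1
            eta = 1.0 / (lam * t)  # decaying step size
            margin = y[i] * sum(wj * xj for wj, xj in zip(w, X[i]))
            # Regularization shrink, then hinge-loss step if the margin is violated.
            w = [(1.0 - eta * lam) * wj for wj in w]
            if margin < 1.0:
                w = [wj + eta * y[i] * xj for wj, xj in zip(w, X[i])]
    return w


def svm_rfe_ranking(X, y):
    """Return feature indices ordered from most to least important."""
    remaining = list(range(len(X[0])))
    eliminated = []
    while remaining:
        X_sub = [[row[j] for j in remaining] for row in X]
        w = train_linear_svm(X_sub, y)
        # Ranking criterion: the smallest squared weight is removed first.
        worst = min(range(len(remaining)), key=lambda k: w[k] ** 2)
        eliminated.append(remaining.pop(worst))
    return eliminated[::-1]  # last eliminated = most important


# Hypothetical toy data: feature 0 carries the class signal, features 1 and 2 are noise.
data_rng = random.Random(42)
labels = [1 if i % 2 == 0 else -1 for i in range(40)]
samples = [
    [3.0 * lab + data_rng.gauss(0, 0.5), data_rng.gauss(0, 1), data_rng.gauss(0, 1)]
    for lab in labels
]

ranking = svm_rfe_ranking(samples, labels)
print("Feature ranking (most important first):", ranking)
```

In the method the abstract describes, a ranking like this would be the input to a second step that decides how many top-ranked features to keep; the abstract states that this decision combines classification accuracy with the average overlapping ratio of the samples.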
Pages: 10