Selecting Feature Subsets Based on SVM-RFE and the Overlapping Ratio with Applications in Bioinformatics

Cited by: 79
Authors
Lin, Xiaohui [1 ]
Li, Chao [1 ]
Zhang, Yanhui [1 ]
Su, Benzhe [1 ]
Fan, Meng [1 ]
Wei, Hai [1 ]
Affiliations
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 116024, Peoples R China
Source
MOLECULES | 2018 / Vol. 23 / Issue 01
Funding
National Natural Science Foundation of China;
Keywords
SVM-RFE; overlapping degree; feature selection; GENE SELECTION; CANCER CLASSIFICATION; BIOMARKER DISCOVERY; TUMOR; PREDICTION; SYSTEM;
DOI
10.3390/molecules23010052
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010 ; 081704 ;
Abstract
Feature selection is an important topic in bioinformatics. Identifying informative features in complex, high-dimensional biological data is critical in disease studies, drug development, and related fields. Support vector machine-recursive feature elimination (SVM-RFE) is an efficient feature selection technique that has proved effective in many applications. It ranks features by the order in which they are recursively eliminated by an SVM. In this study, we propose a method, SVM-RFE-OA, which combines the classification accuracy rate and the average overlapping ratio of the samples to determine how many features to select from the SVM-RFE feature ranking. To measure the feature weights more accurately, we also propose a modified SVM-RFE-OA (M-SVM-RFE-OA) algorithm that temporarily screens out the samples lying in a heavily overlapping area in each iteration. Experiments on eight public biological datasets show that combining the classification accuracy rate with the average overlapping degree of the samples measures the discriminative ability of a feature subset more accurately than the classification accuracy rate alone, and that shielding the samples in the overlapping area makes the calculation of the feature weights more stable and accurate. The methods proposed in this study can also be used with other RFE techniques to identify potential biomarkers in large biological datasets.
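The subset-size selection step described in the abstract can be illustrated with a minimal sketch. The abstract does not define the overlapping ratio or say how it is combined with accuracy, so everything specific here is an assumption made for illustration: the feature ranking is taken as given (in practice it would come from SVM-RFE), a leave-one-out nearest-centroid classifier stands in for the SVM, the overlap is approximated as the mean share of each sample's k nearest neighbours that carry a different label, and the two are combined as accuracy × (1 − overlap). The function names (`choose_subset`, `average_overlap`, and so on) are hypothetical.

```python
def euclid(p, q):
    """Euclidean distance between two equal-length points."""
    return sum((a - b) ** 2 for a, b in zip(p, q)) ** 0.5

def project(X, features):
    """Keep only the selected feature columns of every sample."""
    return [[row[j] for j in features] for row in X]

def loo_centroid_accuracy(P, y):
    """Leave-one-out accuracy of a nearest-centroid classifier.
    ASSUMPTION: a simple stand-in for the SVM used in the paper."""
    correct = 0
    for i, p in enumerate(P):
        centroids = {}
        for label in sorted(set(y)):
            members = [P[k] for k in range(len(P)) if k != i and y[k] == label]
            if members:  # skip a class left empty by the held-out sample
                centroids[label] = [sum(c) / len(members) for c in zip(*members)]
        guess = min(centroids, key=lambda c: euclid(p, centroids[c]))
        correct += guess == y[i]
    return correct / len(P)

def average_overlap(P, y, k=3):
    """Mean fraction of each sample's k nearest neighbours with another label.
    ASSUMPTION: a hypothetical proxy, not the paper's overlapping ratio."""
    total = 0.0
    for i, p in enumerate(P):
        nearest = sorted((j for j in range(len(P)) if j != i),
                         key=lambda j: euclid(p, P[j]))[:k]
        total += sum(y[j] != y[i] for j in nearest) / len(nearest)
    return total / len(P)

def choose_subset(X, y, ranking, k=3):
    """Scan prefixes of the ranking and keep the smallest one that maximises
    accuracy * (1 - overlap). ASSUMPTION: the combination rule is illustrative."""
    best = None
    for size in range(1, len(ranking) + 1):
        P = project(X, ranking[:size])
        acc = loo_centroid_accuracy(P, y)
        ovl = average_overlap(P, y, k)
        score = acc * (1.0 - ovl)
        if best is None or score > best[0]:
            best = (score, size, acc, ovl)
    return ranking[:best[1]], best[2], best[3]

# Toy data: feature 0 separates the classes; features 1 and 2 are noise.
X = [[0.0, 5, 1.0], [0.1, -5, 0.2], [0.2, 3, 0.9], [0.3, -3, 0.1],
     [1.0, 4, 0.8], [1.1, -4, 0.3], [1.2, 2, 0.7], [1.3, -2, 0.0]]
y = [0, 0, 0, 0, 1, 1, 1, 1]
ranking = [0, 2, 1]  # assumed SVM-RFE output, most important feature first

selected, acc, ovl = choose_subset(X, y, ranking)
print(selected, acc, ovl)
```

Two subsets with equal accuracy can still differ in how cleanly the classes separate, which is why a second criterion helps when picking the cut-off; the strict comparison in `choose_subset` favours the smaller subset on ties.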
Pages: 10
Related papers
50 in total
  • [41] Mutual information-based SVM-RFE for diagnostic classification of digitized mammograms
    Yoon, Sejong
    Kim, Saejoon
    PATTERN RECOGNITION LETTERS, 2009, 30 (16) : 1489 - 1495
  • [42] Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE
    Satoshi Niijima
    Satoru Kuhara
    BMC Bioinformatics, 7
  • [43] Factor analysis of individual factors associated with low arch height ratio of the foot by SVM-RFE
    Nakao H.
    Imaoka M.
    Oka K.
    Morifuji T.
    Hashimoto M.
    Matsumoto K.
    Kita K.
    Transactions of Japanese Society for Medical and Biological Engineering, 2019, 57 (06) : 190 - 197
  • [44] Identification of Autism Based on SVM-RFE and Stacked Sparse Auto-Encoder
    Wang, Canhua
    Xiao, Zhiyong
    Wang, Baoyu
    Wu, Jianhua
    IEEE ACCESS, 2019, 7 : 118030 - 118036
  • [45] A novel SVM-RFE based biomedical data processing approach: basic and beyond
    Yin, Zuyu
    Fei, Zhongyang
    Yang, Chengming
    Chen, Ao
    PROCEEDINGS OF THE IECON 2016 - 42ND ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2016, : 7143 - 7148
  • [46] Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE
    Niijima, Satoshi
    Kuhara, Satoru
    BMC BIOINFORMATICS, 2006, 7 (1)
  • [47] An optimized SVM-RFE based feature selection and weighted entropy K-means approach for big data clustering in mapreduce
    Madan, Suman
    Komalavalli, C.
    Bhatia, Manjot Kaur
    Laroiya, Chetna
    Arora, Monika
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (30) : 74233 - 74254
  • [48] RETRACTED: Research on Complex Classification Algorithm of Breast Cancer Chip Based on SVM-RFE Gene Feature Screening (Retracted Article)
    Chen, Guobin
    Xie, Xianzhong
    Li, Shijin
    COMPLEXITY, 2020, 2020
  • [49] MRI-BASED CLASSIFICATION OF BRAIN TUMOR TYPE AND GRADE USING SVM-RFE
    Zacharaki, Evangelia I.
    Wang, Sumei
    Chawla, Sanjeev
    Yoo, Dong Soo
    Wolf, Ronald
    Melhem, Elias R.
    Davatzikos, Christos
    2009 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING: FROM NANO TO MACRO, VOLS 1 AND 2, 2009, : 1035 - 1038
  • [50] Support Vector Machine - Recursive Feature Elimination (SVM-RFE) for Selection of MicroRNA Expression Features of Breast Cancer
    Adorada, Amazona
    Permatasari, Ratih
    Wirawan, Panji Wisnu
    Wibowo, Adi
    Sujiwo, Adi
    2018 2ND INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS), 2018, : 165 - 168