Efficient feature selection and classification for microarray data

被引:50
|
作者
Li, Zifa [1 ]
Xie, Weibo [1 ]
Liu, Tao [1 ]
机构
[1] Huaqiao Univ, Dept Comp Sci & Technol, Xiamen, Fujian, Peoples R China
来源
PLOS ONE | 2018年 / 13卷 / 08期
基金
中国国家自然科学基金;
关键词
GENE SELECTION; SVM-RFE; CANCER; PREDICTION; ALGORITHM; PATTERNS; TUMOR;
D O I
10.1371/journal.pone.0202167
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Feature selection and classification are the main topics in microarray data analysis. Although many feature selection methods have been proposed and developed in this field, SVM-RFE (Support Vector Machine based on Recursive Feature Elimination) is proved as one of the best feature selection methods, which ranks the features (genes) by training support vector machine classification model and selects key genes combining with recursive feature elimination strategy. The principal drawback of SVM-RFE is the huge time consumption. To overcome this limitation, we introduce a more efficient implementation of linear support vector machines and improve the recursive feature elimination strategy and then combine them together to select informative genes. Besides, we propose a simple resampling method to preprocess the datasets, which makes the information distribution of different kinds of samples balanced and the classification results more credible. Moreover, the applicability of four common classifiers is also studied in this paper. Extensive experiments are conducted on six most frequently used microarray datasets in this field, and the results show that the proposed methods have not only reduced the time consumption greatly but also obtained comparable classification performance.
引用
下载
收藏
页数:21
相关论文
共 50 条
  • [21] Novel and efficient method on feature selection and data classification
    Chen, Tieming
    Ma, Jixia
    Huang, Samuel H.
    Cai, Jiamei
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2012, 49 (04): : 735 - 745
  • [22] An efficient feature selection and classification system for microarray cancer data using genetic algorithm and deep belief networks
    Morolake Oladayo Lawrence
    Rasheed Gbenga Jimoh
    Waheed Babatunde Yahya
    Multimedia Tools and Applications, 2025, 84 (8) : 4393 - 4434
  • [23] Prominent feature selection of microarray data
    Yihui Liu School of Computer Science and Information Technology
    Progress in Natural Science:Materials International, 2009, 19 (10) : 1365 - 1371
  • [24] FEATURE DISCRETIZATION AND SELECTION IN MICROARRAY DATA
    Ferreira, Artur
    Figueiredo, Mario
    KDIR 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2011, : 465 - 469
  • [25] Wavelet feature selection for microarray data
    Liu, Yihui
    2007 IEEE/NIH LIFE SCIENCE SYSTEMS AND APPLICATIONS WORKSHOP, 2007, : 205 - 208
  • [26] Prominent feature selection of microarray data
    Liu, Yihui
    PROGRESS IN NATURAL SCIENCE-MATERIALS INTERNATIONAL, 2009, 19 (10) : 1365 - 1371
  • [27] A Clustering Approach for Feature Selection in Microarray Data Classification Using Random forest
    Aydadenta, Husna
    Adiwijaya
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2018, 14 (05): : 1167 - 1175
  • [28] Microarray Lung Cancer Data Classification Using Similarity based Feature Selection
    Amrane, Meriem
    Oukid, Saliha
    Ensari, Tolga
    Benblidia, Nadjia
    Orman, Zeynep
    2019 SCIENTIFIC MEETING ON ELECTRICAL-ELECTRONICS & BIOMEDICAL ENGINEERING AND COMPUTER SCIENCE (EBBT), 2019,
  • [29] Comparison of population based metaheuristics for feature selection:: Application to microarray data classification
    Talbi, E-G.
    Jourdan, L.
    Garcia-Nieto, J.
    Alba, E.
    2008 IEEE/ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1-3, 2008, : 45 - +
  • [30] A Kernel-Based Multivariate Feature Selection Method for Microarray Data Classification
    Sun, Shiquan
    Peng, Qinke
    Shakoor, Adnan
    PLOS ONE, 2014, 9 (07):