Efficient feature selection and classification for microarray data

被引:50
|
作者
Li, Zifa [1 ]
Xie, Weibo [1 ]
Liu, Tao [1 ]
机构
[1] Huaqiao Univ, Dept Comp Sci & Technol, Xiamen, Fujian, Peoples R China
来源
PLOS ONE | 2018年 / 13卷 / 08期
基金
中国国家自然科学基金;
关键词
GENE SELECTION; SVM-RFE; CANCER; PREDICTION; ALGORITHM; PATTERNS; TUMOR;
D O I
10.1371/journal.pone.0202167
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Feature selection and classification are the main topics in microarray data analysis. Although many feature selection methods have been proposed and developed in this field, SVM-RFE (Support Vector Machine based on Recursive Feature Elimination) is proved as one of the best feature selection methods, which ranks the features (genes) by training support vector machine classification model and selects key genes combining with recursive feature elimination strategy. The principal drawback of SVM-RFE is the huge time consumption. To overcome this limitation, we introduce a more efficient implementation of linear support vector machines and improve the recursive feature elimination strategy and then combine them together to select informative genes. Besides, we propose a simple resampling method to preprocess the datasets, which makes the information distribution of different kinds of samples balanced and the classification results more credible. Moreover, the applicability of four common classifiers is also studied in this paper. Extensive experiments are conducted on six most frequently used microarray datasets in this field, and the results show that the proposed methods have not only reduced the time consumption greatly but also obtained comparable classification performance.
引用
下载
收藏
页数:21
相关论文
共 50 条
  • [1] Efficient gene selection for classification of microarray data
    Ho, SY
    Lee, CC
    Chen, HM
    Huang, HL
    2005 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-3, PROCEEDINGS, 2005, : 1753 - 1760
  • [2] Feature Selection for Cancer Classification on Microarray Expression Data
    Hsu, Hui-Huang
    Lu, Ming-Da
    ISDA 2008: EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 3, PROCEEDINGS, 2008, : 153 - 158
  • [3] Distributed feature selection: An application to microarray data classification
    Bolon-Canedo, V.
    Sanchez-Marono, N.
    Alonso-Betanzos, A.
    APPLIED SOFT COMPUTING, 2015, 30 : 136 - 150
  • [4] A Robust and Efficient Feature Selection Algorithm for Microarray Data
    Bari, Mehrab Ghanat
    Salekin, Sirajul
    Zhang, Jianqiu
    MOLECULAR INFORMATICS, 2017, 36 (04)
  • [5] Feature selection in independent component subspace for microarray data classification
    Zheng, Chun-Hou
    huang, De-S Huang
    Shang, Li
    NEUROCOMPUTING, 2006, 69 (16-18) : 2407 - 2410
  • [6] Feature selection using differential evolution for microarray data classification
    Prajapati S.
    Das H.
    Gourisaria M.K.
    Discover Internet of Things, 2023, 3 (01):
  • [7] Stable feature selection and classification algorithms for multiclass microarray data
    Student, Sebastian
    Fujarewicz, Krzysztof
    BIOLOGY DIRECT, 2012, 7
  • [8] An enhanced feature selection filter for classification of microarray cancer data
    Mazumder, Dilwar Hussain
    Veilumuthu, Ramachandran
    ETRI JOURNAL, 2019, 41 (03) : 358 - 370
  • [9] Parallel classification and feature selection in microarray data using SPRINT
    Mitchell, Lawrence
    Sloan, Terence M.
    Mewissen, Muriel
    Ghazal, Peter
    Forster, Thorsten
    Piotrowski, Michal
    Trew, Arthur
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2014, 26 (04): : 854 - 865
  • [10] Stable feature selection and classification algorithms for multiclass microarray data
    Sebastian Student
    Krzysztof Fujarewicz
    Biology Direct, 7