Gene subset selection in microarray data using entropic filtering for cancer classification

Cited by: 12
Authors
Navarro, Felix F. Gonzalez [1]
Munoz, Lluis A. Belanche [1]
Affiliations
[1] Univ Politecn Catalonia, Languages & Informat Syst Dept, Barcelona 08034, Spain
Keywords
microarray gene expression; feature selection; cancer classification; EXPRESSION DATA; SVM-RFE; DISCOVERY; MIGRATION; STRATEGY;
DOI
10.1111/j.1468-0394.2008.00489.x
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this work an entropic filtering algorithm (EFA) for feature selection is described, as a workable method to generate a relevant subset of genes. This is a fast feature selection method based on finding feature subsets that jointly maximize the normalized multivariate conditional entropy with respect to the classification ability of tumours. The EFA is tested in combination with several machine learning algorithms on five public domain microarray data sets. It is found that this combination offers subsets yielding similar or much better accuracies than using the full set of genes. The solutions obtained are of comparable quality to previous results, but they are obtained in a maximum of half an hour computing time and use a very low number of genes.
Pages: 113 - 124
Page count: 12
Related papers
50 in total
  • [31] ON GENE SELECTION AND CLASSIFICATION FOR CANCER MICROARRAY DATA USING MULTI-STEP CLUSTERING AND SPARSE REPRESENTATION
    Jing, Liping
    Ng, Michael K.
    Zeng, Tieyong
    ADVANCES IN DATA SCIENCE AND ADAPTIVE ANALYSIS, 2011, 3 (1-2) : 127 - 148
  • [32] Ensemble gene selection by grouping for microarray data classification
    Liu, Huawen
    Liu, Lei
    Zhang, Huijie
    JOURNAL OF BIOMEDICAL INFORMATICS, 2010, 43 (01) : 81 - 87
  • [33] Advances in metaheuristics for gene selection and classification of microarray data
    Duval, Beatrice
    Hao, Jin-Kao
    BRIEFINGS IN BIOINFORMATICS, 2010, 11 (01) : 127 - 141
  • [34] A STUDY ON GENE SELECTION AND CLASSIFICATION ALGORITHMS FOR CLASSIFICATION OF MICROARRAY GENE EXPRESSION DATA
    Chin, Yeo Lee
    Deris, Safaai
    JURNAL TEKNOLOGI, 2005, 43
  • [35] Random forest for gene selection and microarray data classification
    Moorthy, Kohbalan
    Mohamad, Mohd Saberi
    BIOINFORMATION, 2011, 7 (03) : 142 - 146
  • [36] Random Forest for Gene Selection and Microarray Data Classification
    Moorthy, Kohbalan
    Mohamad, Mohd Saberi
    KNOWLEDGE TECHNOLOGY, 2012, 295 : 174 - 183
  • [37] Gene Selection for Microarray Data Classification Using Hybrid Meta-Heuristics
    Dif, Nassima
    Attaoui, Mohamed Walid
    Elberrichi, Zakaria
    MODELLING AND IMPLEMENTATION OF COMPLEX SYSTEMS, 2019, 64 : 119 - 132
  • [38] Gene selection for microarray data classification using a novel ant colony optimization
    Tabakhi, Sina
    Najafi, Ali
    Ranjbar, Reza
    Moradi, Parham
    NEUROCOMPUTING, 2015, 168 : 1024 - 1036
  • [39] Gene Subset Selection for Cancer Classification Using Statsitical and Rough Set Approach
    Das, Asit Kumar
    Pati, Soumen Kumar
    SWARM, EVOLUTIONARY, AND MEMETIC COMPUTING, (SEMCCO 2012), 2012, 7677 : 294 - +
  • [40] MiRNA subset selection for microarray data classification using grey wolf optimizer and evolutionary population dynamics
    Khaled H. Almotairi
    Neural Computing and Applications, 2023, 35 : 18737 - 18761