MaskedPainter: Feature selection for microarray data analysis

被引:11
|
作者
Apiletti, Daniele [1 ]
Baralis, Elena [1 ]
Bruno, Giulia [1 ]
Fiori, Alessandro [1 ]
机构
[1] Politecn Torino, Dipartimento Automat & Informat, I-10129 Turin, Italy
关键词
Feature selection; microarray analysis; tumor classification; data mining; GENE SELECTION; COLON-CANCER; CLASSIFICATION; EXPRESSION; PREDICTION;
D O I
10.3233/IDA-2012-0546
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Selecting a small number of discriminative genes from thousands is a fundamental task in microarray data analysis. An effective feature selection allows biologists to investigate only a subset of genes instead of the entire set, thus avoiding insignificant, noisy, and redundant features. This paper presents the Masked Painter feature selection method for gene expression data. The proposed method measures the ability of each gene to classify samples belonging to different classes and ranks genes by computing an overlap score. A density based technique is exploited to smooth the effects of outliers in the overlap score computation. Analogously to other approaches, the number of selected genes can be set by the user. However, our algorithm may automatically detect the minimum set of genes that yields the best classification coverage of training set samples. The effectiveness of our approach has been demonstrated through an empirical study on public microarray datasets with different characteristics. Experimental results show that the proposed approach yields a higher classification accuracy with respect to widely used feature selection techniques.
引用
收藏
页码:717 / 737
页数:21
相关论文
共 50 条
  • [1] Boosting for Feature Selection for Microarray Data Analysis
    Guile, Geoffrey R.
    Wang, Wenjia
    [J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 2559 - 2563
  • [2] Feature Selection for Microarray Data by AUC Analysis
    Canul-Reich, Juana
    Hall, Lawrence O.
    Goldgof, Dmitry
    Eschrich, Steven A.
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 768 - +
  • [3] Prominent feature selection of microarray data
    Yihui Liu School of Computer Science and Information Technology
    [J]. Progress in Natural Science:Materials International, 2009, 19 (10) : 1365 - 1371
  • [4] FEATURE DISCRETIZATION AND SELECTION IN MICROARRAY DATA
    Ferreira, Artur
    Figueiredo, Mario
    [J]. KDIR 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2011, : 465 - 469
  • [5] Wavelet feature selection for microarray data
    Liu, Yihui
    [J]. 2007 IEEE/NIH LIFE SCIENCE SYSTEMS AND APPLICATIONS WORKSHOP, 2007, : 205 - 208
  • [6] Prominent feature selection of microarray data
    Liu, Yihui
    [J]. PROGRESS IN NATURAL SCIENCE-MATERIALS INTERNATIONAL, 2009, 19 (10) : 1365 - 1371
  • [7] A novel hybrid feature selection method for microarray data analysis
    Lee, Chien-Pang
    Leu, Yungho
    [J]. APPLIED SOFT COMPUTING, 2011, 11 (01) : 208 - 213
  • [8] Effective feature selection framework for cluster analysis of microarray data
    Pok, Gouchol
    Liu, Jyh-Charn Steve
    Ryu, Keun Ho
    [J]. BIOINFORMATION, 2010, 4 (08) : 385 - 389
  • [9] On biclustering with feature selection for microarray data sets
    Pardalos, Pangs M.
    Busygin, Stanislav
    Prokopyev, Oleg A.
    [J]. BIOMAT 2005, 2006, : 367 - 377
  • [10] Memetic algorithms for feature selection on microarray data
    Zhu, Zexuan
    Ong, Yew-Soon
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 1327 - +