MaskedPainter: Feature selection for microarray data analysis

Cited by: 11
Authors
Apiletti, Daniele [1 ]
Baralis, Elena [1 ]
Bruno, Giulia [1 ]
Fiori, Alessandro [1 ]
Affiliations
[1] Politecn Torino, Dipartimento Automat & Informat, I-10129 Turin, Italy
Keywords
Feature selection; microarray analysis; tumor classification; data mining; GENE SELECTION; COLON-CANCER; CLASSIFICATION; EXPRESSION; PREDICTION;
DOI
10.3233/IDA-2012-0546
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Selecting a small number of discriminative genes from thousands is a fundamental task in microarray data analysis. Effective feature selection allows biologists to investigate only a subset of genes instead of the entire set, thus avoiding insignificant, noisy, and redundant features. This paper presents the MaskedPainter feature selection method for gene expression data. The proposed method measures the ability of each gene to classify samples belonging to different classes and ranks genes by computing an overlap score. A density-based technique is used to smooth the effects of outliers in the overlap score computation. As in other approaches, the number of selected genes can be set by the user. In addition, our algorithm can automatically detect the minimum set of genes that yields the best classification coverage of the training set samples. The effectiveness of our approach is demonstrated through an empirical study on public microarray datasets with different characteristics. Experimental results show that the proposed approach yields higher classification accuracy than widely used feature selection techniques.
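The abstract names three ingredients (a per-gene overlap score, a ranking, and a small gene set that covers the training samples) but gives no formulas. The sketch below is only an illustration of that general idea under simplifying assumptions, not the authors' algorithm: each class is summarized by its plain min-max expression range, the overlap score is the shared range length divided by the gene's total range, and the gene set is chosen by a greedy cover, which does not guarantee a minimum set. The density-based outlier smoothing is omitted. All function names are hypothetical.

```python
from itertools import combinations


def class_intervals(values, labels):
    """Map each class to the (min, max) range of one gene's expression values."""
    ranges = {}
    for v, c in zip(values, labels):
        lo, hi = ranges.get(c, (v, v))
        ranges[c] = (min(lo, v), max(hi, v))
    return ranges


def overlap_score(values, labels):
    """Assumed score: summed pairwise overlap of class ranges over the total range.

    0.0 means the class ranges are disjoint; lower is better.
    """
    span = max(values) - min(values)
    if span == 0:
        return 1.0  # constant gene: treated as fully overlapping
    ranges = class_intervals(values, labels)
    shared = sum(
        max(0.0, min(a[1], b[1]) - max(a[0], b[0]))
        for a, b in combinations(ranges.values(), 2)
    )
    return shared / span


def covered_samples(values, labels):
    """Indices of samples whose value lies inside no other class's range."""
    ranges = class_intervals(values, labels)
    return {
        i
        for i, (v, c) in enumerate(zip(values, labels))
        if not any(lo <= v <= hi for k, (lo, hi) in ranges.items() if k != c)
    }


def select_genes(data, labels):
    """Greedy cover over genes; `data` maps gene name -> per-sample values."""
    masks = {g: covered_samples(v, labels) for g, v in data.items()}
    scores = {g: overlap_score(v, labels) for g, v in data.items()}
    uncovered = set().union(*masks.values())  # samples some gene can cover
    chosen = []
    while uncovered:
        # Prefer the gene covering most uncovered samples; break ties by lower overlap.
        best = max(masks, key=lambda g: (len(masks[g] & uncovered), -scores[g]))
        chosen.append(best)
        uncovered -= masks[best]
    return chosen


if __name__ == "__main__":
    labels = ["tumor", "tumor", "tumor", "normal", "normal", "normal"]
    data = {
        "g1": [1.0, 1.2, 1.1, 3.0, 3.2, 2.9],  # classes well separated
        "g2": [1.0, 2.0, 3.0, 1.5, 2.5, 3.5],  # classes overlap heavily
        "g3": [5.0, 5.0, 5.0, 5.0, 5.0, 5.0],  # uninformative constant gene
    }
    ranking = sorted(data, key=lambda g: overlap_score(data[g], labels))
    print("ranking:", ranking)
    print("selected:", select_genes(data, labels))
```

On this toy data the well-separated gene alone covers every training sample, so the greedy step stops after one gene. A user-specified gene count, as the abstract also allows, would simply take the top entries of the ranking instead.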
Pages: 717-737
Page count: 21