MaskedPainter: Feature selection for microarray data analysis

被引:11
|
作者
Apiletti, Daniele [1 ]
Baralis, Elena [1 ]
Bruno, Giulia [1 ]
Fiori, Alessandro [1 ]
机构
[1] Politecn Torino, Dipartimento Automat & Informat, I-10129 Turin, Italy
关键词
Feature selection; microarray analysis; tumor classification; data mining; GENE SELECTION; COLON-CANCER; CLASSIFICATION; EXPRESSION; PREDICTION;
D O I
10.3233/IDA-2012-0546
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Selecting a small number of discriminative genes from thousands is a fundamental task in microarray data analysis. An effective feature selection allows biologists to investigate only a subset of genes instead of the entire set, thus avoiding insignificant, noisy, and redundant features. This paper presents the Masked Painter feature selection method for gene expression data. The proposed method measures the ability of each gene to classify samples belonging to different classes and ranks genes by computing an overlap score. A density based technique is exploited to smooth the effects of outliers in the overlap score computation. Analogously to other approaches, the number of selected genes can be set by the user. However, our algorithm may automatically detect the minimum set of genes that yields the best classification coverage of training set samples. The effectiveness of our approach has been demonstrated through an empirical study on public microarray datasets with different characteristics. Experimental results show that the proposed approach yields a higher classification accuracy with respect to widely used feature selection techniques.
引用
收藏
页码:717 / 737
页数:21
相关论文
共 50 条
  • [31] Feature selection and classification for microarray data analysis: Evolutionary methods for identifying predictive genes
    Thanyaluk Jirapech-Umpai
    Stuart Aitken
    BMC Bioinformatics, 6
  • [32] Feature selection and classification for microarray data analysis: Evolutionary methods for identifying predictive genes
    Jirapech-Umpai, T
    Aitken, S
    BMC BIOINFORMATICS, 2005, 6 (1)
  • [33] Feature selection for microarray data analysis using mutual information and rough set theory
    Zhou, Wengang
    Zhou, Chunguang
    Zhu, Hong
    Liu, Guixia
    Chang, Xiaoyu
    COMPUTATIONAL INTELLIGENCE AND BIOINFORMATICS, PT 3, PROCEEDINGS, 2006, 4115 : 424 - 432
  • [34] A multi-objective feature selection and classifier ensemble technique for microarray data analysis
    Dash, Rasmita
    Misra, Bijan Bihari
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2018, 20 (02) : 123 - 160
  • [35] Analysis of Microarray Gene Expression Data Using Various Feature Selection and Classification Techniques
    Singh, W. Jai
    Kavitha, R. K.
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (11): : 105 - 108
  • [36] Feature selection for microarray data analysis using mutual information and rough set theory
    Zhou, Wengang
    Zhou, Chunguang
    Liu, Guixia
    Zhu, Hong
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, 2006, 204 : 492 - +
  • [37] Improved Microarray Data Analysis using Feature Selection Methods with Machine Learning Methods
    Sun, Jing
    Passi, Kalpdrum
    Jain, Chakresh Kumar
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 1527 - 1534
  • [38] A Discernibility-Based Approach to Feature Selection for Microarray Data
    Voulgaris, Zacharias
    Magoulas, George D.
    2008 4TH INTERNATIONAL IEEE CONFERENCE INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2008, : 818 - 823
  • [39] Exploring the consequences of distributed feature selection in DNA microarray data
    Bolon-Canedo, Veronica
    Sechidis, Konstantinos
    Sanchez-Marono, Noelia
    Alonso-Betanzos, Amparo
    Brown, Gavin
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1665 - 1672
  • [40] Feature selection in independent component subspace for microarray data classification
    Zheng, Chun-Hou
    huang, De-S Huang
    Shang, Li
    NEUROCOMPUTING, 2006, 69 (16-18) : 2407 - 2410