A novel hybrid feature selection method for microarray data analysis

被引:121
|
作者
Lee, Chien-Pang [1 ]
Leu, Yungho [1 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Informat Management, Taipei 106, Taiwan
关键词
Feature selection; Hybrid method; Genetic algorithm; chi(2)-Test for homogeneity; Microarray data analysis; SUPPORT VECTOR MACHINE; MULTIPLE CANCER TYPES; GENE-EXPRESSION DATA; SAMPLE CLASSIFICATION; PREDICTION; DIAGNOSIS; TUMOR;
D O I
10.1016/j.asoc.2009.11.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, many methods have been proposed for microarray data analysis. One of the challenges for microarray applications is to select a proper number of the most relevant genes for data analysis. In this paper, we propose a novel hybrid method for feature selection in microarray data analysis. This method first uses a genetic algorithm with dynamic parameter setting (GADP) to generate a number of subsets of genes and to rank the genes according to their occurrence frequencies in the gene subsets. Then, this method uses the chi(2)-test for homogeneity to select a proper number of the top-ranked genes for data analysis. We use the support vector machine (SVM) to verify the efficiency of the selected genes. Six different microarray datasets are used to compare the performance of the GADP method with the existing methods. The experimental results show that the GADP method is better than the existing methods in terms of the number of selected genes and the prediction accuracy. (c) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:208 / 213
页数:6
相关论文
共 50 条
  • [21] Prominent feature selection of microarray data
    Yihui Liu School of Computer Science and Information Technology
    [J]. Progress in Natural Science:Materials International, 2009, 19 (10) : 1365 - 1371
  • [22] Prominent feature selection of microarray data
    Liu, Yihui
    [J]. PROGRESS IN NATURAL SCIENCE-MATERIALS INTERNATIONAL, 2009, 19 (10) : 1365 - 1371
  • [23] Wavelet feature selection for microarray data
    Liu, Yihui
    [J]. 2007 IEEE/NIH LIFE SCIENCE SYSTEMS AND APPLICATIONS WORKSHOP, 2007, : 205 - 208
  • [24] Effective feature selection framework for cluster analysis of microarray data
    Pok, Gouchol
    Liu, Jyh-Charn Steve
    Ryu, Keun Ho
    [J]. BIOINFORMATION, 2010, 4 (08) : 385 - 389
  • [25] Microarray classification with hierarchical data representation and novel feature selection criteria
    Bosio, Mattia
    Bellot, Pau
    Salembier, Philippe
    Oliveras Verges, Albert
    [J]. IEEE 12TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS & BIOENGINEERING, 2012, : 344 - 349
  • [26] A two-stage hybrid approach for feature selection in microarray analysis
    Lee, Chung-Hong
    Yang, Hsin-Chang
    Wu, Chih-Hong
    Lan, Yi-Chia
    [J]. HIS 2009: 2009 NINTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, VOL 1, PROCEEDINGS, 2009, : 188 - +
  • [27] Noise-based feature perturbation as a selection method for microarray data
    Chen, Li
    Goldgof, Dmitry B.
    Hall, Lawrence O.
    Eschrich, Steven A.
    [J]. BIOINFORMATICS RESEARCH AND APPLICATIONS, PROCEEDINGS, 2007, 4463 : 237 - +
  • [28] An ensemble feature selection method based on mRMR for paired microarray data
    He, Lihua
    Cao, Zhongbo
    Wang, Yan
    Du, Wei
    Liang, Yanchun
    [J]. Journal of Computational Information Systems, 2014, 10 (11): : 4875 - 4882
  • [29] A method for feature selection on microarray data using support vector machine
    Huang, Xiao Bing
    Tang, Jian
    [J]. DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4081 : 513 - 523
  • [30] Feature Selection for high Dimensional DNA Microarray data using hybrid approaches
    Kumar, Ammu Prasanna
    Valsala, Preeja
    [J]. BIOINFORMATION, 2013, 9 (16) : 824 - 828