A novel hybrid feature selection method for microarray data analysis

被引:121
|
作者
Lee, Chien-Pang [1 ]
Leu, Yungho [1 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Informat Management, Taipei 106, Taiwan
关键词
Feature selection; Hybrid method; Genetic algorithm; chi(2)-Test for homogeneity; Microarray data analysis; SUPPORT VECTOR MACHINE; MULTIPLE CANCER TYPES; GENE-EXPRESSION DATA; SAMPLE CLASSIFICATION; PREDICTION; DIAGNOSIS; TUMOR;
D O I
10.1016/j.asoc.2009.11.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, many methods have been proposed for microarray data analysis. One of the challenges for microarray applications is to select a proper number of the most relevant genes for data analysis. In this paper, we propose a novel hybrid method for feature selection in microarray data analysis. This method first uses a genetic algorithm with dynamic parameter setting (GADP) to generate a number of subsets of genes and to rank the genes according to their occurrence frequencies in the gene subsets. Then, this method uses the chi(2)-test for homogeneity to select a proper number of the top-ranked genes for data analysis. We use the support vector machine (SVM) to verify the efficiency of the selected genes. Six different microarray datasets are used to compare the performance of the GADP method with the existing methods. The experimental results show that the GADP method is better than the existing methods in terms of the number of selected genes and the prediction accuracy. (c) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:208 / 213
页数:6
相关论文
共 50 条
  • [41] Novel and efficient method on feature selection and data classification
    Chen, Tieming
    Ma, Jixia
    Huang, Samuel H.
    Cai, Jiamei
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2012, 49 (04): : 735 - 745
  • [42] Hybrid feature selection using micro genetic algorithm on microarray gene expression data
    Pragadeesh, C.
    Jeyaraj, Rohana
    Siranjeevi, K.
    Abishek, R.
    Jeyakumar, G.
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (03) : 2241 - 2246
  • [43] A Kernel-Based Multivariate Feature Selection Method for Microarray Data Classification
    Sun, Shiquan
    Peng, Qinke
    Shakoor, Adnan
    [J]. PLOS ONE, 2014, 9 (07):
  • [44] A fuzzy gaussian rank aggregation ensemble feature selection method for microarray data
    Venkatesh, B.
    Anuradha, J.
    [J]. INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2020, 24 (04) : 289 - 301
  • [45] A Hybrid Feature Selection Based on Fisher Score and SVM-RFE for Microarray Data
    Hamla H.
    Ghanem K.
    [J]. Informatica (Slovenia), 2024, 48 (01): : 57 - 68
  • [46] Hybrid Information Gain Based Fuzzy Roughset Feature Selection in Cancer Microarray Data
    Chinnaswamy, Arunkumar
    Srinivasan, Ramakrishnan
    [J]. 2017 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2017,
  • [47] Clustering-based hybrid feature selection approach for high dimensional microarray data
    Babu, Samson Anosh P.
    Annavarapu, Chandra Sekhara Rao
    Dara, Suresh
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2021, 213
  • [48] A novel feature extraction approach based on ensemble feature selection and modified discriminant independent component analysis for microarray data classification
    Mollaee, Maryam
    Moattar, Mohammad Hossein
    [J]. BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2016, 36 (03) : 521 - 529
  • [49] Hybrid feature selection method for gene expression analysis
    Wang, Liping
    Han, B. L.
    [J]. ELECTRONICS LETTERS, 2014, 50 (18) : 1269 - 1270
  • [50] Sentiment Analysis on Big Data: A Hybrid SED-TABU Feature Selection Method
    Rajagopal, Sabitha
    Jayaprakash, Sreemathy
    Subburathinam, Karthik
    [J]. Tehnicki Vjesnik, 2024, 31 (06): : 2079 - 2086