A hybrid feature selection method for DNA microarray data

被引:73
|
作者
Chuang, Li-Yeh [3 ]
Yang, Cheng-Huei [4 ]
Wu, Kuo-Chuan [5 ]
Yang, Cheng-Hong [1 ,2 ,4 ,5 ]
机构
[1] Natl Kaohsiung Univ Appl Sci, Dept Elect Engn, Kaohsiung 80708, Taiwan
[2] Toko Univ, Dept Network Syst, Chiayi 61363, Taiwan
[3] I Shou Univ, Dept Chem Engn, Kaohsiung 80041, Taiwan
[4] Natl Kaohsiung Marine Univ, Dept Elect Commun Engn, Kaohsiung 81157, Taiwan
[5] Natl Kaohsiung Univ Appl Sci, Dept Comp Sci & Informat Engn, Kaohsiung 80708, Taiwan
关键词
Feature selection; Taguchi-genetic algorithm; K-nearest neighbor; Leave-one-out cross-validation; GENE SELECTION; TABU SEARCH; CLASSIFICATION; ALGORITHM; OPTIMIZATION; ENSEMBLE; CHOICE;
D O I
10.1016/j.compbiomed.2011.02.004
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Gene expression profiles, which represent the state of a cell at a molecular level, have great potential as a medical diagnosis tool. In cancer classification, available training data sets are generally of a fairly small sample size compared to the number of genes involved. Along with training data limitations, this constitutes a challenge to certain classification methods. Feature (gene) selection can be used to successfully extract those genes that directly influence classification accuracy and to eliminate genes which have no influence on it. This significantly improves calculation performance and classification accuracy. In this paper, correlation-based feature selection (CFS) and the Taguchi-genetic algorithm (TGA) method were combined into a hybrid method, and the K-nearest neighbor (KNN) with the leave-one-out cross-validation (LOOCV) method served as a classifier for eleven classification profiles to calculate the classification accuracy. Experimental results show that the proposed method reduced redundant features effectively and achieved superior classification accuracy. The classification accuracy obtained by the proposed method was higher in ten out of the eleven gene expression data set test problems when compared to other classification methods from the literature. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:228 / 237
页数:10
相关论文
共 50 条
  • [1] A novel hybrid feature selection method for microarray data analysis
    Lee, Chien-Pang
    Leu, Yungho
    [J]. APPLIED SOFT COMPUTING, 2011, 11 (01) : 208 - 213
  • [2] A hybrid feature selection algorithm for microarray data
    Zheng, Yuefeng
    Li, Ying
    Wang, Gang
    Chen, Yupeng
    Xu, Qian
    Fan, Jiahao
    Cui, Xueting
    [J]. JOURNAL OF SUPERCOMPUTING, 2020, 76 (05): : 3494 - 3526
  • [3] A hybrid feature selection algorithm for microarray data
    Yuefeng Zheng
    Ying Li
    Gang Wang
    Yupeng Chen
    Qian Xu
    Jiahao Fan
    Xueting Cui
    [J]. The Journal of Supercomputing, 2020, 76 : 3494 - 3526
  • [4] Feature Selection for high Dimensional DNA Microarray data using hybrid approaches
    Kumar, Ammu Prasanna
    Valsala, Preeja
    [J]. BIOINFORMATION, 2013, 9 (16) : 824 - 828
  • [5] A Composite Method for Feature Selection of Microarray Data
    Li, Zejun
    Yang, Ang
    Chen, Xia
    Zeng, Lijun
    Cao, Tao
    [J]. JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2014, 11 (02) : 472 - 476
  • [6] IG-GA: A Hybrid Filter/Wrapper Method for Feature Selection of Microarray Data
    Yang, Cheng-Huei
    Chuang, Li-Yeh
    Yang, Cheng-Hong
    [J]. JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2010, 30 (01) : 23 - 28
  • [7] A hybrid feature selection approach for microarray gene expression data
    Tan, Feng
    Fu, Xuezheng
    Wang, Hao
    Zhang, Yanqing
    Bourgeois, Anu
    [J]. COMPUTATIONAL SCIENCE - ICCS 2006, PT 2, PROCEEDINGS, 2006, 3992 : 678 - 685
  • [8] Exploring the consequences of distributed feature selection in DNA microarray data
    Bolon-Canedo, Veronica
    Sechidis, Konstantinos
    Sanchez-Marono, Noelia
    Alonso-Betanzos, Amparo
    Brown, Gavin
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1665 - 1672
  • [9] A Novel Hybrid Method for Gene Selection of Microarray Data
    Liao, Bo
    Cao, Tao
    Lu, Xinguo
    Zhu, Wen
    [J]. JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2012, 9 (01) : 5 - 9
  • [10] A Novel Hybrid Method for Gene Selection of Microarray Data
    Wu, Ronghui
    Liu, Yun
    Li, Renfa
    Cao, Tao
    Yue, Guangxue
    [J]. JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2011, 8 (07) : 1162 - 1165