Gene selection and classification using Taguchi chaotic binary particle swarm optimization

被引:63
|
作者
Chuang, Li-Yeh [2 ]
Yang, Cheng-San [1 ]
Wu, Kuo-Chuan [3 ]
Yang, Cheng-Hong [4 ,5 ]
机构
[1] Chia Yi Christian Hosp, Dept Plast Surg, Chiayi 60002, Taiwan
[2] I Shou Univ, Inst Biotechnol & Chem Engn, Kaohsiung 80041, Taiwan
[3] Natl Kaohsiung Univ Appl Sci, Dept Comp Sci & Informat Engn, Kaohsiung 80708, Taiwan
[4] Toko Univ, Dept Network Syst, Chiayi 61363, Taiwan
[5] Natl Kaohsiung Univ Appl Sci, Dept Elect Engn, Kaohsiung 80708, Taiwan
关键词
Microarray data; Correlation-based feature selection; Taguchi-binary particle swarm optimization; K-nearest neighbor; MICROARRAY DATA; CANCER CLASSIFICATION; ALGORITHM; PREDICTION; COMBINATION; PATTERNS; CHOICE; FILTER; TUMOR;
D O I
10.1016/j.eswa.2011.04.165
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The purpose of gene expression analysis is to discriminate between classes of samples, and to predict the relative importance of each gene for sample classification. Microarray data with reference to gene expression profiles have provided some valuable results related to a variety of problems and contributed to advances in clinical medicine. Microarray data characteristically have a high dimension and a small sample size. This makes it difficult for a general classification method to obtain correct data for classification. However, not every gene is potentially relevant for distinguishing the sample class. Thus, in order to analyze gene expression profiles correctly, feature (gene) selection is crucial for the classification process, and an effective gene extraction method is necessary for eliminating irrelevant genes and decreasing the classification error rate. In this paper, correlation-based feature selection (CFS) and the Taguchi chaotic binary particle swarm optimization (TCBPSO) were combined into a hybrid method. The K-nearest neighbor (K-NN) with leave-one-out cross-validation (LOOCV) method served as a classifier for ten gene expression profiles. Experimental results show that this hybrid method effectively simplifies features selection by reducing the number of features needed. The classification error rate obtained by the proposed method had the lowest classification error rate for all of the ten gene expression data set problems tested. For six of the gene expression profile data sets a classification error rate of zero could be reached. The introduced method outperformed five other methods from the literature in terms of classification error rate. It could thus constitute a valuable tool for gene expression analysis in future studies. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:13367 / 13377
页数:11
相关论文
共 50 条
  • [1] Chaotic binary particle swarm optimization for feature selection using logistic map
    Chuang, Li-Yeh
    Li, Jung-Chike
    Yang, Cheng-Hong
    [J]. IMECS 2008: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2008, : 131 - +
  • [2] Chaotic Maps in Binary Particle Swarm Optimization for Feature Selection
    Yang, Cheng-San
    Chuang, Li-Yeh
    Li, Jung-Chike
    Yang, Cheng-Hong
    [J]. 2008 IEEE CONFERENCE ON SOFT COMPUTING IN INDUSTRIAL APPLICATIONS SMCIA/08, 2009, : 107 - +
  • [3] An improved Binary Particle Swarm Optimization (iBPSO) for Gene Selection and Cancer Classification using DNA Microarrays
    Jain, Indu
    Jaint, Vinod Kumar
    Jain, Renu
    [J]. 2018 CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (CICT'18), 2018,
  • [4] Image classification using chaotic particle swarm optimization
    Chandramouli, Krishna
    Izquierdo, Ebroul
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 3001 - +
  • [5] Chaotic maps based on binary particle swarm optimization for feature selection
    Chuang, Li-Yeh
    Yang, Cheng-Hong
    Li, Jung-Chike
    [J]. APPLIED SOFT COMPUTING, 2011, 11 (01) : 239 - 248
  • [6] Feature Selection using Feature Ranking, Correlation Analysis and Chaotic Binary Particle Swarm Optimization
    Wang, Fei
    Yang, Yi
    Lv, Xianchao
    Xu, Jiao
    Li, Lian
    [J]. 2014 5TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2014, : 305 - 309
  • [7] Binary Particle Swarm Optimization for Variables Selection Optimization in Taguchi's T-Method
    Harudin, N.
    Jamaludin, K. R.
    Ramlie, F.
    Muhtazaruddin, M. N.
    Razali, Che Munira Che
    Muhamad, W. Z. A. W.
    [J]. MATEMATIKA, 2020, 36 (01) : 69 - 84
  • [8] Effects of Feature Selection Using Binary Particle Swarm Optimization on Wheat Variety Classification
    Babalik, Ahmet
    Baykan, Omer Kaan
    Iscan, Hazim
    Babaoglu, Ismail
    Findik, Oguz
    [J]. ADVANCES IN INFORMATION TECHNOLOGY, 2010, 114 : 11 - 17
  • [9] Binary particle swarm optimization in classification
    Cervantes, A
    Galván, I
    Isasi, P
    [J]. NEURAL NETWORK WORLD, 2005, 15 (03) : 229 - 241
  • [10] Feature Selection for Classification Using Particle Swarm Optimization
    Brezocnik, Lucija
    [J]. 17TH IEEE INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES - IEEE EUROCON 2017 CONFERENCE PROCEEDINGS, 2017, : 966 - 971