New Gene Selection Method Using Gene Expression Programing Approach on Microarray Data Sets

被引:6
|
作者
Alanni, Russul [1 ]
Hou, Jingyu [1 ]
Azzawi, Hasseeb [1 ]
Xiang, Yong [1 ]
机构
[1] Deakin Univ, Sch Informat Technol, Geelong, Vic, Australia
关键词
Feature selection; Gain ratio (GR); Gene expression programming (GEP); Support vector machine (SVM); PARTICLE SWARM OPTIMIZATION; MOLECULAR CLASSIFICATION; CANCER; PREDICTION; CARCINOMAS;
D O I
10.1007/978-3-319-98693-7_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection in machine learning and data mining facilitates the optimization of accuracy attained from the classifier with smallest number of features. The use of feature selection in microarray data mining is quite promising. However, usually it is hard to identify and select the feature genes from microarray data sets because multi-class categories and high dimensionality features exist in microarray data with a small-sized sample. Therefore, using good selection approaches to eliminate incomprehensibility and optimize prediction accuracy is becoming necessary, because it will help obtain genes that are relevant to sample classification when investigating large number of genes. In his paper, we propose a new feature selection method for microarray data sets. The method consists of the Gain Ratio (GR) and Improved Gene Expression Programming (IGEP) algorithms which are for gene filtering and feature selection respectively. Support Vector Machine (SVM) alongside with leave-one-out cross-validation (LOOCV) method was used to evaluate the proposed method on eight microarray datasets captured in the literature. The experimental results showed the effectiveness of the proposed method in selecting small number of features while generating higher classification accuracies compared with other existing feature selection approaches.
引用
收藏
页码:17 / 31
页数:15
相关论文
共 50 条
  • [11] The Impact of Gene Selection on Imbalanced Microarray Expression Data
    Kamal, Abu H. M.
    Zhu, Xingquan
    Pandya, Abhijit S.
    Hsu, Sam
    Shoaib, Muhammad
    [J]. BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, PROCEEDINGS, 2009, 5462 : 259 - 269
  • [12] Artificial neural network classification of microarray data using new hybrid gene selection method
    Aziz, Rabia
    Verma, C. K.
    Jha, Manoj
    Srivastava, Namita
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2017, 17 (01) : 42 - 65
  • [13] An Ensemble Approach for Gene Selection in Gene Expression Data
    Castellanos-Garzon, Jose A.
    Ramos, Juan
    Lopez-Sanchez, Daniel
    de Paz, Juan F.
    [J]. 11TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS, 2017, 616 : 237 - 247
  • [14] A Novel Hybrid Method for Gene Selection of Microarray Data
    Liao, Bo
    Cao, Tao
    Lu, Xinguo
    Zhu, Wen
    [J]. JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2012, 9 (01) : 5 - 9
  • [15] A Gene Selection Method for Microarray Data Based on Sampling
    Leu, Yungho
    Lee, Chien-Pang
    Tsai, Hui-Yi
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, PT II, 2010, 6422 : 68 - 74
  • [16] A Novel Hybrid Method for Gene Selection of Microarray Data
    Wu, Ronghui
    Liu, Yun
    Li, Renfa
    Cao, Tao
    Yue, Guangxue
    [J]. JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2011, 8 (07) : 1162 - 1165
  • [17] Efficient gene selection with rough sets from gene expression data
    Sun, Lijun
    Miao, Duoqian
    Zhang, Hongyun
    [J]. ROUGH SETS AND KNOWLEDGE TECHNOLOGY, 2008, 5009 : 164 - +
  • [18] A feature selection method using fixed-point algorithm for DNA microarray gene expression data
    Sharma, Alok
    Paliwal, Kuldip K.
    Imoto, Seiya
    Miyano, Satoru
    Sharma, Vandana
    Ananthanarayanan, Rajeshkannan
    [J]. INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2014, 18 (01) : 55 - 59
  • [19] Feature Selection in Microarray Gene Expression Data Using Fisher Discriminant Ratio
    Sarbazi-Azad, Saeed
    Abadeh, Mohammad Saniee
    Abadi, Mehdi Irannejad Najaf
    [J]. 2018 8TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2018, : 225 - 230
  • [20] An Agent-Based Clustering Approach for Gene Selection in Gene Expression Microarray
    Ramos, Juan
    Castellanos-Garzon, Jose A.
    Gonzalez-Briones, Alfonso
    de Paz, Juan F.
    Corchado, Juan M.
    [J]. INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2017, 9 (01) : 1 - 13