Incremental forward feature selection with application to microarray gene expression data

被引:12
|
作者
Lee, Yuh-Jye [1 ]
Chang, Chien-Chung [1 ]
Chao, Chia-Huang [1 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Comp Sci & Informat Engn, Taipei 106, Taiwan
关键词
1-norm support vector machine; filter model; incremental forward feature selection; weight score; wrapper model;
D O I
10.1080/10543400802277868
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
In this study, the authors propose a new feature selection scheme, the incremental forward feature selection, which is inspired by incremental reduced support vector machines. In their method, a new feature is added into the current selected feature subset if it will bring in the most extra information. This information is measured by using the distance between the new feature vector and the column space spanned by current feature subset. The incremental forward feature selection scheme can exclude highly linear correlated features that provide redundant information and might degrade the efficiency of learning algorithms. The method is compared with the weight score approach and the 1-norm support vector machine on two well-known microarray gene expression data sets, the acute leukemia and colon cancer data sets. These two data sets have a very few observations but huge number of genes. The linear smooth support vector machine was applied to the feature subsets selected by these three schemes respectively and obtained a slightly better classification results in the 1-norm support vector machine and incremental forward feature selection. Finally, the authors claim that the rest of genes still contain some useful information. The previous selected features are iteratively removed from the data sets and the feature selection and classification steps are repeated for four rounds. The results show that there are many distinct feature subsets that can provide enough information for classification tasks in these two microarray gene expression data sets.
引用
收藏
页码:827 / 840
页数:14
相关论文
共 50 条
  • [1] A hybrid feature selection approach for microarray gene expression data
    Tan, Feng
    Fu, Xuezheng
    Wang, Hao
    Zhang, Yanqing
    Bourgeois, Anu
    [J]. COMPUTATIONAL SCIENCE - ICCS 2006, PT 2, PROCEEDINGS, 2006, 3992 : 678 - 685
  • [2] Quality of feature selection based on microarray gene expression data
    Maciejewski, Henryk
    [J]. COMPUTATIONAL SCIENCE - ICCS 2008, PT 3, 2008, 5103 : 140 - 147
  • [3] Gene ontology driven feature selection from microarray gene expression data
    Qi, Jianlong
    Tang, Jian
    [J]. PROCEEDINGS OF THE 2006 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2006, : 428 - +
  • [4] Minimum redundancy feature selection from microarray gene expression data
    Ding, C
    Peng, HC
    [J]. PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE, 2003, : 523 - 528
  • [5] Feature selection methods in microarray gene expression data: a systematic mapping study
    Mahnaz Vahmiyan
    Mohammadtaghi Kheirabadi
    Ebrahim Akbari
    [J]. Neural Computing and Applications, 2022, 34 : 19675 - 19702
  • [6] Feature selection methods in microarray gene expression data: a systematic mapping study
    Vahmiyan, Mahnaz
    Kheirabadi, Mohammadtaghi
    Akbari, Ebrahim
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (22): : 19675 - 19702
  • [7] A Top-r Feature Selection Algorithm for Microarray Gene Expression Data
    Sharma, Alok
    Imoto, Seiya
    Miyano, Satoru
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (03) : 754 - 764
  • [8] Combination of Feature Selection Methods for the Effective Classification of Microarray Gene Expression Data
    Sheela, T.
    Rangarajan, Lalitha
    [J]. RECENT TRENDS IN IMAGE PROCESSING AND PATTERN RECOGNITION (RTIP2R 2016), 2017, 709 : 137 - 145
  • [9] Feature Selection in Microarray Gene Expression Data Using Fisher Discriminant Ratio
    Sarbazi-Azad, Saeed
    Abadeh, Mohammad Saniee
    Abadi, Mehdi Irannejad Najaf
    [J]. 2018 8TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2018, : 225 - 230
  • [10] A novel forward gene selection algorithm for microarray data
    Du, Dajun
    Li, Kang
    Li, Xue
    Fei, Minrui
    [J]. NEUROCOMPUTING, 2014, 133 : 446 - 458