Feature clustering based support vector machine recursive feature elimination for gene selection

Cited by: 95
Authors
Huang, Xiaojuan [1 ,2 ]
Zhang, Li [1 ,2 ]
Wang, Bangjun [1 ,2 ]
Li, Fanzhang [1 ,2 ]
Zhang, Zhao [1 ,2 ]
Affiliations
[1] Soochow Univ Suzhou, Sch Comp Sci & Technol, Suzhou, Peoples R China
[2] Soochow Univ Suzhou, Joint Int Res Lab Machine Learning & Neuromorph C, Suzhou, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Support vector machine; Feature selection; Gene clustering; Recursive feature elimination; Gene relevancy; Gene redundancy; SVM-RFE; CANCER CLASSIFICATION; EXPRESSION DATA; DISCOVERY; RELEVANCE; FILTER;
DOI
10.1007/s10489-017-0992-2
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In a DNA microarray dataset, gene expression data often has a huge number of features (referred to as genes) but only a small number of samples. With the development of DNA microarray technology, the number of dimensions is growing even faster than before, which can lead to the curse of dimensionality. To obtain good classification performance, it is necessary to preprocess the gene expression data. Support vector machine recursive feature elimination (SVM-RFE) is a classical method for gene selection. However, SVM-RFE suffers from high computational complexity. To remedy this, the paper enhances SVM-RFE for gene selection by incorporating feature clustering; the resulting method is called feature clustering SVM-RFE (FCSVM-RFE). The proposed method first performs a rough gene selection and then ranks the selected genes. First, a clustering algorithm groups the genes into clusters, within each of which the genes have similar expression profiles. Then, a representative gene is chosen for each gene group, which yields a representative gene set. Finally, SVM-RFE is applied to rank these representative genes. FCSVM-RFE can reduce both the computational complexity and the redundancy among genes. Experiments on seven public gene expression datasets show that FCSVM-RFE achieves better classification performance and lower computational complexity than state-of-the-art methods such as SVM-RFE.
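The three stages described in the abstract (cluster genes, pick one representative per cluster, rank the representatives with SVM-RFE) can be illustrated with a minimal sketch. The abstract does not say which clustering algorithm is used or how a representative gene is chosen, so the choices here are assumptions made only for illustration: k-means clustering over gene profiles, and the gene closest to each cluster centroid as the representative. The function name `fcsvm_rfe_sketch`, its parameters, and the synthetic data are hypothetical and not taken from the paper.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.feature_selection import RFE
from sklearn.svm import SVC


def fcsvm_rfe_sketch(X, y, n_clusters=10, seed=0):
    """Return representative gene indices, ordered from most to least important.

    X: (n_samples, n_genes) expression matrix; y: class labels.
    """
    # Stage 1: cluster genes. Each gene (column of X) is one point to cluster.
    genes = X.T
    km = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed).fit(genes)

    # Stage 2 (assumption): the representative of a cluster is the gene
    # closest to the cluster centroid.
    representatives = []
    for c in range(n_clusters):
        members = np.flatnonzero(km.labels_ == c)
        if members.size == 0:
            continue
        dists = np.linalg.norm(genes[members] - km.cluster_centers_[c], axis=1)
        representatives.append(members[np.argmin(dists)])
    representatives = np.array(sorted(representatives))

    # Stage 3: rank the representatives with linear SVM-RFE,
    # eliminating one gene per iteration.
    rfe = RFE(SVC(kernel="linear", C=1.0), n_features_to_select=1, step=1)
    rfe.fit(X[:, representatives], y)
    order = np.argsort(rfe.ranking_)  # rank 1 is the most important gene
    return representatives[order]


# Synthetic data for illustration only: 40 samples, 200 genes, 2 classes.
rng = np.random.default_rng(0)
X = rng.normal(size=(40, 200))
y = np.repeat([0, 1], 20)
X[y == 1, :5] += 2.0  # make the first five genes informative
ranked = fcsvm_rfe_sketch(X, y, n_clusters=10)
print(ranked)
```

In this sketch SVM-RFE runs on at most `n_clusters` genes instead of all 200, which is where the reduction in computation described in the abstract would come from.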
Pages: 594 - 607
Page count: 14
Related papers
50 in total
  • [1] Feature clustering based support vector machine recursive feature elimination for gene selection
    Xiaojuan Huang
    Li Zhang
    Bangjun Wang
    Fanzhang Li
    Zhao Zhang
    [J]. Applied Intelligence, 2018, 48 : 594 - 607
  • [2] A hybrid feature selection method combining Gini index and support vector machine with recursive feature elimination for gene expression classification
    Almutiri, Talal
    Saeed, Faisal
    [J]. INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2022, 14 (01) : 41 - 62
  • [3] Hybrid adapted fast correlation FCBF-support vector machine recursive feature elimination for feature selection
    Djellali, Hayet
    Ghoualmi-Zine, Nacira
    Guessoum, Souad
    [J]. INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2020, 14 (03) : 269 - 279
  • [4] A support vector machine-recursive feature elimination feature selection method based on artificial contrast variables and mutual information
    Lin, Xiaohui
    Yang, Fufang
    Zhou, Lina
    Yin, Peiyuan
    Kong, Hongwei
    Xing, Wenbin
    Lu, Xin
    Jia, Lewen
    Wang, Quancai
    Xu, Guowang
    [J]. JOURNAL OF CHROMATOGRAPHY B-ANALYTICAL TECHNOLOGIES IN THE BIOMEDICAL AND LIFE SCIENCES, 2012, 910 : 149 - 155
  • [5] Gene selection using Gaussian kernel support vector machine based recursive feature elimination with adaptive kernel width strategy
    Mao, Yong
    Zhou, Xiaobo
    Yin, Zheng
    Pi, Daoying
    Sun, Youxian
    Wong, Stephen T. C.
    [J]. ROUGH SETS AND KNOWLEDGE TECHNOLOGY, PROCEEDINGS, 2006, 4062 : 799 - 806
  • [6] Recursive Feature Elimination and Least Square Support Vector Machine Approaches to Operator Functional State Feature Selection and Classification
    Yin Zhong
    Zhang Jianhua
    Xia Jiajun
    [J]. PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 3662 - 3667
  • [7] Accelerated recursive feature elimination based on support vector machine for key variable identification
    Mao, Y
    Pi, DY
    Liu, YM
    Sun, YX
    [J]. CHINESE JOURNAL OF CHEMICAL ENGINEERING, 2006, 14 (01) : 65 - 72
  • [8] Accelerated Recursive Feature Elimination Based on Support Vector Machine for Key Variable Identification
    毛勇
    皮道映
    刘育明
    孙优贤
    [J]. Chinese Journal of Chemical Engineering, 2006, (01) : 65 - 72
  • [9] Face spoofing detection based on color texture Markov feature and support vector machine recursive feature elimination
    Zhang, Le-Bing
    Peng, Fei
    Qin, Le
    Long, Min
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 51 : 56 - 69
  • [10] Fast Gaussian kernel support vector machine recursive feature elimination algorithm
    Li Zhang
    Xiaohan Zheng
    Qingqing Pang
    Weida Zhou
    [J]. Applied Intelligence, 2021, 51 : 9001 - 9014