Feature clustering based support vector machine recursive feature elimination for gene selection

被引:94
|
作者
Huang, Xiaojuan [1 ,2 ]
Zhang, Li [1 ,2 ]
Wang, Bangjun [1 ,2 ]
Li, Fanzhang [1 ,2 ]
Zhang, Zhao [1 ,2 ]
机构
[1] Soochow Univ Suzhou, Sch Comp Sci & Technol, Suzhou, Peoples R China
[2] Soochow Univ Suzhou, Joint Int Res Lab Machine Learning & Neuromorph C, Suzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Support vector machine; Feature selection; Gene clustering; Recursive feature elimination; Gene relevancy; Gene redundancy; SVM-RFE; CANCER CLASSIFICATION; EXPRESSION DATA; DISCOVERY; RELEVANCE; FILTER;
D O I
10.1007/s10489-017-0992-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In a DNA microarray dataset, gene expression data often has a huge number of features(which are referred to as genes) versus a small size of samples. With the development of DNA microarray technology, the number of dimensions increases even faster than before, which could lead to the problem of the curse of dimensionality. To get good classification performance, it is necessary to preprocess the gene expression data. Support vector machine recursive feature elimination (SVM-RFE) is a classical method for gene selection. However, SVM-RFE suffers from high computational complexity. To remedy it, this paper enhances SVM-RFE for gene selection by incorporating feature clustering, called feature clustering SVM-RFE (FCSVM-RFE). The proposed method first performs gene selection roughly and then ranks the selected genes. First, a clustering algorithm is used to cluster genes into gene groups, in each which genes have similar expression profile. Then, a representative gene is found to represent a gene group. By doing so, we can obtain a representative gene set. Then, SVM-RFE is applied to rank these representative genes. FCSVM-RFE can reduce the computational complexity and the redundancy among genes. Experiments on seven public gene expression datasets show that FCSVM-RFE can achieve a better classification performance and lower computational complexity when compared with the state-the-art-of methods, such as SVM-RFE.
引用
收藏
页码:594 / 607
页数:14
相关论文
共 50 条
  • [41] Reseach on Feature Selection Algorithm Based on the margin of Support Vector Machine
    Hu, Linfang
    Qiao, Lei
    Huang, Minde
    [J]. MEASUREMENT TECHNOLOGY AND ENGINEERING RESEARCHES IN INDUSTRY, PTS 1-3, 2013, 333-335 : 1430 - 1434
  • [42] Discriminative analysis of schizophrenia using support vector machine and recursive feature elimination on structural MRI images
    Lu, Xiaobing
    Yang, Yongzhe
    Wu, Fengchun
    Gao, Minjian
    Xu, Yong
    Zhang, Yue
    Yao, Yongcheng
    Du, Xin
    Li, Chengwei
    Wu, Lei
    Zhong, Xiaomei
    Zhou, Yanling
    Fan, Ni
    Zheng, Yingjun
    Xiong, Dongsheng
    Peng, Hongjun
    Escudero, Javier
    Huang, Biao
    Li, Xiaobo
    Ning, Yuping
    Wu, Kai
    [J]. MEDICINE, 2016, 95 (30)
  • [43] Group feature selection with multiclass support vector machine
    Tang, Fengzhen
    Adam, Lukas
    Si, Bailu
    [J]. NEUROCOMPUTING, 2018, 317 : 42 - 49
  • [44] Large Margin Feature Selection for Support Vector Machine
    Pan, Wei
    Ma, Peijun
    Su, Xiaohong
    [J]. MECHANICAL ENGINEERING, MATERIALS SCIENCE AND CIVIL ENGINEERING, 2013, 274 : 161 - 164
  • [45] Optimal Feature Selection for Support Vector Machine Classifiers
    Strub, O.
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEE IEEM), 2020, : 304 - 308
  • [46] Support Vector Machine with feature selection: A multiobjective approach
    Alcaraz, Javier
    Labbe, Martine
    Landete, Mercedes
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 204
  • [47] Research and Experiment of Radar Signal Support Vector Clustering Sorting Based on Feature Extraction and Feature Selection
    Wang, Shiqiang
    Gao, Caiyun
    Zhang, Qin
    Dakulagi, Veerendra
    Zeng, Huiyong
    Zheng, Guimei
    Bai, Juan
    Song, Yuwei
    Cai, Jiliang
    Zong, Binfeng
    [J]. IEEE ACCESS, 2020, 8 : 93322 - 93334
  • [48] An Enhanced Hybrid Feature Selection Technique Using Term Frequency-Inverse Document Frequency and Support Vector Machine-Recursive Feature Elimination for Sentiment Classification
    Nafis, Nur Syafiqah Mohd
    Awang, Suryanti
    [J]. IEEE ACCESS, 2021, 9 : 52177 - 52192
  • [49] Customer Churn Prediction Based on Feature Clustering and Nonparallel Support Vector Machine
    Zhao, Xi
    Shi, Yong
    Lee, Jongwon
    Kim, Heung Kee
    Lee, Heeseok
    [J]. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2014, 13 (05) : 1013 - 1027
  • [50] Combining recursive feature elimination and support vector machines for stock price forecasting
    Li, Menggang
    Qiu, Yi
    Zhang, Zuoquan
    [J]. Journal of Computational Information Systems, 2013, 9 (16): : 6519 - 6526