Feature clustering based support vector machine recursive feature elimination for gene selection

被引:94
|
作者
Huang, Xiaojuan [1 ,2 ]
Zhang, Li [1 ,2 ]
Wang, Bangjun [1 ,2 ]
Li, Fanzhang [1 ,2 ]
Zhang, Zhao [1 ,2 ]
机构
[1] Soochow Univ Suzhou, Sch Comp Sci & Technol, Suzhou, Peoples R China
[2] Soochow Univ Suzhou, Joint Int Res Lab Machine Learning & Neuromorph C, Suzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Support vector machine; Feature selection; Gene clustering; Recursive feature elimination; Gene relevancy; Gene redundancy; SVM-RFE; CANCER CLASSIFICATION; EXPRESSION DATA; DISCOVERY; RELEVANCE; FILTER;
D O I
10.1007/s10489-017-0992-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In a DNA microarray dataset, gene expression data often has a huge number of features(which are referred to as genes) versus a small size of samples. With the development of DNA microarray technology, the number of dimensions increases even faster than before, which could lead to the problem of the curse of dimensionality. To get good classification performance, it is necessary to preprocess the gene expression data. Support vector machine recursive feature elimination (SVM-RFE) is a classical method for gene selection. However, SVM-RFE suffers from high computational complexity. To remedy it, this paper enhances SVM-RFE for gene selection by incorporating feature clustering, called feature clustering SVM-RFE (FCSVM-RFE). The proposed method first performs gene selection roughly and then ranks the selected genes. First, a clustering algorithm is used to cluster genes into gene groups, in each which genes have similar expression profile. Then, a representative gene is found to represent a gene group. By doing so, we can obtain a representative gene set. Then, SVM-RFE is applied to rank these representative genes. FCSVM-RFE can reduce the computational complexity and the redundancy among genes. Experiments on seven public gene expression datasets show that FCSVM-RFE can achieve a better classification performance and lower computational complexity when compared with the state-the-art-of methods, such as SVM-RFE.
引用
收藏
页码:594 / 607
页数:14
相关论文
共 50 条
  • [31] Hybrid-Recursive Feature Elimination for Efficient Feature Selection
    Jeon, Hyelynn
    Oh, Sejong
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (09):
  • [32] Stochastic Feature Selection in Support Vector Machine Based Instrument Recognition
    Kramer, Oliver
    Hein, Tobias
    [J]. KI 2009: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, 5803 : 727 - 734
  • [33] Online chatter detection of the end milling based on wavelet packet transform and support vector machine recursive feature elimination
    Chen, G. S.
    Zheng, Q. Z.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2018, 95 (1-4): : 775 - 784
  • [34] Online chatter detection of the end milling based on wavelet packet transform and support vector machine recursive feature elimination
    G. S. Chen
    Q. Z. Zheng
    [J]. The International Journal of Advanced Manufacturing Technology, 2018, 95 : 775 - 784
  • [35] Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier
    Yang, Xiaoqiang
    [J]. MOLECULAR MEDICINE REPORTS, 2018, 17 (01) : 1555 - 1560
  • [36] The research on the method of feature selection in support vector Machine based Entropy
    Zhu, Xiaoyan
    Tian, Xi
    Zhu, Xiaoxun
    [J]. PROGRESS IN POWER AND ELECTRICAL ENGINEERING, PTS 1 AND 2, 2012, 354-355 : 1192 - +
  • [37] A novel feature selection method based on quantum support vector machine
    Wang, Haiyan
    [J]. PHYSICA SCRIPTA, 2024, 99 (05)
  • [38] Feature Selection Method Based on Mutual Information and Support Vector Machine
    Liu, Gang
    Yang, Chunlei
    Liu, Sen
    Xiao, Chunbao
    Song, Bin
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (06)
  • [39] INTRUSION DETECTION SYSTEM BASED ON FEATURE SELECTION AND SUPPORT VECTOR MACHINE
    Zhang Xue-qin
    Gu Chun-hua
    Lin Jia-jun
    [J]. 2006 FIRST INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND NETWORKING IN CHINA, 2006,
  • [40] Support vector machine for intrusion detection based on LSI feature selection
    Yang, Qing
    Li, Fangmin
    [J]. WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 4113 - +