Correlated Based SVM-RFE as Feature Selection for Cancer Classification Using Microarray Databases

Cited by: 4
Authors
Rustam, Z. [1 ]
Maghfirah, N. [1 ]
Affiliation
[1] Univ Indonesia, Fac Math & Nat Sci FMIPA, Dept Math, Depok 16424, Indonesia
Keywords
SVM; KFCM; Polynomial Kernel;
DOI
10.1063/1.5064232
Chinese Library Classification
O29 [Applied Mathematics];
Discipline classification code
070104 ;
Abstract
Much research on cancer dataset classification has been carried out with the aim of reducing the number of deaths caused by cancer. Cancer microarray datasets contain a large number of features, and using all of them is costly in time, expense, and memory. It is therefore necessary to reduce the number of features using feature selection. The feature selection method should not only eliminate irrelevant features but also account for correlated genes, because ignoring correlated genes leads to the loss of important information about the cancer itself. To show that feature selection gives higher accuracy, this research compares classification accuracy on the datasets with and without feature selection. CSVM-RFE is used as the feature selection method. For classification, SVM and KFCM are used with two kernel types: a Gaussian RBF kernel with sigma = 0.05 and a polynomial kernel with degree = 3. These methods are applied to three different cancer datasets. The highest accuracy on the colon cancer dataset is 98.6% using SVM with the RBF kernel. The highest accuracy on the prostate cancer dataset is 99.2% using SVM with the polynomial kernel, and the highest accuracy on the lymphoma dataset is 99.1% using SVM with the RBF kernel.
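The comparison described in the abstract (classification with and without feature selection, under an RBF and a degree-3 polynomial kernel) can be sketched as follows. This is a minimal illustration under several assumptions, not the authors' implementation: it uses plain SVM-RFE from scikit-learn rather than CSVM-RFE, it covers only the SVM classifier and not KFCM, the data are synthetic rather than the colon, prostate, or lymphoma datasets, and the mapping gamma = 1 / (2 * sigma^2) assumes the kernel is parametrized as exp(-||x - z||^2 / (2 * sigma^2)). The number of retained features (`n_keep`) and the helper names are hypothetical.

```python
from sklearn.datasets import make_classification
from sklearn.feature_selection import RFE
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for a microarray dataset: few samples, many features,
# some of them redundant (linear combinations of the informative ones).
X, y = make_classification(n_samples=60, n_features=500, n_informative=10,
                           n_redundant=20, random_state=0)

# Assumed kernel parametrization: K(x, z) = exp(-||x - z||^2 / (2 * sigma^2)).
sigma = 0.05
gamma = 1.0 / (2.0 * sigma ** 2)


def make_classifiers():
    """Return fresh SVMs with the two kernel settings named in the abstract."""
    return {
        "rbf": SVC(kernel="rbf", gamma=gamma),
        "poly": SVC(kernel="poly", degree=3),
    }


def evaluate(select_features, n_keep=20):
    """Mean 5-fold accuracy per kernel, with or without SVM-RFE."""
    scores = {}
    for name, clf in make_classifiers().items():
        steps = [StandardScaler()]
        if select_features:
            # Plain SVM-RFE: rank features by a linear SVM's weights and
            # drop 10% of the original feature count per iteration.
            steps.append(RFE(SVC(kernel="linear"),
                             n_features_to_select=n_keep, step=0.1))
        steps.append(clf)
        pipe = make_pipeline(*steps)
        scores[name] = cross_val_score(pipe, X, y, cv=5).mean()
    return scores


results = {"all features": evaluate(False), "SVM-RFE": evaluate(True)}
for setting, per_kernel in results.items():
    print(setting, per_kernel)
```

Feature selection is placed inside the pipeline so that it is refitted on each training fold; selecting features on the full dataset before cross-validation would leak test information into the accuracy estimate.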
Pages: 7