Fuzzy Support Vector Machine for Microarray Imbalanced Data Classification

被引:2
|
作者
Ladayya, Faroh [1 ]
Purnami, Santi Wulan [1 ]
Irhamah [1 ]
机构
[1] Inst Teknol Sepuluh Nopember, Dept Stat, Kampus ITS Sukolilo, Surabaya 60111, Indonesia
关键词
CANCER; PREDICTION;
D O I
10.1063/1.5012168
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
DNA microarrays are data containing gene expression with small sample sizes and high number of features. Furthermore, imbalanced classes is a common problem in microarray data. This occurs when a dataset is dominated by a class which have significantly more instances than the other minority classes. Therefore, it is needed a classification method that solve the problem of high dimensional and imbalanced data. Support Vector Machine (SVM) is one of the classification methods that is capable of handling large or small samples, nonlinear, high dimensional, over learning and local minimum issues. SVM has been widely applied to DNA microarray data classification and it has been shown that SVM provides the best performance among other machine learning methods. However, imbalanced data will be a problem because SVM treats all samples in the same importance thus the results is bias for minority class. To overcome the imbalanced data, Fuzzy SVM (FSVM) is proposed. This method apply a fuzzy membership to each input point and reformulate the SVM such that different input points provide different contributions to the classifier. The minority classes have large fuzzy membership so FSVM can pay more attention to the samples with larger fuzzy membership. Given DNA microarray data is a high dimensional data with a very large number of features, it is necessary to do feature selection first using Fast Correlation based Filter (FCBF). In this study will be analyzed by SVM, FSVM and both methods by applying FCBF and get the classification performance of them. Based on the overall results, FSVM on selected features has the best classification performance compared to SVM.
引用
下载
收藏
页数:10
相关论文
共 50 条
  • [21] Hierarchically penalized support vector machine for the classification of imbalanced data with grouped variables
    Kim, Eunkyung
    Jhun, Myoungshic
    Bang, Sungwan
    KOREAN JOURNAL OF APPLIED STATISTICS, 2016, 29 (05) : 961 - 975
  • [22] Maximum Margin of Twin Spheres Support Vector Machine for Imbalanced Data Classification
    Xu, Yitian
    IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (06) : 1540 - 1550
  • [23] Anomalous Propagation Echo Classification of Imbalanced Radar Data with Support Vector Machine
    Lee, Hansoo
    Kim, Eun Kyeong
    Kim, Sungshin
    ADVANCES IN METEOROLOGY, 2016, 2016
  • [24] A novel twin-support vector machine for binary classification to imbalanced data
    Li, Jingyi
    Chao, Shiwei
    DATA TECHNOLOGIES AND APPLICATIONS, 2023, 57 (03) : 385 - 396
  • [25] Support vector machine for classification based on fuzzy training data
    Ji, Ai-bing
    Pang, Jia-hong
    Qiu, Hong-jie
    EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (04) : 3495 - 3498
  • [26] Support vector machine for classification based on fuzzy training data
    Ji, Ai-Bing
    Pang, Jia-Hong
    Li, Shu-Huan
    Sun, Jian-Ping
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 1609 - +
  • [27] Imbalanced classification using support vector machine ensemble
    Jiang Tian
    Hong Gu
    Wenqi Liu
    Neural Computing and Applications, 2011, 20 : 203 - 209
  • [28] Imbalanced classification using support vector machine ensemble
    Tian, Jiang
    Gu, Hong
    Liu, Wenqi
    NEURAL COMPUTING & APPLICATIONS, 2011, 20 (02): : 203 - 209
  • [29] Multiclass Imbalanced Classification Using Fuzzy C-Mean and SMOTE with Fuzzy Support Vector Machine
    Pruengkarn, Ratchakoon
    Wong, Kok Wai
    Fung, Chun Che
    NEURAL INFORMATION PROCESSING, ICONIP 2017, PT V, 2017, 10638 : 67 - 75
  • [30] Support Vector Machine for Imbalanced Microarray Dataset Classification Using Ant Colony Optimization and Genetic Algorithm
    Nurlaily, Diana
    Irhamah
    Purnami, Santi Wulan
    Kuswanto, Heri
    2ND INTERNATIONAL CONFERENCE ON SCIENCE, MATHEMATICS, ENVIRONMENT, AND EDUCATION, 2019, 2019, 2194