Fuzzy Support Vector Machine for Microarray Imbalanced Data Classification

被引:2
|
作者
Ladayya, Faroh [1 ]
Purnami, Santi Wulan [1 ]
Irhamah [1 ]
机构
[1] Inst Teknol Sepuluh Nopember, Dept Stat, Kampus ITS Sukolilo, Surabaya 60111, Indonesia
关键词
CANCER; PREDICTION;
D O I
10.1063/1.5012168
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
DNA microarrays are data containing gene expression with small sample sizes and high number of features. Furthermore, imbalanced classes is a common problem in microarray data. This occurs when a dataset is dominated by a class which have significantly more instances than the other minority classes. Therefore, it is needed a classification method that solve the problem of high dimensional and imbalanced data. Support Vector Machine (SVM) is one of the classification methods that is capable of handling large or small samples, nonlinear, high dimensional, over learning and local minimum issues. SVM has been widely applied to DNA microarray data classification and it has been shown that SVM provides the best performance among other machine learning methods. However, imbalanced data will be a problem because SVM treats all samples in the same importance thus the results is bias for minority class. To overcome the imbalanced data, Fuzzy SVM (FSVM) is proposed. This method apply a fuzzy membership to each input point and reformulate the SVM such that different input points provide different contributions to the classifier. The minority classes have large fuzzy membership so FSVM can pay more attention to the samples with larger fuzzy membership. Given DNA microarray data is a high dimensional data with a very large number of features, it is necessary to do feature selection first using Fast Correlation based Filter (FCBF). In this study will be analyzed by SVM, FSVM and both methods by applying FCBF and get the classification performance of them. Based on the overall results, FSVM on selected features has the best classification performance compared to SVM.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Dense fuzzy support vector machine to binary classification for imbalanced data
    Wang, Qingling
    Zheng, Jian
    Zhang, Wenjing
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 9643 - 9653
  • [2] Imbalanced Data Classification using Complementary Fuzzy Support Vector Machine Techniques and SMOTE
    Pruengkarn, Ratchakoon
    Wong, Kok Wai
    Fung, Chun Che
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 978 - 983
  • [3] Fuzzy support vector machine for imbalanced data with borderline noise
    Liu, Jie
    [J]. Fuzzy Sets and Systems, 2021, 413 : 64 - 73
  • [4] Fuzzy support vector machine for imbalanced data with borderline noise
    Liu, Jie
    [J]. FUZZY SETS AND SYSTEMS, 2021, 413 : 64 - 73
  • [5] Combine Sampling Support Vector Machine for Imbalanced Data Classification
    Sain, Hartayuni
    Purnami, Santi Wulan
    [J]. THIRD INFORMATION SYSTEMS INTERNATIONAL CONFERENCE 2015, 2015, 72 : 59 - 66
  • [6] Integration of feature vector selection and support vector machine for classification of imbalanced data
    Liu, Jie
    Zio, Enrico
    [J]. APPLIED SOFT COMPUTING, 2019, 75 : 702 - 711
  • [7] Fuzzy Support Vector Machine with Imbalanced regulator and its Application in stroke Classification
    Zhang, Xueying
    Wei, Xin
    Li, Fenglian
    Hu, Fengyun
    Jia, Wenhui
    Wang, Chao
    [J]. 2019 IEEE FIFTH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (IEEE BIGDATASERVICE 2019), 2019, : 290 - 295
  • [8] Deep Learning-Based Imbalanced Classification With Fuzzy Support Vector Machine
    Wang, Ke-Fan
    An, Jing
    Wei, Zhen
    Cui, Can
    Ma, Xiang-Hua
    Ma, Chao
    Bao, Han-Qiu
    [J]. FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 9
  • [9] Imbalanced data classification algorithm with support vector machine kernel extensions
    Wang, Feng
    Liu, Shaojiang
    Ni, Weichuan
    Xu, Zhiming
    Qiu, Zemin
    Wan, Zhiping
    Pan, Zhihong
    [J]. EVOLUTIONARY INTELLIGENCE, 2019, 12 (03) : 341 - 347
  • [10] AN OPTIMIZED SUPPORT VECTOR MACHINE WITH GENETIC ALGORITHM FOR IMBALANCED DATA CLASSIFICATION
    Shamsudin, Haziqah
    Yusof, Umi Kalsom
    Haijie, Yan
    Isa, Iza Sazanita
    [J]. JURNAL TEKNOLOGI-SCIENCES & ENGINEERING, 2023, 85 (04): : 67 - 74