Fuzzy Support Vector Machine for Microarray Imbalanced Data Classification

Cited by: 2
Authors
Ladayya, Faroh [1]
Purnami, Santi Wulan [1]
Irhamah [1]
Affiliations
[1] Inst Teknol Sepuluh Nopember, Dept Stat, Kampus ITS Sukolilo, Surabaya 60111, Indonesia
Keywords
CANCER; PREDICTION;
DOI
10.1063/1.5012168
Chinese Library Classification
O29 [Applied Mathematics];
Subject classification code
070104 ;
Abstract
DNA microarray data contain gene expression measurements with small sample sizes and a large number of features. Imbalanced classes are also a common problem in microarray data: one class dominates the dataset, with significantly more instances than the minority classes. A classification method is therefore needed that can handle both high-dimensional and imbalanced data. Support Vector Machine (SVM) is a classification method capable of handling large or small samples, nonlinearity, high dimensionality, overfitting, and local-minimum issues. SVM has been widely applied to DNA microarray data classification, and it has been shown to provide the best performance among machine learning methods. However, imbalanced data are a problem because SVM treats all samples as equally important, so the results are biased against the minority class. To overcome the imbalance, Fuzzy SVM (FSVM) is proposed. This method applies a fuzzy membership to each input point and reformulates the SVM so that different input points contribute differently to the classifier. The minority classes are given large fuzzy memberships, so FSVM pays more attention to the samples with larger fuzzy membership. Because DNA microarray data are high dimensional, with a very large number of features, feature selection is performed first using the Fast Correlation-Based Filter (FCBF). In this study, the data are analyzed with SVM and FSVM, each with and without FCBF, and the classification performance of the methods is compared. Based on the overall results, FSVM on selected features has the best classification performance compared to SVM.
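The idea of weighting each sample's contribution by a fuzzy membership can be illustrated with a minimal sketch. The abstract does not state which membership function the authors used, so the rule below (rarest class gets 1.0, other classes get the ratio of class sizes) is purely an assumption for illustration, as are the helper names `class_fuzzy_memberships` and `fsvm_objective`. The objective follows the commonly cited linear FSVM primal form, in which each slack term is scaled by its membership.

```python
from collections import Counter


def class_fuzzy_memberships(labels):
    """Assign each sample a membership in (0, 1].

    Illustrative assumption: samples of the rarest class get 1.0,
    samples of any larger class get n_rarest / n_class.
    """
    counts = Counter(labels)
    n_min = min(counts.values())
    return [n_min / counts[y] for y in labels]


def fsvm_objective(w, b, X, y, s, C=1.0):
    """Linear FSVM primal objective: 0.5*||w||^2 + C * sum_i s_i * xi_i,
    where xi_i = max(0, 1 - y_i * (w . x_i + b)) is the hinge slack.
    """
    reg = 0.5 * sum(wj * wj for wj in w)
    penalty = 0.0
    for x_i, y_i, s_i in zip(X, y, s):
        margin = y_i * (sum(wj * xj for wj, xj in zip(w, x_i)) + b)
        penalty += s_i * max(0.0, 1.0 - margin)
    return reg + C * penalty


# Toy data: one minority sample (+1) and four majority samples (-1).
labels = [1, -1, -1, -1, -1]
memberships = class_fuzzy_memberships(labels)
X = [[1.0]] * 5
objective_at_zero = fsvm_objective([0.0], 0.0, X, labels, memberships)
```

With all memberships set to 1.0 the objective reduces to the standard soft-margin SVM; lowering the majority-class memberships makes errors on minority samples comparatively more costly.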
Pages: 10