A Novel Method for Highly Imbalanced Classification with Weighted Support Vector Machine

被引:2
|
作者
Qi, Biao [1 ,2 ]
Jiang, Jianguo [1 ,2 ]
Shi, Zhixin [1 ,2 ]
Li, Meimei [1 ,2 ]
Fan, Wei [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Highly imbalanced classification; Undersampling; GWSVM-RU; Information granules; Weighted SVMs;
D O I
10.1007/978-3-030-29551-6_24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In real life, the problem of imbalanced data classification is unavoidable and difficult to solve. Traditional SVMs based classification algorithms usually cannot classify highly imbalanced data accurately, and sampling strategies are widely used to help settle the matter. In this paper, we put forward a novel undersampling method i.e., granular weighted SVMs-repetitive under-sampling (GWSVM-RU) for highly imbalanced classification, which is a weighted SVMs version of the granular SVMs-repetitive undersampling (GSVM-RU) once proposed by Yuchun Tang et al. We complete the undersampling operation by extracting the negative information granules repetitively which are obtained through the naive SVMs algorithm, and then combine the negative and positive granules again to compose the new training data sets. Thus we rebalance the original imbalanced data sets and then build new models by weighted SVMs to predict the testing data set. Besides, we explore four other rebalance heuristic mechanisms including cost-sensitive learning, undersampling, oversampling and GSVM-RU, our approach holds the higher classification performance defined by new evaluation metrics including G-Mean, F-Measure and AUC-ROC. Theories and experiments reveal that our approach outperforms other methods.
引用
收藏
页码:275 / 286
页数:12
相关论文
共 50 条
  • [41] A novel robust twin support vector machine for classification
    Cheng, Haoxiang
    Wang, Jian
    Journal of Computational Information Systems, 2015, 11 (12): : 4421 - 4427
  • [42] Weighted support vector machine for classification with uneven training class sizes
    Huang, YM
    Du, SX
    PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 4365 - 4369
  • [43] A fuzzy classification method based on support vector machine
    He, Q
    Wang, XZ
    Xing, HJ
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 1237 - 1240
  • [44] Study on Classification Method Based on Support Vector Machine
    Men, Hong
    Gao, Yanchun
    Wu, Yujie
    Li, Xiaoying
    PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND COMPUTER SCIENCE, VOL II, 2009, : 369 - 373
  • [45] Combine Vector Quantization and Support Vector Machine for imbalanced datasets
    Yu, Ting
    Debenham, John
    Jan, Tony
    Simoff, Simeon
    ARTIFICIAL INTELLIGENCE IN THEORY AND PRACTICE, 2006, 217 : 81 - +
  • [46] Classification of Coal Bursting Liability Based on Support Vector Machine and Imbalanced Sample Set
    Li, Yuefeng
    Wang, Chao
    Liu, Yv
    MINERALS, 2023, 13 (01)
  • [47] Imbalanced Data Classification using Complementary Fuzzy Support Vector Machine Techniques and SMOTE
    Pruengkarn, Ratchakoon
    Wong, Kok Wai
    Fung, Chun Che
    2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 978 - 983
  • [48] Imbalanced data classification based on scaling kernel-based support vector machine
    Yong Zhang
    Panpan Fu
    Wenzhe Liu
    Guolong Chen
    Neural Computing and Applications, 2014, 25 : 927 - 935
  • [49] Combining Re-sampling with Twin Support Vector Machine for Imbalanced Data Classification
    Cao, Lu
    Shen, Hong
    2016 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT), 2016, : 325 - 329
  • [50] A Novel Weighted Support Vector Machine Based on Particle Swarm Optimization for Gene Selection and Tumor Classification
    Abdi, Mohammad Javad
    Hosseini, Seyed Mohammad
    Rezghi, Mansoor
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2012, 2012