A Novel Method for Highly Imbalanced Classification with Weighted Support Vector Machine

被引:2
|
作者
Qi, Biao [1 ,2 ]
Jiang, Jianguo [1 ,2 ]
Shi, Zhixin [1 ,2 ]
Li, Meimei [1 ,2 ]
Fan, Wei [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Highly imbalanced classification; Undersampling; GWSVM-RU; Information granules; Weighted SVMs;
D O I
10.1007/978-3-030-29551-6_24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In real life, the problem of imbalanced data classification is unavoidable and difficult to solve. Traditional SVMs based classification algorithms usually cannot classify highly imbalanced data accurately, and sampling strategies are widely used to help settle the matter. In this paper, we put forward a novel undersampling method i.e., granular weighted SVMs-repetitive under-sampling (GWSVM-RU) for highly imbalanced classification, which is a weighted SVMs version of the granular SVMs-repetitive undersampling (GSVM-RU) once proposed by Yuchun Tang et al. We complete the undersampling operation by extracting the negative information granules repetitively which are obtained through the naive SVMs algorithm, and then combine the negative and positive granules again to compose the new training data sets. Thus we rebalance the original imbalanced data sets and then build new models by weighted SVMs to predict the testing data set. Besides, we explore four other rebalance heuristic mechanisms including cost-sensitive learning, undersampling, oversampling and GSVM-RU, our approach holds the higher classification performance defined by new evaluation metrics including G-Mean, F-Measure and AUC-ROC. Theories and experiments reveal that our approach outperforms other methods.
引用
收藏
页码:275 / 286
页数:12
相关论文
共 50 条
  • [31] Increasing Minority Recall Support Vector Machine Model for Imbalanced Data Classification
    Wu, Chunye
    Wang, Nan
    Wang, Yu
    DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2021, 2021
  • [32] Deep Learning-Based Imbalanced Classification With Fuzzy Support Vector Machine
    Wang, Ke-Fan
    An, Jing
    Wei, Zhen
    Cui, Can
    Ma, Xiang-Hua
    Ma, Chao
    Bao, Han-Qiu
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 9
  • [33] An Adaptive Pre-clustering Support Vector Machine for Binary Imbalanced Classification
    Di, Zonglin
    Yao, Siya
    Kang, Qi
    Zhou, Mengchu
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 681 - 686
  • [34] Fuzzy Support Vector Machine with Imbalanced regulator and its Application in stroke Classification
    Zhang, Xueying
    Wei, Xin
    Li, Fenglian
    Hu, Fengyun
    Jia, Wenhui
    Wang, Chao
    2019 IEEE FIFTH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (IEEE BIGDATASERVICE 2019), 2019, : 290 - 295
  • [35] Hierarchically penalized support vector machine for the classification of imbalanced data with grouped variables
    Kim, Eunkyung
    Jhun, Myoungshic
    Bang, Sungwan
    KOREAN JOURNAL OF APPLIED STATISTICS, 2016, 29 (05) : 961 - 975
  • [36] Maximum Margin of Twin Spheres Support Vector Machine for Imbalanced Data Classification
    Xu, Yitian
    IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (06) : 1540 - 1550
  • [37] Anomalous Propagation Echo Classification of Imbalanced Radar Data with Support Vector Machine
    Lee, Hansoo
    Kim, Eun Kyeong
    Kim, Sungshin
    ADVANCES IN METEOROLOGY, 2016, 2016
  • [38] An Effective and Novel Weighted Support Vector Machine Method for Control Chart Pattern Recognition
    Chen, Jianping
    Xia, Beixin
    Chen, Xin
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND ENGINEERING APPLICATIONS, 2016, 63 : 140 - 142
  • [39] A Segmented Local Offset Method for Imbalanced Data Classification Using Quasi-Linear Support Vector Machine
    Liang, Peifeng
    Yuan, Xin
    Li, Weite
    Hu, Jinglu
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 746 - 751
  • [40] Support Vector Machine Failure in Imbalanced Datasets
    Illan, I. A.
    Gorriz, J. M.
    Ramirez, J.
    Martinez-Murcia, F. J.
    Castillo-Barnes, D.
    Segovia, F.
    Salas-Gonzalez, D.
    UNDERSTANDING THE BRAIN FUNCTION AND EMOTIONS, PT I, 2019, 11486 : 412 - 419