Weighted L-1-Norm Support Vector Machine for the Classification of Highly Imbalanced Data

被引:1
|
作者
Kim, Eunkyung [1 ]
Jhun, Myoungshic [1 ]
Bang, Sungwan [2 ]
机构
[1] Korea Univ, Dept Stat, Seoul, South Korea
[2] Korea Mil Acad, Dept Math, 574 Hwarang Rd, Seoul 139799, South Korea
基金
新加坡国家研究基金会;
关键词
Imbalanced data; lasso; linear programming; ridge; support vector machine;
D O I
10.5351/KJAS.2015.28.1.009
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The support vector machine has been successfully applied to various classification areas due to its flexibility and a high level of classification accuracy. However, when analyzing imbalanced data with uneven class sizes, the classification accuracy of SVM may drop significantly in predicting minority class because the SVM classifiers are undesirably biased toward the majority class. The weighted L-2-norm SVM was developed for the analysis of imbalanced data; however, it cannot identify irrelevant input variables due to the characteristics of the ridge penalty. Therefore, we propose the weighted L-1-norm SVM, which uses lasso penalty to select important input variables and weights to differentiate the misclassification of data points between classes. We demonstrate the satisfactory performance of the proposed method through simulation studies and a real data analysis.
引用
收藏
页码:9 / 21
页数:13
相关论文
共 50 条
  • [1] A Novel Method for Highly Imbalanced Classification with Weighted Support Vector Machine
    Qi, Biao
    Jiang, Jianguo
    Shi, Zhixin
    Li, Meimei
    Fan, Wei
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT I, 2019, 11775 : 275 - 286
  • [2] Clustering and Weighted Scoring in Geometric Space Support Vector Machine Ensemble for Highly Imbalanced Data Classification
    Ksieniewicz, Pawel
    Burduk, Robert
    [J]. COMPUTATIONAL SCIENCE - ICCS 2020, PT IV, 2020, 12140 : 128 - 140
  • [3] An efficient weighted Lagrangian twin support vector machine for imbalanced data classification
    Shao, Yuan-Hai
    Chen, Wei-Jie
    Zhang, Jing-Jing
    Wang, Zhen
    Deng, Nai-Yang
    [J]. PATTERN RECOGNITION, 2014, 47 (09) : 3158 - 3167
  • [4] Weighted support vector machine for extremely imbalanced data
    Mun, Jongmin
    Bang, Sungwan
    Kim, Jaeoh
    [J]. Computational Statistics and Data Analysis, 2025, 203
  • [5] Weighted support vector machine for data classification
    Yang, XL
    Song, Q
    Cao, AZ
    [J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 859 - 864
  • [6] A weighted support vector machine for data classification
    Yang, Xulei
    Song, Qing
    Wang, Yue
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2007, 21 (05) : 961 - 976
  • [7] Combine Sampling Support Vector Machine for Imbalanced Data Classification
    Sain, Hartayuni
    Purnami, Santi Wulan
    [J]. THIRD INFORMATION SYSTEMS INTERNATIONAL CONFERENCE 2015, 2015, 72 : 59 - 66
  • [8] Fuzzy Support Vector Machine for Microarray Imbalanced Data Classification
    Ladayya, Faroh
    Purnami, Santi Wulan
    Irhamah
    [J]. 13TH IMT-GT INTERNATIONAL CONFERENCE ON MATHEMATICS, STATISTICS AND THEIR APPLICATIONS (ICMSA2017), 2017, 1905
  • [9] Integration of feature vector selection and support vector machine for classification of imbalanced data
    Liu, Jie
    Zio, Enrico
    [J]. APPLIED SOFT COMPUTING, 2019, 75 : 702 - 711
  • [10] Imbalanced data classification algorithm with support vector machine kernel extensions
    Wang, Feng
    Liu, Shaojiang
    Ni, Weichuan
    Xu, Zhiming
    Qiu, Zemin
    Wan, Zhiping
    Pan, Zhihong
    [J]. EVOLUTIONARY INTELLIGENCE, 2019, 12 (03) : 341 - 347