Improving SVM Classification on Imbalanced Datasets by Introducing a New Bias

Cited by: 28
Authors
Nunez, Haydemar [1 ]
Gonzalez-Abril, Luis [2 ]
Angulo, Cecilio [3 ]
Affiliations
[1] Univ Cent Venezuela, Fac Ciencias, Escuela Comp, Paseo Ilustres Caracas 1040, Venezuela
[2] Univ Seville, Seville, Spain
[3] Tech Univ Catalonia, Barcelona, Spain
Keywords
Support Vector Machine; Post-processing; Bias; Cost-sensitive strategy; SMOTE
DOI
10.1007/s00357-017-9242-x
Chinese Library Classification number
O1 [Mathematics]
Subject classification code
0701; 070101
Abstract
Like most learning machines, Support Vector Machines (SVMs) trained on imbalanced datasets can perform poorly on the minority class, because SVMs are designed to induce a model based on the overall error. To improve their performance on this kind of problem, a low-cost post-processing strategy is proposed that calculates a new bias to adjust the function learned by the SVM. The proposed bias takes the proportional size of the classes into account in order to improve performance on the minority class. This solution avoids both introducing and tuning new parameters and modifying the standard optimization problem for SVM training. Experimental results on 34 datasets with different degrees of imbalance, evaluated with standardized error measures based on sensitivity and g-means, show that the proposed method improves classification on imbalanced datasets. Furthermore, its performance is comparable to that of well-known cost-sensitive and Synthetic Minority Over-sampling Technique (SMOTE) schemes, without adding complexity or computational cost.
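The general idea of adjusting a learned decision function after training, and of scoring the result with sensitivity and g-mean, can be illustrated with a minimal sketch. It does not reproduce the bias formula of the paper, which the abstract does not state. The decision values, the labels, the `shift` value of 0.35, and the function names below are all hypothetical and chosen only for illustration. The g-mean is taken here in its usual sense, the geometric mean of sensitivity and specificity.

```python
import math


def sensitivity_specificity(y_true, y_pred, positive=1):
    """Return (sensitivity, specificity), treating `positive` as the minority class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    sens = tp / (tp + fn) if tp + fn else 0.0
    spec = tn / (tn + fp) if tn + fp else 0.0
    return sens, spec


def g_mean(y_true, y_pred, positive=1):
    """Geometric mean of sensitivity and specificity."""
    sens, spec = sensitivity_specificity(y_true, y_pred, positive)
    return math.sqrt(sens * spec)


def predict_with_shift(scores, shift=0.0):
    """Classify as sign(f(x) + shift); a positive shift favours the +1 class.

    `shift` is a hypothetical post-hoc bias correction, NOT the paper's formula.
    """
    return [1 if s + shift >= 0 else -1 for s in scores]


# Hypothetical decision values f(x) = w.x + b from an already trained SVM.
# Three minority (+1) examples and seven majority (-1) examples.
y_true = [1, 1, 1, -1, -1, -1, -1, -1, -1, -1]
scores = [0.4, -0.2, -0.3, -0.25, -0.9, -1.1, -1.2, -1.4, -1.6, -2.0]

baseline = predict_with_shift(scores)             # original bias
shifted = predict_with_shift(scores, shift=0.35)  # illustrative new bias

for name, pred in (("baseline", baseline), ("shifted", shifted)):
    sens, spec = sensitivity_specificity(y_true, pred)
    print(name, round(sens, 3), round(spec, 3), round(g_mean(y_true, pred), 3))
```

On this toy data the shift raises sensitivity from 1/3 to 1 while specificity drops from 1 to 6/7, so the g-mean increases. Because only the threshold moves, the trained model and its optimization problem are left untouched, which is the sense in which such a correction is a post-processing step.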
Pages: 427-443
Page count: 17
Related papers
50 in total
  • [1] Improving SVM Classification on Imbalanced Datasets by Introducing a New Bias
    Haydemar Núñez
    Luis Gonzalez-Abril
    Cecilio Angulo
    Journal of Classification, 2017, 34 : 427 - 443
  • [2] Improving the Classification Quality of the SVM Classifier for the Imbalanced Datasets on the Base of Ideas the SMOTE Algorithm
    Demidova, Liliya
    Klyueva, Irina
    2017 SEMINAR ON SYSTEMS ANALYSIS, 2017, 10
  • [3] SVM CLASSIFICATION BASED ON THE IMBALANCED DATASETS FOR PROBLEMS OF PSYCHODIAGNOSTICS
    Demidova, Liliya
    Klyueva, Irina
    Pylkin, Alexander
    ICPE 2017: INTERNATIONAL CONFERENCE ON PSYCHOLOGY AND EDUCATION, 2017, 33 : 95 - 103
  • [4] Kernel-Based SMOTE for SVM Classification of Imbalanced Datasets
    Mathew, Josey
    Luo, Ming
    Pang, Chee Khiang
    Chan, Hian Leng
    IECON 2015 - 41ST ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2015, : 1127 - 1132
  • [5] Performance of SVM with Multiple Kernel Learning for Classification Tasks of Imbalanced Datasets
    Saeed, Sana
    Ong, Hong Choon
    PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY, 2019, 27 (01): : 527 - 545
  • [6] RSMOTE: improving classification performance over imbalanced medical datasets
    Naseriparsa, Mehdi
    Al-Shammari, Ahmed
    Sheng, Ming
    Zhang, Yong
    Zhou, Rui
    HEALTH INFORMATION SCIENCE AND SYSTEMS, 2020, 8 (01)
  • [7] Improving the Performance of Sentiment Classification on Imbalanced Datasets With Transfer Learning
    Xiao, Z.
    Wang, L.
    Du, J. Y.
    IEEE ACCESS, 2019, 7 : 28281 - 28290
  • [8] RSMOTE: improving classification performance over imbalanced medical datasets
    Mehdi Naseriparsa
    Ahmed Al-Shammari
    Ming Sheng
    Yong Zhang
    Rui Zhou
    Health Information Science and Systems, 8
  • [9] Improving Image Annotation in Imbalanced Classification Problems with Ranking SVM
    Fakeri-Tabrizi, Ali
    Tollari, Sabrina
    Usunier, Nicolas
    Gallinari, Patrick
    MULTILINGUAL INFORMATION ACCESS EVALUATION II: MULTIMEDIA EXPERIMENTS, PT II, 2010, 6242 : 291 - 294
  • [10] Improving SVM Classification on Imbalanced Data Sets in Distance Spaces
    Koeknar-Tezel, Suzan
    Latecki, Longin Jan
    2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 259 - +