Solving imbalanced classification problems with support vector machines

被引:0
|
作者
Lessmann, S [1 ]
机构
[1] Univ Hamburg, Inst Business Informat Syst, Hamburg, Germany
关键词
support vector machine; sampling; imbalanced classification; data mining;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Support Vector Machine (SVM) is a powerful learning mechanism and promising results have been obtained in the field of medical diagnostics and text-categorization. However, successful applications to business oriented classification problems are still limited. Most real world data sets exhibit vast class imbalances and an accurate identification of the economical relevant minority class is a major challenge within this domain. Based upon an empirical experiment, we evaluate the adequacy of SVMs to identify the respondents of a mailing campaign, massively underrepresented in our data set finding SVM to be capable of handling class imbalances in an internal manner providing robust and competitive results when compared to re-sampling methods which are commonly used to account for class imbalances. Consequently, the overall process of data pre-processing is simplified when applying a SVM classifier leading to less time consuming and more cost-efficient analysis.
引用
收藏
页码:214 / 220
页数:7
相关论文
共 50 条
  • [1] Solving Nonstationary Classification Problems with Coupled Support Vector Machines
    Grinblat, Guillermo L.
    Uzal, Lucas C.
    Alejandro Ceccatto, H.
    Granitto, Pablo M.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (01): : 37 - 51
  • [2] Handling Imbalanced Classification Problems With Support Vector Machines via Evolutionary Bilevel Optimization
    Rosales-Perez, Alejandro
    Garcia, Salvador
    Herrera, Francisco
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (08) : 4735 - 4747
  • [3] Locally Linear Support Vector Machines for Imbalanced Data Classification
    Krawczyk, Bartosz
    Cano, Alberto
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT I, 2021, 12712 : 616 - 628
  • [4] Krein twin support vector machines for imbalanced data classification
    Jimenez-Castano, C.
    Alvarez-Meza, A.
    Cardenas-Pena, D.
    Orozco-Gutierrez, A.
    Guerrero-Erazo, J.
    [J]. PATTERN RECOGNITION LETTERS, 2024, 182 : 39 - 45
  • [5] Classification of Imbalanced Data by Oversampling in Kernel Space of Support Vector Machines
    Mathew, Josey
    Pang, Chee Khiang
    Luo, Ming
    Leong, Weng Hoe
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (09) : 4065 - 4076
  • [6] Research on classification technique for imbalanced dataset based on support vector machines
    Yang, Zhiming
    Peng, Yu
    Peng, Xiyuan
    [J]. Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2009, 30 (05): : 1094 - 1099
  • [7] Imbalanced data classification via support vector machines and genetic algorithms
    Cervantes, Jair
    Li, Xiaoou
    Yu, Wen
    [J]. CONNECTION SCIENCE, 2014, 26 (04) : 335 - 348
  • [8] Performance of the Support Vector Machines for Medical Classification Problems
    Cwiklinska-Jurkowska, Malgorzata
    [J]. BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2009, 29 (04) : 63 - 81
  • [9] An Genetic Approach to Support Vector Machines in classification problems
    Padilha, Carlos Alberto de A.
    Lima, Naiyan Hari C.
    Doria Neto, Adriao Duarte
    de Melo, Jorge Dantas
    [J]. 2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [10] SUPPORT VECTOR MACHINES APPLIED TO BINARY CLASSIFICATION PROBLEMS
    Hoyo, Alexander
    [J]. CISCI 2007: 6TA CONFERENCIA IBEROAMERICANA EN SISTEMAS, CIBERNETICA E INFORMATICA, MEMORIAS, VOL I, 2007, : 49 - 54