Boosting support vector machines for imbalanced data sets

被引:0
|
作者
Wang, Benjamin X. [1 ]
Japkowicz, Nathalie [1 ]
机构
[1] Univ Ottawa, Sch Informat Technol & Engn, Ottawa, ON K1N 6N5, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real world data mining applications must address the issue of learning from imbalanced data sets. The problem occurs when the number of instances in one class greatly outnumbers the number of instances in the other class. Such data sets often cause a default classifier to be built due to skewed vector spaces or lack of information. Common approaches for dealing with the class imbalance problem involve modifying the data distribution or modifying the classifier. In this work, we choose to use a combination of both approaches. We use support vector machines with soft margins as the base classifier to solve the skewed vector spaces problem. Then we use a boosting algorithm to get an ensemble classifier that has lower error than a single classifier. We found that this ensemble of SVMs makes an impressive improvement in prediction performance, not only for the majority class, but also for the minority class.
引用
收藏
页码:38 / 47
页数:10
相关论文
共 50 条
  • [11] Robust twin bounded support vector machines for outliers and imbalanced data
    Parashjyoti Borah
    Deepak Gupta
    Applied Intelligence, 2021, 51 : 5314 - 5343
  • [12] Imbalanced data classification via support vector machines and genetic algorithms
    Cervantes, Jair
    Li, Xiaoou
    Yu, Wen
    CONNECTION SCIENCE, 2014, 26 (04) : 335 - 348
  • [13] Robust twin bounded support vector machines for outliers and imbalanced data
    Borah, Parashjyoti
    Gupta, Deepak
    APPLIED INTELLIGENCE, 2021, 51 (08) : 5314 - 5343
  • [14] Boosting Support Vector Machines Successfully
    Ting, Kai Ming
    Zhu, Lian
    MULTIPLE CLASSIFIER SYSTEMS, PROCEEDINGS, 2009, 5519 : 509 - 518
  • [15] Balance method for imbalanced support vector machines
    Department of Applied Mathematics, Xidian University, Xi'an 710071, China
    不详
    不详
    Moshi Shibie yu Rengong Zhineng, 2008, 2 (136-141):
  • [16] Applying support vector machines to imbalanced datasets
    Akbani, R
    Kwek, S
    Japkowicz, N
    MACHINE LEARNING: ECML 2004, PROCEEDINGS, 2004, 3201 : 39 - 50
  • [17] Training Support Vector Machines on Large Sets of Image Data
    Kukenys, Ignas
    McCane, Brendan
    Neumegen, Tim
    COMPUTER VISION - ACCV 2009, PT III, 2010, 5996 : 331 - 340
  • [18] BALANCED VS IMBALANCED TRAINING DATA: CLASSIFYING RAPIDEYE DATA WITH SUPPORT VECTOR MACHINES
    Ustuner, M.
    Sanli, F. B.
    Abdikan, S.
    XXIII ISPRS CONGRESS, COMMISSION VII, 2016, 41 (B7): : 379 - 384
  • [19] Neural decoding on imbalanced calcium imaging data with a network of support vector machines
    Lee, Kyunghun
    Wu, Xiaomin
    Lee, Yaesop
    Lin, Da-Ting
    Bhattacharyya, Shuvra S.
    Chen, Rong
    ADVANCED ROBOTICS, 2021, 35 (07) : 459 - 470
  • [20] Active Learning Support Vector Machines to Classify Imbalanced Reservoir Simulation Data
    Yu, Tina
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,