A New Optimal Ensemble Algorithm Based on SVDD Sampling for Imbalanced Data Classification

被引:1
|
作者
Pirgazi, Jamshid [1 ]
Pirmohammadi, Abbas [2 ]
Shams, Reza [3 ]
机构
[1] Univ Sci & Technol Mazandaran, Dept Elect & Comp Engn, Behshahr, Iran
[2] Univ Zanjan, Dept Comp Engn, Zanjan, Iran
[3] Shahrood Univ Technol, Fac Informat Technol & Comp Engn, Shahrood, Iran
关键词
Support vector data description; ensemble of classifiers; imbalanced data classification;
D O I
10.1142/S0218001421500208
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, imbalanced data classification is a hot topic in data mining and recently, several valuable researches have been conducted to overcome certain difficulties in the field. Moreover, those approaches, which are based on ensemble classifiers, have achieved reasonable results. Despite the success of these works, there are still many unsolved issues such as disregarding the importance of samples in balancing, determination of proper number of classifiers and optimizing weights of base classifiers in voting stage of ensemble methods. This paper intends to find an admissible solution for these challenges. The solution suggested in this paper applies the support vector data descriptor (SVDD) for sampling both minority and majority classes. After determining the optimal number of base classifiers, the selected samples are utilized to adjust base classifiers. Finally, genetic algorithm optimization is used in order to find the optimum weights of each base classifier in the voting stage. The proposed method is compared with some existing algorithms. The results of experiments confirm its effectiveness.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Two-step ensemble under-sampling algorithm for massive imbalanced data classification
    Bai, Lin
    Ju, Tong
    Wang, Hao
    Lei, Mingzhu
    Pan, Xiaoying
    [J]. INFORMATION SCIENCES, 2024, 665
  • [2] Over-sampling algorithm for imbalanced data classification
    Xu Xiaolong
    Chen Wen
    Sun Yanfei
    [J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2019, 30 (06) : 1182 - 1191
  • [3] Over-sampling algorithm for imbalanced data classification
    XU Xiaolong
    CHEN Wen
    SUN Yanfei
    [J]. Journal of Systems Engineering and Electronics, 2019, 30 (06) : 1182 - 1191
  • [4] Evolutionary under-sampling based bagging ensemble method for imbalanced data classification
    Bo Sun
    Haiyan Chen
    Jiandong Wang
    Hua Xie
    [J]. Frontiers of Computer Science, 2018, 12 : 331 - 350
  • [5] A Novel Selective Ensemble Algorithm for Imbalanced Data Classification Based on Exploratory Undersampling
    Yin, Qing-Yan
    Zhang, Jiang-She
    Zhang, Chun-Xia
    Ji, Nan-Nan
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
  • [6] Evolutionary under-sampling based bagging ensemble method for imbalanced data classification
    Sun, Bo
    Chen, Haiyan
    Wang, Jiandong
    Xie, Hua
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2018, 12 (02) : 331 - 350
  • [7] Imbalanced data classification based on improved EIWAPSO-AdaBoost-C ensemble algorithm
    Li, Xiao
    Li, Kewen
    [J]. APPLIED INTELLIGENCE, 2022, 52 (06) : 6477 - 6502
  • [8] Imbalanced data classification based on improved EIWAPSO-AdaBoost-C ensemble algorithm
    Xiao Li
    Kewen Li
    [J]. Applied Intelligence, 2022, 52 : 6477 - 6502
  • [9] Imbalanced Data Classification Method Based on Ensemble Learning
    Xiang, Yu
    Xie, Yongping
    [J]. COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, CSPS 2018, VOL III: SYSTEMS, 2020, 517 : 18 - 24
  • [10] CLUS: A New Hybrid Sampling Classification for Imbalanced Data
    Prachuabsupakij, Wanthanee
    [J]. PROCEEDINGS OF THE 2015 12TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE), 2015, : 281 - 286