A New Optimal Ensemble Algorithm Based on SVDD Sampling for Imbalanced Data Classification

Cited by: 1
Authors
Pirgazi, Jamshid [1]
Pirmohammadi, Abbas [2]
Shams, Reza [3]
Affiliations
[1] Univ Sci & Technol Mazandaran, Dept Elect & Comp Engn, Behshahr, Iran
[2] Univ Zanjan, Dept Comp Engn, Zanjan, Iran
[3] Shahrood Univ Technol, Fac Informat Technol & Comp Engn, Shahrood, Iran
Keywords
Support vector data description; ensemble of classifiers; imbalanced data classification
DOI
10.1142/S0218001421500208
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Imbalanced data classification is an active topic in data mining, and several recent studies have addressed its difficulties. Approaches based on ensemble classifiers, in particular, have achieved reasonable results. Despite this progress, several issues remain open: the importance of individual samples is disregarded during balancing, the proper number of base classifiers is hard to determine, and the weights of the base classifiers in the voting stage are not optimized. This paper proposes a solution to these challenges. It applies support vector data description (SVDD) to sample both the minority and majority classes. After the optimal number of base classifiers is determined, the selected samples are used to train the base classifiers. Finally, a genetic algorithm is used to find the optimal weight of each base classifier in the voting stage. The proposed method is compared with several existing algorithms, and the experimental results confirm its effectiveness.
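The final step of the abstract, finding voting weights with a genetic algorithm, can be illustrated with a small sketch. The abstract does not give the paper's encoding, fitness function, or genetic operators, so everything below is an assumption made for illustration: the function names (`weighted_vote`, `balanced_accuracy`, `optimize_weights`), the use of balanced accuracy as fitness, truncation selection, uniform crossover, Gaussian mutation, and the toy predictions. The SVDD sampling step and the choice of the number of base classifiers are not covered here.

```python
import random

def weighted_vote(weights, predictions):
    """Combine 0/1 predictions of several base classifiers by weighted vote."""
    total = sum(weights)
    combined = []
    for i in range(len(predictions[0])):
        score = sum(w * p[i] for w, p in zip(weights, predictions))
        # Predict the positive class only if it holds a strict weighted majority.
        combined.append(1 if 2 * score > total else 0)
    return combined

def balanced_accuracy(y_true, y_pred):
    """Mean per-class recall; an assumed fitness, not taken from the paper."""
    recalls = []
    for cls in (0, 1):
        idx = [i for i, y in enumerate(y_true) if y == cls]
        if idx:
            recalls.append(sum(y_pred[i] == cls for i in idx) / len(idx))
    return sum(recalls) / len(recalls)

def optimize_weights(predictions, y_true, pop_size=20, generations=30, seed=0):
    """Toy genetic algorithm over weight vectors in [0, 1] (hypothetical design)."""
    rng = random.Random(seed)
    n = len(predictions)

    def fitness(w):
        return balanced_accuracy(y_true, weighted_vote(w, predictions))

    # Seed the population with uniform weights so the result is never
    # worse than plain majority voting on the validation data.
    population = [[1.0] * n]
    population += [[rng.random() for _ in range(n)] for _ in range(pop_size - 1)]

    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        parents = population[: pop_size // 2]  # keep the better half unchanged
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            child = [rng.choice(pair) for pair in zip(a, b)]  # uniform crossover
            j = rng.randrange(n)
            # Gaussian mutation of one gene, clamped to [0, 1].
            child[j] = min(1.0, max(0.0, child[j] + rng.gauss(0, 0.2)))
            children.append(child)
        population = parents + children
    return max(population, key=fitness)

# Made-up validation labels (1 = minority class) and base-classifier outputs.
y_val = [1, 1, 0, 0, 0, 0, 0, 0]
base_preds = [
    [1, 1, 0, 0, 0, 0, 0, 1],  # sensitive to the minority class, one false alarm
    [0, 0, 0, 0, 0, 0, 0, 0],  # always predicts the majority class
    [1, 0, 0, 0, 0, 0, 0, 0],  # finds only one minority sample
]
best = optimize_weights(base_preds, y_val)
print(best, balanced_accuracy(y_val, weighted_vote(best, base_preds)))
```

Because the better half of each generation is carried over unchanged and the initial population contains the uniform weight vector, the returned weights score at least as well as unweighted majority voting on the data used for fitness. In practice the fitness would be computed on held-out validation data to limit overfitting of the weights.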
Pages: 22