A New Optimal Ensemble Algorithm Based on SVDD Sampling for Imbalanced Data Classification

Authors
Pirgazi, Jamshid [1]
Pirmohammadi, Abbas [2]
Shams, Reza [3]
Affiliations
[1] Univ Sci & Technol Mazandaran, Dept Elect & Comp Engn, Behshahr, Iran
[2] Univ Zanjan, Dept Comp Engn, Zanjan, Iran
[3] Shahrood Univ Technol, Fac Informat Technol & Comp Engn, Shahrood, Iran
Keywords
Support vector data description; ensemble of classifiers; imbalanced data classification
DOI
10.1142/S0218001421500208
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Imbalanced data classification is an active topic in data mining, and several recent studies have addressed particular difficulties in the field. Approaches based on ensemble classifiers, in particular, have achieved reasonable results. Despite this success, many issues remain unsolved, such as disregarding the importance of individual samples during balancing, determining the proper number of classifiers, and optimizing the weights of base classifiers in the voting stage of ensemble methods. This paper proposes a solution to these challenges. The proposed method applies support vector data description (SVDD) to sample both the minority and majority classes. After the optimal number of base classifiers is determined, the selected samples are used to adjust the base classifiers. Finally, a genetic algorithm is used to find the optimal weight of each base classifier in the voting stage. The proposed method is compared with several existing algorithms, and the experimental results confirm its effectiveness.
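The final step of the abstract, tuning per-classifier voting weights with a genetic algorithm, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the use of the geometric mean of per-class recall as the fitness, the one-point crossover, the Gaussian mutation, and the population settings are all assumptions chosen for illustration, and the base classifiers are represented only by their 0/1 predictions on a validation set.

```python
import random


def weighted_vote(predictions, weights):
    """Fuse 0/1 predictions; label 1 wins when it holds more than half the total weight."""
    total = sum(weights)
    fused = []
    for i in range(len(predictions[0])):
        score = sum(w * p[i] for w, p in zip(weights, predictions))
        fused.append(1 if 2 * score > total else 0)
    return fused


def g_mean(y_true, y_pred):
    """Geometric mean of minority (1) and majority (0) recall (assumed fitness)."""
    pos = sum(y_true)
    neg = len(y_true) - pos
    if pos == 0 or neg == 0:
        return 0.0
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    return ((tp / pos) * (tn / neg)) ** 0.5


def evolve_weights(predictions, y_true, pop_size=20, generations=30, seed=0):
    """Hypothetical GA: keep the better half, refill with crossover plus mutation."""
    rng = random.Random(seed)
    k = len(predictions)

    def fitness(w):
        return g_mean(y_true, weighted_vote(predictions, w))

    population = [[rng.random() for _ in range(k)] for _ in range(pop_size)]
    # Seed with uniform weights so the result is never worse than a plain majority vote.
    population.append([1.0] * k)
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        parents = population[: pop_size // 2]  # elitism: survivors carry over unchanged
        children = []
        while len(parents) + len(children) < len(population):
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, k) if k > 1 else 0  # one-point crossover
            child = a[:cut] + b[cut:]
            j = rng.randrange(k)  # mutate one weight, clipped to [0, 1]
            child[j] = min(1.0, max(0.0, child[j] + rng.gauss(0, 0.1)))
            children.append(child)
        population = parents + children
    return max(population, key=fitness)


# Toy validation set: labels and the predictions of three hypothetical base classifiers.
y_true = [1, 1, 0, 0, 0, 0]
preds = [
    [1, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 1],
    [0, 1, 1, 0, 0, 0],
]
best = evolve_weights(preds, y_true)
uniform_score = g_mean(y_true, weighted_vote(preds, [1.0, 1.0, 1.0]))
best_score = g_mean(y_true, weighted_vote(preds, best))
```

Because the better half of each generation survives unchanged and the uniform weight vector is in the initial population, the evolved weights score at least as well on the validation set as an unweighted majority vote. In practice the fitness would be evaluated on held-out data to avoid fitting the weights to the training samples.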
Pages: 22
Related papers
50 in total
  • [31] An online ensemble classification algorithm for multi-class imbalanced data stream
    Han, Meng
    Li, Chunpeng
    Meng, Fanxing
    He, Feifei
    Zhang, Ruihua
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (11) : 6845 - 6880
  • [32] Imbalanced Data Classification Algorithm Based on Clustering and SVM
    Huang, Bo
    Zhu, Yimin
    Wang, Zhongzhen
    Fang, Zhijun
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2021, 30 (02)
  • [33] A GEV-Based Classification Algorithm for Imbalanced Data
    Fu J.
    Liu G.
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2018, 55 (11) : 2361 - 2371
  • [34] Leveraging ensemble pruning for imbalanced data classification
    Krawczyk, Bartosz
    Wozniak, Michal
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 439 - 444
  • [35] An Improved FCM Algorithm Based on the SVDD for Unsupervised Hyperspectral Data Classification
    Niazmardi, Saeid
    Homayouni, Saeid
    Safari, Abdolreza
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2013, 6 (02) : 831 - 839
  • [36] An Improved Ensemble Learning for Imbalanced Data Classification
    Yuan, Zhengwu
    Zhao, Pu
    [J]. PROCEEDINGS OF 2019 IEEE 8TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC 2019), 2019, : 408 - 411
  • [37] A synthetic neighborhood generation based ensemble learning for the imbalanced data classification
    Chen, Zhi
    Lin, Tao
    Xia, Xin
    Xu, Hongyan
    Ding, Sha
    [J]. APPLIED INTELLIGENCE, 2018, 48 (08) : 2441 - 2457
  • [39] An Effective Sampling Strategy for Ensemble Learning with Imbalanced Data
    Zhang, Chen
    Zhang, Xiaolong
    [J]. INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2017, PT III, 2017, 10363 : 377 - 388
  • [40] A Genetic-Based Ensemble Learning Applied to Imbalanced Data Classification
    Klikowski, Jakub
    Ksieniewicz, Pawel
    Wozniak, Michal
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING (IDEAL 2019), PT II, 2019, 11872 : 340 - 352