Evolutionary Mahalanobis Distance-Based Oversampling for Multi-Class Imbalanced Data Classification

Cited by: 9
Authors
Yao, Leehter [1]
Lin, Tung-Bin [1]
Affiliations
[1] National Taipei University of Technology, Department of Electrical Engineering, Taipei 10618, Taiwan
Keywords
oversampling; Mahalanobis distance; MOPSO; classification; minority class; ellipsoid; FAULT-DIAGNOSIS; DATA-SETS; REGRESSION; PREDICTION; SMOTEBOOST; SVMS; TOOL
DOI
10.3390/s21196616
Chinese Library Classification number
O65 [Analytical Chemistry]
Subject classification code
070302; 081704
Abstract
Sensing data are often imbalanced across classes, and oversampling the minority class is an effective remedy. In this paper, an oversampling method called evolutionary Mahalanobis distance oversampling (EMDO) is proposed for multi-class imbalanced data classification. EMDO uses a set of ellipsoids to approximate the decision regions of the minority class. Multi-objective particle swarm optimization (MOPSO) is integrated with the Gustafson-Kessel algorithm in EMDO to learn the size, center, and orientation of every ellipsoid. Synthetic minority samples are generated within every ellipsoid based on the Mahalanobis distance, and the number of synthetic samples generated in each ellipsoid is determined by the density of minority samples in that ellipsoid. The results of the computer simulations indicate that EMDO outperforms most of the widely used oversampling schemes.
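To make the idea of generating samples "within an ellipsoid based on Mahalanobis distance" concrete, the following is a minimal sketch and not the authors' EMDO implementation. It assumes a single ellipsoid whose center and shape are taken from the sample mean and covariance of the minority points; the abstract instead learns these with MOPSO and the Gustafson-Kessel algorithm, which is not reproduced here. The function names `sample_in_ellipsoid` and `mahalanobis`, the `radius` parameter, and the toy data are all hypothetical. The number of samples per ellipsoid is passed in directly, because the abstract does not state the exact density rule.

```python
import numpy as np


def sample_in_ellipsoid(center, cov, n_samples, radius=1.0, rng=None):
    """Draw points whose Mahalanobis distance to `center` is at most `radius`.

    Assumption: points are drawn uniformly inside the ellipsoid; the
    abstract does not specify the sampling distribution.
    """
    rng = np.random.default_rng(rng)
    center = np.asarray(center, dtype=float)
    dim = center.shape[0]
    # Uniform random directions on the unit sphere.
    directions = rng.standard_normal((n_samples, dim))
    directions /= np.linalg.norm(directions, axis=1, keepdims=True)
    # Radii scaled so that points are uniform inside a ball of size `radius`.
    radii = radius * rng.random(n_samples) ** (1.0 / dim)
    ball = directions * radii[:, None]
    # Map the ball onto the ellipsoid: x = center + L z, where cov = L L^T.
    chol = np.linalg.cholesky(cov)
    return center + ball @ chol.T


def mahalanobis(points, center, cov):
    """Mahalanobis distance of each row of `points` from `center`."""
    diff = np.asarray(points, dtype=float) - np.asarray(center, dtype=float)
    inv_cov = np.linalg.inv(cov)
    return np.sqrt(np.einsum("ij,jk,ik->i", diff, inv_cov, diff))


# Toy minority class (hypothetical data, for illustration only).
data_rng = np.random.default_rng(0)
minority = data_rng.multivariate_normal(
    [0.0, 0.0], [[2.0, 0.8], [0.8, 1.0]], size=30
)

# Stand-in for the learned ellipsoid: sample mean and covariance.
center = minority.mean(axis=0)
cov = np.cov(minority, rowvar=False)

synthetic = sample_in_ellipsoid(center, cov, n_samples=50, radius=2.0, rng=1)
distances = mahalanobis(synthetic, center, cov)
```

Every synthetic point satisfies the Mahalanobis distance bound by construction, since the Cholesky factor maps the ball of size `radius` exactly onto the ellipsoid. With several ellipsoids, the same function would be called once per ellipsoid with its own center, covariance, and sample count.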
Pages: 19
Related papers
50 in total
  • [1] Multi-class Imbalanced Data Oversampling for Vertebral Column Pathologies Classification
    Saez, Jose A.
    Quintian, Hector
    Krawczyk, Bartosz
    Wozniak, Michal
    Corchado, Emilio
    [J]. HYBRID ARTIFICIAL INTELLIGENT SYSTEMS (HAIS 2018), 2018, 10870 : 131 - 142
  • [2] Adversarial oversampling for multi-class imbalanced data classification with convolutional neural networks
    Wojciechowski, Adam
    Lango, Mateusz
    [J]. FOURTH INTERNATIONAL WORKSHOP ON LEARNING WITH IMBALANCED DOMAINS: THEORY AND APPLICATIONS, VOL 183, 2022, 183 : 98 - 111
  • [3] Distance-based arranging oversampling technique for imbalanced data
    Dai, Qi
    Liu, Jian-wei
    Zhao, Jia-Liang
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (02): : 1323 - 1342
  • [4] Distance-based arranging oversampling technique for imbalanced data
    Qi Dai
    Jian-wei Liu
    Jia-Liang Zhao
    [J]. Neural Computing and Applications, 2023, 35 : 1323 - 1342
  • [5] An oversampling method for multi-class imbalanced data based on composite weights
    Deng, Mingyang
    Guo, Yingshi
    Wang, Chang
    Wu, Fuwei
    [J]. PLOS ONE, 2021, 16 (11):
  • [6] Global-local information based oversampling for multi-class imbalanced data
    Han, Mingming
    Guo, Husheng
    Li, Jinyan
    Wang, Wenjian
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (06) : 2071 - 2086
  • [7] Global-local information based oversampling for multi-class imbalanced data
    Mingming Han
    Husheng Guo
    Jinyan Li
    Wenjian Wang
    [J]. International Journal of Machine Learning and Cybernetics, 2023, 14 : 2071 - 2086
  • [8] Support Vector Machines: A Distance-Based Approach to Multi-Class Classification
    Aoudi, Wissam
    Barbar, Aziz M.
    [J]. 2016 IEEE INTERNATIONAL MULTIDISCIPLINARY CONFERENCE ON ENGINEERING TECHNOLOGY (IMCET), 2016, : 75 - 80
  • [9] A New SVM Decision Tree Multi-class Classification Algorithm Based on Mahalanobis Distance
    Diao Zhihua
    Wu Yuanyuan
    [J]. 2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011, : 3124 - 3127
  • [10] A survey of multi-class imbalanced data classification methods
    Han, Meng
    Li, Ang
    Gao, Zhihui
    Mu, Dongliang
    Liu, Shujuan
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (02) : 2471 - 2501