Oversampling technique based on fuzzy representativeness difference for classifying imbalanced data

被引:0
|
作者
Ruonan Ren
Youlong Yang
Liqin Sun
机构
[1] Xidian University,School of Mathematics and Statistics
来源
Applied Intelligence | 2020年 / 50卷
关键词
Class imbalanced problem; Oversampling technique; Affinity propagation; Chromosome theory of inheritance; Fuzzy representativeness difference;
D O I
暂无
中图分类号
学科分类号
摘要
Class imbalance problem poses a difficulty to learning algorithms in pattern classification. Oversampling techniques is one of the most widely used techniques to solve these problems, but the majority of them use the sample size ratio as an imbalanced standard. This paper proposes a fuzzy representativeness difference-based oversampling technique, using affinity propagation and the chromosome theory of inheritance (FRDOAC). The fuzzy representativeness difference (FRD) is adopted as a new imbalance metric, which focuses on the importance of samples rather than the number. FRDOAC firstly finds the representative samples of each class according to affinity propagation. Secondly, fuzzy representativeness of every sample is calculated by the Mahalanobis distance. Finally, synthetic positive samples are generated by the chromosome theory of inheritance until the fuzzy representativeness difference of two classes is small. A thorough experimental study on 16 benchmark datasets was performed and the results show that our method is better than other advanced imbalanced classification algorithms in terms of various evaluation metrics.
引用
收藏
页码:2465 / 2487
页数:22
相关论文
共 50 条
  • [1] Oversampling technique based on fuzzy representativeness difference for classifying imbalanced data
    Ren, Ruonan
    Yang, Youlong
    Sun, Liqin
    APPLIED INTELLIGENCE, 2020, 50 (08) : 2465 - 2487
  • [2] Hybrid Oversampling Technique Based on Star Topology and Rejection Methodology for Classifying Imbalanced Data
    Lee, Chaekyu
    Kim, Jaekwang
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW, 2022, : 1217 - 1226
  • [3] Entropy difference and kernel-based oversampling technique for imbalanced data learning
    Wu, Xu
    Yang, Youlong
    Ren, Lingyu
    INTELLIGENT DATA ANALYSIS, 2020, 24 (06) : 1239 - 1255
  • [4] Fuzzy rule-based oversampling technique for imbalanced and incomplete data learning
    Liu, Gencheng
    Yang, Youlong
    Li, Benchong
    KNOWLEDGE-BASED SYSTEMS, 2018, 158 : 154 - 174
  • [5] An Improved Oversampling Algorithm Based on the Samples' Selection Strategy for Classifying Imbalanced Data
    Xie, Wenhao
    Liang, Gongqian
    Dong, Zhonghui
    Tan, Baoyu
    Zhang, Baosheng
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2019, 2019
  • [6] Anomaly detection and oversampling approach for classifying imbalanced data using CLUBS technique in IoT healthcare data
    Subha, S.
    Sathiaseelan, J. G. R.
    INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2023, 11 (03) : 255 - 271
  • [7] Distance-based arranging oversampling technique for imbalanced data
    Dai, Qi
    Liu, Jian-wei
    Zhao, Jia-Liang
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (02): : 1323 - 1342
  • [8] Distance-based arranging oversampling technique for imbalanced data
    Qi Dai
    Jian-wei Liu
    Jia-Liang Zhao
    Neural Computing and Applications, 2023, 35 : 1323 - 1342
  • [9] Multiple Kernel Learning With Minority Oversampling for Classifying Imbalanced Data
    Wang, Ling
    Wang, Hongqiao
    Fu, Guangyuan
    IEEE ACCESS, 2021, 9 : 565 - 580
  • [10] A No Parameter Synthetic Minority Oversampling Technique Based on Finch for Imbalanced Data
    Xu, Shoukun
    Li, Zhibang
    Yuan, Baohua
    Yang, Gaochao
    Wang, Xueyuan
    Li, Ning
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT IV, 2023, 14089 : 367 - 378