Novel fuzzy clustering-based undersampling framework for class imbalance problem

被引:2
|
作者
Pratap, Vibha [1 ,2 ]
Singh, Amit Prakash [1 ]
机构
[1] Guru Gobind Singh Indraprastha Univ, USICT, New Delhi, India
[2] Indira Gandhi Delhi Tech Univ Women, Delhi, India
关键词
Class imbalance; Ensemble method; Fuzzy C-mean; Machine learning; Oversampling; Under-sampling; CLASSIFICATION; PREDICTION; SMOTE;
D O I
10.1007/s13198-023-01897-1
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The class imbalance problem occurs in various real-world datasets. Although it is considered that samples of the classes of a dataset are evenly distributed, in many cases, datasets are highly imbalanced. Classification of such datasets is challenging in machine learning. Researchers have developed many approaches to solve the class imbalance problem, such as resampling and ensemble methods. In resampling methods, minority class samples are increased (oversampling), or majority class samples are reduced (under-sampling). In contrast, the ensemble methods classify various subsets of data where classification results are combined to provide the final result. The authors have introduced a new fuzzy C-mean clustering-based under-sampling method in the present study. We performed experiments using newly proposed method over 30 small-scale imbalanced datasets. The results obtained revealed that the proposed method improves the classification performance. The average sensitivity improved by 1% and the average balance accuracy improved by 3% as compared to k-means undersampling method. The results of this study would be useful in classification of imbalanced datasets of various domains.
引用
收藏
页码:967 / 976
页数:10
相关论文
共 50 条
  • [1] Novel fuzzy clustering-based undersampling framework for class imbalance problem
    Vibha Pratap
    Amit Prakash Singh
    International Journal of System Assurance Engineering and Management, 2023, 14 : 967 - 976
  • [2] Fast-CBUS: A fast clustering-based undersampling method for addressing the class imbalance problem
    Ofek, Nir
    Rokach, Lior
    Stern, Roni
    Shabtai, Asaf
    NEUROCOMPUTING, 2017, 243 : 88 - 102
  • [3] Addressing Class Imbalance in Customer Response Modeling Using Random and Clustering-Based Undersampling and SVM
    Kascelan, Ljiljana
    Vukovic, Suncica
    IPSI BGD TRANSACTIONS ON INTERNET RESEARCH, 2024, 20 (02): : 1 - 13
  • [4] Clustering-based undersampling in class-imbalanced data
    Lin, Wei-Chao
    Tsai, Chih-Fong
    Hu, Ya-Han
    Jhang, Jing-Shang
    INFORMATION SCIENCES, 2017, 409 : 17 - 26
  • [5] A Novel Clustering-Based Three Level Under-Sampling Algorithm For Class Imbalance Problem
    Pratap, Vibha
    Singh, Amit Prakash
    JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2023, 27 (04): : 2319 - 2329
  • [6] A Kernel Clustering-Based Possibilistic Fuzzy Extreme Learning Machine for Class Imbalance Learning
    Shi-Xiong Xia
    Fan-Rong Meng
    Bing Liu
    Yong Zhou
    Cognitive Computation, 2015, 7 : 74 - 85
  • [7] A Kernel Clustering-Based Possibilistic Fuzzy Extreme Learning Machine for Class Imbalance Learning
    Xia, Shi-Xiong
    Meng, Fan-Rong
    Liu, Bing
    Zhou, Yong
    COGNITIVE COMPUTATION, 2015, 7 (01) : 74 - 85
  • [8] A Novel Adaptive Undersampling Framework for Class-Imbalance Fault Detection
    Qian, Min
    Li, Yan-Fu
    IEEE TRANSACTIONS ON RELIABILITY, 2023, 72 (03) : 1003 - 1017
  • [9] A Review on Solution to Class Imbalance Problem: Undersampling Approaches
    Devi, Debashree
    Biswas, Saroj K.
    Purkayastha, Biswajit
    2020 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2020), 2020, : 626 - 631
  • [10] Clustering-Based Oversampling Algorithm for Multi-class Imbalance Learning
    Zhao, Haixia
    Wu, Jian
    JOURNAL OF CLASSIFICATION, 2025, 42 (01) : 205 - 220