A gravitational density-based mass sharing method for imbalanced data classification

被引:4
|
作者
Rahmati, Farshad [1 ]
Nezamabadi-pour, Hossein [1 ]
Nikpour, Bahareh [1 ]
机构
[1] Shahid Bahonar Univ Kerman, Dept Elect Engn, IDPL, Kerman, Iran
关键词
Classification; Class imbalance; Fixed radius search; Gravitational rule; Instance weighting; Mass sharing; NEAREST; CLASSIFIERS; ALGORITHMS; MACHINE; SMOTE;
D O I
10.1007/s42452-020-2039-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Classification is one of the most popular branches of supervised learning algorithms. In the real-world problems, there are some situations in which distributions of the two classes are not the same. This situation is known as a class imbalanced problem. In the past years, several investigations have been done to find a way of handling imbalanced data, which most of them stay in one of two groups, including internal techniques and external techniques. The proposed gravitational density-based mass sharing method (GDMS) is an internal method that is designed based on k-nearest neighbor and fixed radius nearest neighbor (FRNN) rules. GDMS is a new technique that assigns masses to instances based on their local density while considering the global information too. In the labeling phase, GDMS decides based on the sum of gravitational forces coming from the candidates set, which are defined by FRNN rule. The GDMS does not need any parameters to be set in the whole procedure of classification, which is an advantage in comparison with the previous methods. To demonstrate the effectiveness of our proposed method, we use 40 standard datasets from the KEEL repository. The results show the effectiveness and superiority of GDMS compared to the competing methods.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] A gravitational density-based mass sharing method for imbalanced data classification
    Farshad Rahmati
    Hossein Nezamabadi-pour
    Bahareh Nikpour
    [J]. SN Applied Sciences, 2020, 2
  • [2] A Density-Based Random Forest for Imbalanced Data Classification
    Dong, Jia
    Qian, Quan
    [J]. FUTURE INTERNET, 2022, 14 (03):
  • [3] CDBH: A clustering and density-based hybrid approach for imbalanced data classification
    Mirzaei, Behzad
    Nikpour, Bahareh
    Nezamabadi-pour, Hossein
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 164
  • [4] LDAS: Local density-based adaptive sampling for imbalanced data classification
    Yan, Yuanting
    Jiang, Yifei
    Zheng, Zhong
    Yu, Chengjin
    Zhang, Yiwen
    Zhang, Yanping
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 191
  • [5] Nearest neighbors and density-based undersampling for imbalanced data classification with class overlap
    Sun, Peiqi
    Du, Yanhui
    Xiong, Siyun
    [J]. NEUROCOMPUTING, 2024, 609
  • [6] Supervised Density-Based Metric Learning Based on Bhattacharya Distance for Imbalanced Data Classification Problems
    Jalali Mojahed, Atena
    Moattar, Mohammad Hossein
    Ghaffari, Hamidreza
    [J]. Big Data and Cognitive Computing, 2024, 8 (09)
  • [7] A new instance density-based synthetic minority oversampling method for imbalanced classification problems
    Ma, Chung-Kang
    Park, You-Jin
    [J]. ENGINEERING OPTIMIZATION, 2022, 54 (10) : 1743 - 1757
  • [8] Density-Based Discriminative Nonnegative Representation Model for Imbalanced Classification
    Li, Yanting
    Wang, Shuai
    Jin, Junwei
    Tao, Hongwei
    Nan, Jiaofen
    Wu, Huaiguang
    Chen, C. L. Philip
    [J]. NEURAL PROCESSING LETTERS, 2024, 56 (02)
  • [9] Density-Based Discriminative Nonnegative Representation Model for Imbalanced Classification
    Yanting Li
    Shuai Wang
    Junwei Jin
    Hongwei Tao
    Jiaofen Nan
    Huaiguang Wu
    C. L. Philip Chen
    [J]. Neural Processing Letters, 56
  • [10] Two density-based sampling approaches for imbalanced and overlapping data
    Mayabadi, Sima
    Saadatfar, Hamid
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 241